API Consumer - Judicial Data Extraction

Jun 1, 2025 · 1 min read

Rama Judicial API Consumer is an ETL (Extract, Transform, Load) system that consumes the Colombian Judicial Branch’s public REST API to extract judicial process information, including PDF documents, and transform it into structured text format for analysis.

Key Features

  • REST API Consumption: Integration with the Judicial Branch’s public API
  • PDF Extraction: Automatic download and parsing of procedural documents
  • Rate Limiting: Request throttling to respect API limits
  • Text Transformation: Conversion of PDFs to structured plain text
  • Error Handling: Automatic retries and failure logging
  • Caching: Local storage of responses to avoid redundant requests

Technologies Used

  • Language: TypeScript
  • Runtime: Node.js
  • HTTP Client: axios
  • Rate Limiting: p-limit
  • PDF Parsing: pdf-parse
  • Output: Structured TXT files

Impact

This system enables automated extraction of judicial data for statistical analysis and data mining, facilitating research on procedural patterns and report generation based on public information from the Judicial Branch.

Developed as a data pipeline to feed statistical analyses and intelligence tools on Colombian judicial processes, demonstrating integration capabilities with government APIs and document processing.