

Pdf Extract Api
Overview :
The pdf-extract-api is an API that utilizes modern OCR technology and Ollama-supported models to convert any document or image into structured JSON or Markdown text. It is built using FastAPI and employs Celery for asynchronous task processing, with Redis used for caching OCR results. The API has no reliance on cloud services or external dependencies, ensuring that all processing is completed in a local development or server environment, thereby safeguarding data security. It supports high-precision conversion from PDF to Markdown, including tabular data, numerical or mathematical formulas, and can convert PDFs to JSON using Ollama-supported models. Additionally, the API supports LLM-enhanced OCR results, removing personally identifiable information (PII) from PDFs, as well as distributed queue processing and caching.
Target Users :
The target audience includes developers and businesses in need of high-precision document conversion services, particularly those with stringent data privacy and security requirements. This API is ideal for scenarios that require converting a large volume of documents into structured data, such as legal documents, medical reports, and financial invoices.
Use Cases
Convert MRI reports to Markdown and JSON
Convert invoices to JSON while removing PII
Use different OCR strategies for PDF to Markdown conversion
Features
High-precision conversion from PDF to Markdown and JSON
Local processing using PyTorch-based Marker OCR and Ollama models
LLM for improved OCR text results
Removal of personally identifiable information (PII) from PDFs
Distributed queue processing using Celery
Utilization of Redis to cache OCR results
Command-line tool for task submission and result processing
How to Use
1. Clone the repository to your local machine
2. Set environment variables and create a .env file
3. Use Docker Compose to build and run the Docker container
4. Use the CLI tool to upload files for OCR conversion
5. Retrieve the OCR results
6. Clear the OCR cache
Featured AI Tools

Pseudoeditor
PseudoEditor is a free online pseudocode editor. It features syntax highlighting and auto-completion, making it easier for you to write pseudocode. You can also use our pseudocode compiler feature to test your code. No download is required, start using it immediately.
Development & Tools
3.8M

Coze
Coze is a next-generation AI chatbot building platform that enables the rapid creation, debugging, and optimization of AI chatbot applications. Users can quickly build bots without writing code and deploy them across multiple platforms. Coze also offers a rich set of plugins that can extend the capabilities of bots, allowing them to interact with data, turn ideas into bot skills, equip bots with long-term memory, and enable bots to initiate conversations.
Development & Tools
3.8M