Discover and explore top open-source AI tools and projects—updated daily.
majcheradamPDF to structured data extraction API
New!
Top 52.5% on SourcePulse
OCRBase provides a self-hostable API for converting PDF documents into structured data (Markdown/JSON) using advanced OCR and LLM techniques. It targets developers and power users needing to process large volumes of documents, offering a scalable, real-time solution with a type-safe TypeScript SDK for seamless integration.
How It Works
This project leverages PaddleOCR-VL-0.9B for high-accuracy text extraction from PDFs, followed by LLM-powered parsing to structure the extracted text according to user-defined schemas. Its architecture is built for scale, employing a queue-based processing system and providing real-time job progress updates via WebSockets, all accessible through a comprehensive, type-safe TypeScript SDK.
Quick Start & Requirements
bun add ocrbaseHighlighted Details
Maintenance & Community
No specific details regarding contributors, sponsorships, or community channels (e.g., Discord, Slack) were present in the provided README snippet.
Licensing & Compatibility
The project is released under the MIT License, which is generally permissive for commercial use and integration into closed-source projects.
Limitations & Caveats
Self-hosting requires familiarity with Docker and the Bun runtime. The primary SDK is TypeScript-focused, which may present a learning curve or integration challenge for teams not using that ecosystem.
1 day ago
Inactive
getomni-ai