OCR framework for building custom pipelines
Top 93.3% on sourcepulse
MyOCR is an advanced OCR pipeline builder designed for engineers and researchers to create and integrate custom OCR systems. It offers a modular and extensible framework for end-to-end OCR development, enabling flexible training, integration of deep learning models, and production-ready deployment.
How It Works
MyOCR provides a unified pipeline for detection and recognition, allowing users to mix and match components like models and processors. It leverages ONNX runtime for efficient CPU/GPU inference and supports structured OCR output through integration with large language models like Qwen for data extraction.
Quick Start & Requirements
pip install -e .
(after cloning the repo)Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The structured output pipeline requires configuration for LLM APIs (Ollama, OpenAI) and specific model setups. The README mentions a UI (doc-insight-ui) but does not provide a direct link.
5 days ago
Inactive