Discover and explore top open-source AI tools and projects—updated daily.
jingsongliujingFast, framework-free OCR powered by ONNXRuntime
Top 23.4% on SourcePulse
A high-performance, multilingual OCR system optimized for fast inference by decoupling from deep learning training frameworks and leveraging ONNXRuntime. It targets engineers and researchers needing deployable OCR solutions, offering cross-architecture compatibility and advanced integration capabilities.
How It Works
The project converts PaddleOCR models to ONNX format, enabling unified inference across x86 and ARM architectures via ONNXRuntime. A core inference_engine.py manages ONNX sessions. This approach facilitates deployment-readiness and allows integration of features like multilingual recognition, layout analysis, and information extraction with local LLMs.
Quick Start & Requirements
pip install -r requirements.txt (using the specified index URL for China-based users).python>=3.8.python scripts/download_models.py, which supports ModelScope (default) and HuggingFace sources.test_ocr.py, examples/) serve as functional demonstrations.Highlighted Details
app-service.py) and web UI (webui.py) services, with Docker support for streamlined deployment.Maintenance & Community
The project acknowledges contributions from PaddleOCR and RapidAI communities. Issues and Pull Requests are welcomed, but specific community channels or roadmaps are not detailed.
Licensing & Compatibility
The repository's license is not explicitly stated in the README, which is a critical omission for assessing commercial use or derivative works.
Limitations & Caveats
Specialized OCR features require downloading large, optional model files post-installation. The absence of a clear license is a significant adoption blocker.
1 day ago
Inactive