OCR tool combining multiple engines with LLM for improved text detection
Top 57.4% on sourcepulse
This project provides a robust solution for improving Optical Character Recognition (OCR) accuracy, particularly for non-English languages or noisy inputs. It targets developers and researchers needing enhanced text extraction by intelligently combining multiple OCR engines with Large Language Models (LLMs) for correction and reconstruction.
How It Works
BetterOCR leverages a multi-engine approach, integrating EasyOCR, Tesseract, and Pororo (for Korean/English). It then utilizes OpenAI's chat models to refine and correct the combined OCR outputs. An optional custom context feature allows users to provide specific keywords or product names, significantly improving accuracy for specialized terminology and reducing errors.
Quick Start & Requirements
pip install betterocr
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The package is under rapid development, with features like async support and an improved interface noted as "coming soon." Performance may vary based on OCR engine updates and OpenAI API availability.
1 month ago
1 week