Offline OCR SDK for general verification codes
Top 4.1% on sourcepulse
DdddOcr is an open-source, offline SDK for general-purpose CAPTCHA recognition, developed collaboratively by sml2h3 and kerlomz. It aims to provide a user-friendly experience with minimal dependencies, leveraging deep neural networks trained on large datasets of randomly generated data. The library is suitable for developers and researchers needing to automate CAPTCHA solving tasks, offering features beyond basic OCR.
How It Works
DdddOcr utilizes ONNX Runtime for inference, with models trained using PyTorch via the dddd_trainer framework. It offers two built-in OCR models, selectable during initialization. The library also includes capabilities for image color filtering using HSV color spaces, object detection to identify potential target locations (bounding boxes), and slide puzzle matching using OpenCV algorithms.
Quick Start & Requirements
pip install ddddocr
pip install ddddocr[api]
use_gpu=True
.Highlighted Details
Maintenance & Community
The project is actively maintained by sml2h3. Community interaction and issue reporting are encouraged via GitHub Issues.
Licensing & Compatibility
Limitations & Caveats
Slide detection algorithms may not be robust for all screenshotting methods. The README notes that recognition effectiveness can be "mystical" and may vary. Users are advised against re-initializing the DdddOcr
instance for every recognition task, as initializations are slow.
1 month ago
Inactive