bytefer: Powerful macOS command-line OCR tool
A powerful command-line OCR tool for macOS, macos-vision-ocr leverages Apple's native Vision framework to provide efficient, on-device text recognition. It targets developers and power users needing to integrate OCR capabilities into macOS workflows or applications, offering detailed positional data and supporting batch processing for high-throughput tasks.
How It Works
This tool uses Apple's Vision framework, a high-performance image-analysis framework built into macOS. Because images are processed directly on the device, it avoids external dependencies and the potential latency of cloud-based OCR services. Each recognized text element is returned with its bounding box coordinates and a confidence score, output as structured JSON.
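As a rough illustration of consuming this kind of output, the Python sketch below filters recognized elements by confidence and converts their bounding boxes to pixel coordinates. The JSON shape (`observations`, `text`, `confidence`, and a `box` with `x`, `y`, `width`, `height`) is a hypothetical schema chosen for illustration, not the tool's documented format. It also assumes boxes follow the Vision framework's convention of normalized coordinates with the origin at the lower-left corner of the image.

```python
import json


def to_pixel_box(box, image_width, image_height):
    """Convert a normalized, lower-left-origin box to a top-left-origin pixel box."""
    x = box["x"] * image_width
    width = box["width"] * image_width
    height = box["height"] * image_height
    # Flip the y axis: Vision measures y from the bottom edge of the image.
    y = (1.0 - box["y"] - box["height"]) * image_height
    return {"x": round(x), "y": round(y), "width": round(width), "height": round(height)}


def confident_texts(result, image_width, image_height, min_confidence=0.5):
    """Return (text, pixel_box) pairs whose confidence meets the threshold."""
    return [
        (obs["text"], to_pixel_box(obs["box"], image_width, image_height))
        for obs in result["observations"]
        if obs["confidence"] >= min_confidence
    ]


# Hypothetical sample output; the field names are assumptions for illustration.
SAMPLE_JSON = """
{
  "observations": [
    {"text": "Hello", "confidence": 0.98,
     "box": {"x": 0.1, "y": 0.8, "width": 0.3, "height": 0.1}},
    {"text": "smudge", "confidence": 0.21,
     "box": {"x": 0.5, "y": 0.1, "width": 0.2, "height": 0.05}}
  ]
}
"""

if __name__ == "__main__":
    result = json.loads(SAMPLE_JSON)
    for text, box in confident_texts(result, image_width=1000, image_height=500):
        print(text, box)
```

Rounding to whole pixels keeps the result stable against floating-point noise in the normalized values. The 0.5 confidence threshold is an arbitrary default and would need tuning for real images.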
Quick Start & Requirements
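Build for arm64 (Apple Silicon): swift build -c release --arch arm64
Build for x86_64 (Intel): swift build -c release --arch x86_64
Run on a single image: ./macos-vision-ocr --img <path>
Run on a directory of images: ./macos-vision-ocr --img-dir <path> --output-dir <path>
Highlighted Details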
Maintenance & Community
The provided README does not detail specific contributors, sponsorships, or community channels (e.g., Discord, Slack).
Licensing & Compatibility
Limitations & Caveats
This tool runs only on macOS and must be built from source. Common issues include incorrect image paths, unsupported image formats, insufficient file permissions, and images with unclear text, very small text (less than 1% of image height), or unsupported languages.
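Several of these failure modes can be checked before invoking the tool. The sketch below is a hypothetical pre-flight helper in Python, not part of macos-vision-ocr. The extension set is an assumed example, since the supported formats are not enumerated here, and the 1% threshold is taken from the caveat above.

```python
import os

# Assumed example set; the tool's actual supported formats are not listed here.
ASSUMED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".tiff", ".heic"}
MIN_TEXT_HEIGHT_RATIO = 0.01  # "less than 1% of image height"


def preflight_problems(path, allowed_extensions=ASSUMED_EXTENSIONS):
    """Return a list of human-readable problems found for an image path."""
    problems = []
    if not os.path.isfile(path):
        problems.append("path does not exist or is not a file")
    elif not os.access(path, os.R_OK):
        problems.append("file is not readable (check permissions)")
    ext = os.path.splitext(path)[1].lower()
    if ext not in allowed_extensions:
        problems.append(f"extension {ext!r} is not in the assumed supported set")
    return problems


def is_text_too_small(text_height_px, image_height_px, min_ratio=MIN_TEXT_HEIGHT_RATIO):
    """True when text is shorter than the given fraction of the image height."""
    return text_height_px < min_ratio * image_height_px
```

For example, `is_text_too_small(5, 1000)` flags 5-pixel text in a 1000-pixel-tall image, which suggests upscaling or cropping the image before running OCR.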