ocr-based-qwen by Cunninger

OCR tool using QwenLM

Created 1 year ago

261 stars

Top 97.3% on SourcePulse

Project Summary

This project provides an OCR interface leveraging QwenLM's capabilities, designed for users needing to extract text from images, including specialized support for mathematical formulas and CAPTCHAs. It offers a user-friendly web UI with drag-and-drop and clipboard support, alongside API access for programmatic integration.

How It Works

The system acts as a proxy, interacting with the QwenLM API to perform OCR. It handles image uploads, text extraction, and crucially, formats the output according to a detailed prompt that prioritizes LaTeX for mathematical content and specific rules for CAPTCHAs. This approach allows users to benefit from QwenLM's advanced models without direct API key management, while the prompt engineering ensures structured and usable output.

Quick Start & Requirements

Docker: docker run -p 3000:3000 sexgirls/qwen-ocr-app:latest
Cloudflare Workers: Deploy worker.js to Cloudflare.
Prerequisites: Requires a QwenLM account cookie for full functionality; test cookies are provided but have upload limits.
Demo: 智能图片识别
Docs: GitHub Repository

Highlighted Details

Optimized for mathematical formula recognition, outputting in LaTeX format.
Includes CAPTCHA recognition for common character types.
Supports custom prompts for advanced users to control output formatting.
Offers API endpoints for base64 encoded images, image URLs, and file uploads.

Maintenance & Community

The project is actively maintained by Cunninger. Further community engagement details (like Discord/Slack) are not explicitly mentioned in the README.

Licensing & Compatibility

License: MIT License.
Compatibility: Permissive MIT license allows for commercial use and integration into closed-source projects.

Limitations & Caveats

The project relies on external QwenLM API access, which may be subject to rate limits or changes. The provided test cookies have upload limitations, necessitating the use of personal cookies for stable operation. The API documentation link provided points to an Apifox page that notes potential debugging issues.

Health Check

Last Commit

9 months ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

1 stars in the last 30 days