Umi-OCR_plugins by hiroi-sora

Plugins for Umi-OCR

Created 2 years ago

528 stars

Top 59.8% on SourcePulse

Project Summary

This repository hosts plugins for Umi-OCR, an open-source OCR application that supports extensible OCR engines and components. It caters to users seeking to enhance Umi-OCR's capabilities with various OCR technologies, offering offline and cloud-based solutions with different performance and compatibility trade-offs.

How It Works

Plugins are designed as modular components that integrate with Umi-OCR v2 and later. Users download pre-compiled plugin packages from the Releases page and place them in the UmiOCR-data/plugins directory. The repository details several OCR engines, including PaddleOCR-json (optimized for CPU, AVX instruction set required), RapidOCR-json (lightweight, low resource usage), Pix2Text (for math formulas), TesseractOCR (multi-language, good English accuracy), ChineseOCR (lightweight), WechatOCR (offline WeChat OCR), and Mistral AI OCR (cloud-based API).

Quick Start & Requirements

Installation: Download plugin zip from Releases, extract to UmiOCR-data/plugins.
Prerequisites:
- Umi-OCR v2 or later.
- PaddleOCR-json requires CPU with AVX instruction set (excludes Atom, Itanium, some Celeron/Pentium).
- Other plugins generally have no special hardware requirements.
Resources: Plugin sizes vary; PaddleOCR-json is noted as large.
Links: Releases, Plugin Development, Umi-OCR Main Repo

Highlighted Details

Offers a diverse range of OCR engines: PaddleOCR, RapidOCR, Pix2Text, Tesseract, ChineseOCR, WechatOCR, and Mistral AI.
Includes both offline (CPU-bound) and cloud-based (API) OCR solutions.
PaddleOCR-json plugin supports mkldnn acceleration for enhanced CPU performance.
TesseractOCR plugin includes a layout analysis model that can improve document parsing accuracy when Umi-OCR's default is disabled.

Maintenance & Community

The project is part of the larger Umi-OCR ecosystem. Further community and maintenance details would likely be found on the main Umi-OCR repository.

Licensing & Compatibility

The README does not explicitly state a license for the plugins themselves. The underlying OCR engines have their own licenses (e.g., PaddleOCR, Tesseract). Compatibility for commercial use depends on the specific plugin's underlying OCR technology license.

Limitations & Caveats

Users are strongly cautioned against downloading the repository's source code directly, emphasizing the need to use pre-compiled releases.
PaddleOCR-json has specific CPU hardware requirements (AVX instruction set).
Pix2Text plugin is noted as having a large file size and slower loading times.

Health Check

Last Commit

11 months ago

Responsiveness

Inactive

Pull Requests (30d)

1

Issues (30d)

1

Star History

9 stars in the last 30 days

Explore Similar Projects

YomiNinja by matt-m-o

Open-source OCR tool for language learners

Created 2 years ago

Updated 6 months ago

BetterOCR by junhoyeo

OCR tool combining multiple engines with LLM for improved text detection

Created 2 years ago

Updated 8 months ago

DeepSeek-OCR-WebUI by neosun100

Intelligent OCR web application for diverse document and image analysis

Created 4 months ago

Updated 5 days ago

TTime by InkTimeRecord

Screenshot, OCR, and translation software

Created 3 years ago

Updated 1 year ago

comic-translate by ogkalu2

Desktop app for translating comics in multiple formats/languages

Created 2 years ago

Updated 5 days ago

STranslate by STranslate

WPF tool for translation and OCR tasks

Created 3 years ago

Updated 1 week ago

awesome-ocr by wanghaisheng

Curated list of OCR resources

Created 10 years ago

Updated 3 years ago

Starred by

Travis Fischer

Travis Fischer(Founder of Agentic).

Bob by ripperhe

macOS translation/OCR app

Created 6 years ago

Updated 1 month ago

RapidOCR by RapidAI

Fast, multi-platform OCR toolkit

Created 5 years ago

Updated 1 week ago

Starred by

Pawel Garbacki

Pawel Garbacki(Cofounder of Fireworks AI).

GOT-OCR2.0 by Ucas-HaoranWei

OCR research paper for unified end-to-end model

Created 1 year ago

Updated 1 year ago

Starred by

Lyumin Zhang

Lyumin Zhang(Author of ControlNet).

manga-image-translator by zyddnys

Image translator for manga/images, supporting multiple languages

Created 5 years ago

Updated 4 days ago

LunaTranslator by HIllya51

Galgame translator for visual novels

Created 3 years ago

Updated 1 day ago

Feedback? Help us improve.