Discover and explore top open-source AI tools and projects—updated daily.
dnoboriAI tool for crystal-clear scanned PDF enhancement
Top 43.3% on SourcePulse
Summary
This project addresses the poor readability of scanned PDFs from physical books on digital devices. It offers an AI-powered tool to enhance scanned documents, removing artifacts like stains and ghosting, correcting alignment, and standardizing margins. Designed for individuals digitizing personal libraries, it aims to make scanned books as comfortable to read as professional e-books, enabling full-text search and improved study efficiency.
How It Works
The tool uses Real-ESRGAN for AI image upscaling and artifact removal. It employs OCR for page number recognition, coupled with heuristic algorithms to correct page offsets, uniformly trim margins, and align PDF viewer page numbers with book page numbers. Additional features include automatic detection and metadata embedding for double-page spreads and vertical text orientation.
Quick Start & Requirements
ConvertPdf command with source/destination directories.Highlighted Details
Maintenance & Community
Developed for personal use and released due to demand. The author encourages forking for extensions rather than pull requests due to limited review time. No specific community channels or contributor details are provided.
Licensing & Compatibility
Core C# code is AGPL v3. External dependencies have their own licenses (GPL, Apache, ImageMagick). AGPLv3 is a strong copyleft license, potentially impacting commercial use or integration with closed-source projects. The tool is explicitly for personal use; redistribution of converted PDFs is warned against due to copyright.
Limitations & Caveats
Primarily Windows-focused; Linux/macOS support is not guaranteed. Setup is highly complex, requiring manual installation of numerous external tools. Processing is resource-intensive (RAM, GPU). Lacks built-in Japanese OCR for book content. Page number detection can fail for certain layouts. AGPLv3 and personal-use focus may restrict broader adoption.
1 month ago
Inactive