Discover and explore top open-source AI tools and projects—updated daily.
AI tool for paperless-ngx document management
Top 29.1% on SourcePulse
This project provides an AI-powered enhancement for paperless-ngx, automating document titling, tagging, and metadata generation. It targets users of paperless-ngx seeking to improve document organization and text extraction accuracy, offering significant time savings and more intelligent document management.
How It Works
paperless-gpt leverages Large Language Models (LLMs) and LLM Vision capabilities to process documents. It can use OpenAI or Ollama for LLM-based OCR, offering superior accuracy on challenging scans compared to traditional OCR. Alternatively, it integrates with enterprise-grade OCR services like Google Document AI or Azure Document Intelligence. The tool generates context-aware titles, tags, and correspondent information, and can create searchable PDFs with accurate text layers.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is actively maintained by icereed. Further community engagement details are not explicitly provided in the README.
Licensing & Compatibility
Limitations & Caveats
Enhanced PDF features, including text layer generation and hOCR integration, are exclusively supported when using Google Document AI as the OCR provider. The PDF_REPLACE: "true"
option for overwriting original documents is flagged as potentially dangerous and requires extreme caution due to the risk of data loss.
20 hours ago
1 day