AI tool for paperless-ngx document management
This project provides an AI-powered enhancement for paperless-ngx that automates document titling, tagging, and metadata generation. It is aimed at paperless-ngx users who want better document organization and more accurate text extraction with less manual effort.
How It Works
paperless-gpt leverages Large Language Models (LLMs) and LLM Vision capabilities to process documents. It can use OpenAI or Ollama for LLM-based OCR, offering superior accuracy on challenging scans compared to traditional OCR. Alternatively, it integrates with enterprise-grade OCR services like Google Document AI or Azure Document Intelligence. The tool generates context-aware titles, tags, and correspondent information, and can create searchable PDFs with accurate text layers.
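The flow described above (pick an OCR backend, extract text, then ask a model for metadata) can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names `Suggestion`, `process_document`, the provider keys, and the stub backends are all hypothetical, and no real OCR or LLM service is called.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Suggestion:
    """Hypothetical container for the metadata the tool is said to generate."""
    title: str
    tags: List[str] = field(default_factory=list)
    correspondent: str = ""


# Hypothetical type: an OCR backend turns raw page bytes into plain text.
OcrBackend = Callable[[bytes], str]


def process_document(
    page: bytes,
    provider: str,
    backends: Dict[str, OcrBackend],
    suggest: Callable[[str], Suggestion],
) -> Suggestion:
    """Run the selected OCR backend, then derive metadata from its text."""
    if provider not in backends:
        raise ValueError(f"unknown OCR provider: {provider!r}")
    text = backends[provider](page)
    return suggest(text)


# Stub backends standing in for an LLM vision model and a cloud OCR service.
def stub_llm_ocr(page: bytes) -> str:
    return page.decode("utf-8")


def stub_cloud_ocr(page: bytes) -> str:
    return page.decode("utf-8").upper()


# Stub "LLM" that uses the first line as the title and applies a fixed tag.
def stub_suggest(text: str) -> Suggestion:
    first_line = text.splitlines()[0] if text else "Untitled"
    return Suggestion(title=first_line, tags=["invoice"], correspondent="ACME")


backends = {"llm": stub_llm_ocr, "cloud": stub_cloud_ocr}
result = process_document(b"Invoice 2024-01\nTotal: 42", "llm", backends, stub_suggest)
```

Keeping the OCR backend behind a plain callable means an LLM-based reader and an enterprise OCR service can be swapped without touching the metadata step.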
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is actively maintained by icereed. Further community engagement details are not explicitly provided in the README.
Licensing & Compatibility
Limitations & Caveats
Enhanced PDF features, including text layer generation and hOCR integration, are exclusively supported when using Google Document AI as the OCR provider. The PDF_REPLACE: "true" option for overwriting original documents is flagged as potentially dangerous and requires extreme caution due to the risk of data loss.
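As a rough illustration of the two caveats above, a pre-flight check over the environment settings might look like the sketch below. Only the `PDF_REPLACE` key comes from the text; the `OCR_PROVIDER` key, the `"google_docai"` value, and the function `check_pdf_settings` are assumptions made for this example and should be checked against the project's README.

```python
from typing import List, Mapping


def check_pdf_settings(env: Mapping[str, str]) -> List[str]:
    """Return human-readable warnings for risky PDF settings.

    OCR_PROVIDER and the "google_docai" value are assumed names for
    illustration; PDF_REPLACE is the option named in the caveats.
    """
    warnings: List[str] = []
    provider = env.get("OCR_PROVIDER", "")
    # The option is written as the string "true", so compare case-insensitively.
    replace = env.get("PDF_REPLACE", "false").strip().lower() == "true"

    if replace and provider != "google_docai":
        warnings.append(
            "PDF_REPLACE is set, but enhanced PDF features are only "
            "supported with Google Document AI."
        )
    if replace:
        warnings.append(
            "PDF_REPLACE overwrites original documents; back them up first."
        )
    return warnings


safe = check_pdf_settings({"OCR_PROVIDER": "google_docai"})
risky = check_pdf_settings({"OCR_PROVIDER": "llm", "PDF_REPLACE": "true"})
```

Returning a list of warnings instead of raising lets the caller decide whether to abort or merely log before any original file is touched.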