paperless-gpt by icereed

AI tool for paperless-ngx document management

created 10 months ago
1,284 stars

Top 31.7% on sourcepulse

Project Summary

This project adds AI-powered automation to paperless-ngx: it generates document titles, tags, and other metadata. It targets paperless-ngx users who want better document organization and more accurate text extraction with less manual data entry.

How It Works

paperless-gpt uses Large Language Models (LLMs), including vision-capable models, to process documents. For LLM-based OCR it can use OpenAI or Ollama, which the project presents as more accurate than traditional OCR on difficult scans. Alternatively, it integrates with dedicated OCR services such as Google Document AI, Azure Document Intelligence, or Docling Server. The tool generates context-aware titles, tags, and correspondent information. It can also create searchable PDFs with accurate text layers, currently only when Google Document AI is the OCR provider.

Quick Start & Requirements

  • Installation: Docker Compose is the recommended installation method.
  • Prerequisites: Docker, a running instance of paperless-ngx, and access to an LLM provider (OpenAI API key or Ollama server).
  • Configuration: Requires setting environment variables for paperless-ngx connection details and LLM provider configuration.
  • Documentation: Detailed configuration options and usage examples are available in the README.
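
The configuration step above can be checked before the container is started. The following is a minimal sketch of such a pre-flight check, not part of paperless-gpt itself. All variable names in it (PAPERLESS_BASE_URL, PAPERLESS_API_TOKEN, LLM_PROVIDER, LLM_MODEL, OPENAI_API_KEY, OLLAMA_HOST) are assumptions that should be verified against the README, and missing_settings is a hypothetical helper.

```python
import os

# Assumed variable names; verify each one against the project README.
BASE_SETTINGS = (
    "PAPERLESS_BASE_URL",
    "PAPERLESS_API_TOKEN",
    "LLM_PROVIDER",
    "LLM_MODEL",
)

# Extra settings assumed to be needed per LLM provider.
PROVIDER_SETTINGS = {
    "openai": ("OPENAI_API_KEY",),
    "ollama": ("OLLAMA_HOST",),
}


def missing_settings(env):
    """Return the names of required settings that are absent or empty in env."""
    required = list(BASE_SETTINGS)
    provider = env.get("LLM_PROVIDER", "").strip().lower()
    required.extend(PROVIDER_SETTINGS.get(provider, ()))
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_settings(dict(os.environ))
    if missing:
        print("Missing settings: " + ", ".join(missing))
    else:
        print("All assumed settings are present.")
```

Running the script in the same shell that will launch Docker Compose lists any setting that still needs a value. It only checks for presence; it does not contact paperless-ngx or the LLM provider.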

Highlighted Details

  • LLM-Enhanced OCR: Uses OpenAI or Ollama for improved text extraction accuracy.
  • Multiple OCR Providers: Supports LLM-based OCR, Azure Document Intelligence, Google Document AI, and Docling Server.
  • Advanced PDF Features: Generates searchable PDFs with accurate text layers (currently only with Google Document AI).
  • Automatic Metadata: Generates titles, tags, and correspondent information.
  • Customizable Prompts: Allows users to tailor AI behavior via prompt templates.
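
To illustrate what a customizable prompt template means in practice, here is a minimal sketch. The project's actual template syntax and placeholder names are not described in this summary, so the example uses Python's standard string.Template purely as a stand-in. TITLE_PROMPT, build_title_prompt, and the $language and $content placeholders are all hypothetical.

```python
from string import Template

# Hypothetical title prompt; the wording and placeholders are illustrative only.
TITLE_PROMPT = Template(
    "Suggest a concise title in $language for the document below.\n"
    "Reply with the title only.\n\n"
    "$content"
)


def build_title_prompt(content, language="English", max_chars=2000):
    """Fill the template, truncating the document text to max_chars characters."""
    return TITLE_PROMPT.substitute(language=language, content=content[:max_chars])


if __name__ == "__main__":
    print(build_title_prompt("Invoice no. 42 from ACME GmbH", language="German"))
```

Editing the template text changes what the model is asked to do without touching the surrounding code, which is the general idea behind tailoring AI behavior through templates.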

Maintenance & Community

The project is actively maintained by icereed. The README does not list further community channels.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Works with paperless-ngx. The MIT license permits commercial use and linking with closed-source applications.

Limitations & Caveats

Enhanced PDF features, including text layer generation and hOCR integration, are exclusively supported when using Google Document AI as the OCR provider. The PDF_REPLACE: "true" option for overwriting original documents is flagged as potentially dangerous and requires extreme caution due to the risk of data loss.
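
Because overwriting originals cannot be undone, a wrapper script around a deployment might refuse to proceed unless a backup has been confirmed. The sketch below shows one such guard under stated assumptions: only the PDF_REPLACE name and its "true" value come from the text above. How paperless-gpt itself parses the value is not stated here, and may_overwrite_originals and backup_confirmed are hypothetical.

```python
def pdf_replace_enabled(env):
    """Treat only the value "true" (case-insensitive) as enabled; this parsing is an assumption."""
    return env.get("PDF_REPLACE", "false").strip().lower() == "true"


def may_overwrite_originals(env, backup_confirmed):
    """Allow overwriting only when the flag is set and the caller confirmed a backup."""
    return pdf_replace_enabled(env) and backup_confirmed


if __name__ == "__main__":
    env = {"PDF_REPLACE": "true"}
    if pdf_replace_enabled(env) and not may_overwrite_originals(env, backup_confirmed=False):
        print("PDF_REPLACE is enabled but no backup was confirmed; refusing to continue.")
```

The guard does not make the option safe; it only forces an explicit decision before originals are replaced.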

Health Check

  • Last commit: 3 days ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 47
  • Issues (30d): 5
  • Star History: 454 stars in the last 90 days
