llama-scan by ngafar

PDF to text transcription with local LLMs

Created 7 months ago

817 stars

Top 43.4% on SourcePulse

View on GitHub

2 Experts Love This Project

Jonathan Ragan-Kelley

Professor at MIT

Jeffrey Morgan

Cofounder of Ollama

Project Summary

This tool enables local PDF transcription and analysis using Ollama's multimodal LLMs, offering a cost-effective solution for extracting text and image descriptions from documents without relying on cloud services. It is designed for users who need to process sensitive or large PDF collections locally.

How It Works

The tool leverages Ollama to run large language models locally, processing PDF files page by page. It extracts text content and utilizes multimodal capabilities to generate detailed descriptions of images and diagrams within the PDFs, converting the entire document into a text-based format.

Quick Start & Requirements

Primary install / run command: pip install llama-scan or uv tool install llama-scan
Non-default prerequisites: Python 3.10+, Ollama installed and running locally.
Usage: llama-scan path/to/your/file.pdf
Documentation: [Not explicitly linked, but usage examples are provided in README]

Highlighted Details

Local processing eliminates token costs and enhances data privacy.
Supports the latest multimodal LLMs available through Ollama.
Capable of transcribing both text and image content from PDFs.
Offers options for specifying output directory, model, page range, and image resizing.

Maintenance & Community

Project maintained by ngafar.
No community links (Discord/Slack) or roadmap are provided in the README.

Licensing & Compatibility

License: Not specified in the README.
Compatibility: Designed for local execution, compatible with any system running Python 3.10+ and Ollama.

Limitations & Caveats

The tool's effectiveness is dependent on the performance and capabilities of the locally installed Ollama models. The README does not specify the license, which may impact commercial use.

Health Check

Last Commit

3 weeks ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

83 stars in the last 30 days