Discover and explore top open-source AI tools and projects—updated daily.
PDF to text transcription with local LLMs
Top 50.0% on SourcePulse
This tool enables local PDF transcription and analysis using Ollama's multimodal LLMs, offering a cost-effective solution for extracting text and image descriptions from documents without relying on cloud services. It is designed for users who need to process sensitive or large PDF collections locally.
How It Works
The tool leverages Ollama to run large language models locally, processing PDF files page by page. It extracts text content and utilizes multimodal capabilities to generate detailed descriptions of images and diagrams within the PDFs, converting the entire document into a text-based format.
Quick Start & Requirements
pip install llama-scan
or uv tool install llama-scan
llama-scan path/to/your/file.pdf
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The tool's effectiveness is dependent on the performance and capabilities of the locally installed Ollama models. The README does not specify the license, which may impact commercial use.
2 weeks ago
Inactive