Discover and explore top open-source AI tools and projects—updated daily.
gavrielcAI-powered CLI for PDF editing and slide generation
Top 38.9% on SourcePulse
This project provides a command-line interface (CLI) tool for editing PDF documents, specifically presentations, using natural language prompts powered by Google's Gemini 3 Pro Image model. It targets users who need to make quick, content-based or stylistic modifications to PDFs without resorting to traditional graphical editors, offering a significant time-saving benefit through AI automation.
How It Works
Nano-PDF employs a multi-stage process: first, it renders target PDF pages into images using the Poppler library. These images, optionally accompanied by style reference pages, are then sent along with natural language editing prompts to Google's Gemini 3 Pro Image model. The AI generates an edited image, which is subsequently processed by Tesseract OCR to re-hydrate a searchable text layer. Finally, the tool stitches these AI-edited images back into the original PDF structure, preserving the document's integrity and text selectability. This approach allows for non-destructive, context-aware edits and parallel processing across multiple pages for efficiency.
Quick Start & Requirements
pip install nano-pdfGEMINI_API_KEY environment variable must be set.poppler (for PDF rendering) and tesseract (for OCR). Installation instructions are provided for macOS, Windows, and Linux (Ubuntu/Debian).Highlighted Details
--style-refs) to guide the AI in matching fonts, colors, and layout.--resolution (4K, 2K, 1K).Maintenance & Community
No specific details regarding maintainers, community channels (like Discord or Slack), or project roadmap were found in the provided README excerpt.
Licensing & Compatibility
Limitations & Caveats
Operation requires a paid Google Gemini API tier, and free tier keys will not function. The accuracy of the OCR re-hydration can vary, particularly with highly stylized fonts or very small text. Processing speed is directly influenced by the chosen image resolution, with higher resolutions leading to slower execution. The tool's effectiveness is also dependent on the correct installation and accessibility of system dependencies like Poppler and Tesseract.
1 month ago
Inactive