CLI tool for converting PDFs into podcast episodes
Top 45.5% on sourcepulse
This project converts PDF documents into podcast episodes using AI for dialogue generation and text-to-speech. It targets users who want to repurpose written content into an audio format, offering a simple way to create podcast-like audio from PDFs.
How It Works
The tool processes PDF content, feeding it to Google's Gemini LLM to generate natural, podcast-suitable dialogue. This AI-generated script is then converted into audio using OpenAI's text-to-speech models, producing an MP3 output. This approach leverages advanced AI for content summarization and natural language generation, aiming for high-quality audio output.
Quick Start & Requirements
uv sync
python main.py
OPENAI_API_KEY
environment variable or provided via the interface).Highlighted Details
Maintenance & Community
No specific details on contributors, sponsorships, or community channels are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The project requires API keys for both Google Gemini and OpenAI, which may incur costs. The quality of the output is dependent on the PDF content and the performance of the underlying AI models.
4 months ago
1+ week