Podcast production agent
Top 80.9% on sourcepulse
smol-podcaster automates podcast production tasks, including transcription, chapter generation, and content ideation for social media. It targets podcast creators and researchers seeking to streamline post-production workflows. The primary benefit is the automated generation of structured content from raw audio files.
How It Works
The tool leverages large language models (LLMs) for transcription, speaker diarization, chapter creation, and content summarization. It processes audio files, extracts key information, and formats it into usable show notes, chapter lists, and promotional content. The system supports both CLI and a web UI with background task processing via Celery, allowing for parallel execution and remote operation.
Quick Start & Requirements
pip install -r requirements.txt
.env
file.python smol_podcaster.py AUDIO_FILE_URL GUEST_NAME NUMBER_OF_SPEAKERS
honcho start
or run celery -A tasks worker --loglevel=INFO
and flask --app web.py --debug run
separately. Access at localhost:5000
.Highlighted Details
Maintenance & Community
The project is maintained by FanaHOVA. No specific community channels or roadmap details are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The project requires configuration of external LLM API keys and a message broker for distributed task execution. Audio/video timestamp synchronization relies on string similarity, which may not perfectly align content if transcription errors are significant.
10 months ago
1 day