Discover and explore top open-source AI tools and projects—updated daily.
zarazhangruiAI podcast generator from any content
Top 78.4% on SourcePulse
This project provides a "coding agent skill" that transforms any text-based content into a personalized, two-host AI podcast. It targets users who prefer audio consumption of information, individuals seeking self-reflection through AI analysis of their personal data, and developers interested in automated podcast generation. The primary benefit is the ability to listen to newsletters, research papers, meeting notes, or even personal documents on the go, with customizable hosts, voices, and scripts, delivered directly to standard podcast apps via an RSS feed.
How It Works
The core approach leverages a coding agent to read input content (pasted text, files, or URLs) and generate a natural-sounding conversational script between two distinct AI hosts. Text-to-speech synthesis is handled by Fish Audio, offering a vast library of voices. The generated audio segments are then stitched together using pydub and ffmpeg, incorporating natural pacing and fade effects. This pipeline runs entirely locally, eliminating the need for separate backends or hosted services. Customization is achieved through editing a PROMPT.md file for script behavior and a config.yaml file for host personalities, voices, and show parameters.
Quick Start & Requirements
gh repo clone zarazhangrui/personalized-podcast-skill ~/.claude/skills/personalized-podcast/podcast <paste content, point to files, or describe a topic>ffmpeg (install via brew install ffmpeg on macOS)..env file.fish.audio/discovery.Highlighted Details
.txt, .md, .pdf), and URLs.Maintenance & Community
The project is built by Zara Zhang. No specific details regarding additional contributors, community channels (like Discord or Slack), or a public roadmap are provided in the README.
Licensing & Compatibility
The open-source license is not explicitly stated in the README. This omission creates ambiguity regarding commercial use and derivative works. The project is designed for local execution via a coding agent.
Limitations & Caveats
The project's functionality is contingent on the availability and compatibility of a suitable coding agent and the Fish Audio TTS service. The absence of a specified license poses a significant caveat for adoption, particularly in commercial or collaborative contexts, as usage terms are unclear.
1 month ago
Inactive
FanaHOVA