Discover and explore top open-source AI tools and projects—updated daily.
Agents365-aiAutomated video podcast generation skill
Top 75.8% on SourcePulse
Automated video podcast creation is addressed by this project, which streamlines the process from topic generation to a professional video podcast, specifically optimized for Bilibili. It targets content creators and users seeking to produce video podcasts without deep technical expertise, offering a benefit of simplified, AI-assisted production.
How It Works
This project employs an AI-driven workflow, orchestrated by agents like Claude Code, to guide users through natural language prompts. It integrates several key components: web-based research for content gathering, structured scriptwriting with chapter markers, multi-engine Text-to-Speech (TTS) synthesis (Azure Speech, CosyVoice, Edge TTS), and React-based video rendering via Remotion. Visual styles are editable in Remotion Studio, with audio synchronized using FFmpeg, background music layered, and optional SRT subtitles burned in. The approach is novel in its comprehensive automation and Bilibili-specific optimizations, including AI-generated covers and chapter timestamps.
Quick Start & Requirements
npx create-video@latest my-video-projectcd my-video-projectnpm ipip install azure-cognitiveservices-speech dashscope edge-tts requestsnpx remotion studio src/remotion/index.tsTTS_BACKEND), Azure keys/region, and other API keys via ~/.zshrc or ~/.bashrc.npx create-video, npx remotion studio.Highlighted Details
Maintenance & Community
The project is actively developed by Agents365-ai, with a roadmap indicating ongoing improvements. Support options include WeChat Pay, Alipay, and Buy Me a Coffee. The GitHub repository is available for contributions and issue tracking.
Licensing & Compatibility
The project is released under the MIT license, which permits commercial use and integration into closed-source projects.
Limitations & Caveats
The project is under continuous iteration, and some features may not yet be fully mature. The quality of the initial video generation is dependent on the underlying AI model used (e.g., Codex, Claude Code, GLM-5). Windows compatibility requires further validation via WSL.
2 days ago
Inactive