Discover and explore top open-source AI tools and projects—updated daily.
AI framework for immersive narrated storybook video generation
Top 94.1% on SourcePulse
MM-StoryAgent is a multi-agent framework for generating immersive narrated storybook videos. It leverages Large Language Models (LLMs) and specialized tools across text, image, and audio modalities to create expressive storytelling content, offering a customizable workflow for users to integrate their own expert agents.
How It Works
MM-StoryAgent employs a multi-agent, multi-stage pipeline to generate stories and corresponding assets. LLMs are used to write high-quality stories based on user-defined settings. Separate agents handle the generation of image, speech, sound, and music assets, which are then composed into a final video. This modular, agent-based approach allows for customization and improved generation quality for each component.
Quick Start & Requirements
pip install -r requirements.txt
pip install -e .
python run.py -c configs/mm_story_agent.yaml
__init__
and call
methods and registering them.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 year ago
Inactive