MM_StoryAgent by X-PLUG

AI framework for immersive narrated storybook video generation

Created 1 year ago

302 stars

Top 88.6% on SourcePulse

Project Summary

MM-StoryAgent is a multi-agent framework for generating immersive narrated storybook videos. It leverages Large Language Models (LLMs) and specialized tools across text, image, and audio modalities to create expressive storytelling content, offering a customizable workflow for users to integrate their own expert agents.

How It Works

MM-StoryAgent employs a multi-agent, multi-stage pipeline to generate stories and corresponding assets. LLMs are used to write high-quality stories based on user-defined settings. Separate agents handle the generation of image, speech, sound, and music assets, which are then composed into a final video. This modular, agent-based approach allows for customization and improved generation quality for each component.
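The staged flow described above can be pictured with a small sketch. This is only an illustration under stated assumptions, not the project's actual code: the names `write_story`, `make_assets`, `compose_video`, and `run_pipeline`, the settings keys, and the data shapes are all hypothetical, and every stage is a stub standing in for an LLM or a generation tool.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# The four asset modalities named in the summary.
MODALITIES = ("image", "speech", "sound", "music")


@dataclass
class Story:
    """Hypothetical container: story pages plus generated asset paths."""
    pages: list[str]
    assets: dict[str, list[str]] = field(default_factory=dict)


def write_story(settings: dict) -> list[str]:
    """Stub for the LLM story-writing stage: one text per page."""
    topic = settings.get("topic", "a story")
    num_pages = settings.get("num_pages", 3)
    return [f"Page {i + 1} of {topic}" for i in range(num_pages)]


def make_assets(modality: str, pages: list[str]) -> list[str]:
    """Stub for a modality agent: one placeholder file name per page."""
    return [f"{modality}_{i}.bin" for i in range(len(pages))]


def compose_video(story: Story) -> dict:
    """Stub for the final stage: report what would be composed."""
    return {"num_pages": len(story.pages), "tracks": sorted(story.assets)}


def run_pipeline(settings: dict) -> dict:
    """Run the stages in order: story first, then assets, then composition."""
    story = Story(pages=write_story(settings))
    for modality in MODALITIES:
        story.assets[modality] = make_assets(modality, story.pages)
    return compose_video(story)
```

Because each modality is handled by its own function, one stage can be swapped out without touching the others, which is the benefit the modular design is meant to provide.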

Quick Start & Requirements

Install dependencies and the package:

pip install -r requirements.txt
pip install -e .

Run generation via configuration files:

python run.py -c configs/mm_story_agent.yaml

Custom agents can be added by writing a class that implements the __init__ and call methods and registering it with the framework.
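A minimal sketch of the custom-agent mechanism described above follows. It is self-contained and does not import the project's package. The registry `TOOL_REGISTRY`, the decorator `register_tool`, the helper `build_agent`, the example agent, and the config and parameter shapes are assumptions made for illustration; consult the repository for the real registration API.

```python
# Hypothetical registry mapping a tool name to its agent class.
TOOL_REGISTRY = {}


def register_tool(name):
    """Class decorator (assumed here) that records an agent under `name`."""
    def decorator(cls):
        TOOL_REGISTRY[name] = cls
        return cls
    return decorator


@register_tool("uppercase_story")
class UppercaseStoryAgent:
    """Toy agent: upper-cases each story page and appends a suffix."""

    def __init__(self, cfg):
        # cfg stands in for the agent's section of a configuration file.
        self.cfg = cfg

    def call(self, params):
        suffix = self.cfg.get("suffix", "")
        return {"pages": [page.upper() + suffix for page in params["pages"]]}


def build_agent(name, cfg):
    """Look up a registered agent class by name and instantiate it."""
    return TOOL_REGISTRY[name](cfg)
```

With this pattern, a configuration file only needs to name the tool. For example, `build_agent("uppercase_story", {"suffix": "!")` creates the agent, and its `call` method is then invoked with the stage's parameters.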

Highlighted Details

Generates expressive storytelling videos by composing text, image, speech, sound, and music assets.
Features a customizable workflow allowing users to define and integrate their own expert tools.
Employs a multi-agent, multi-stage pipeline for story writing, which outperforms direct LLM prompting in evaluations.
Evaluation data, rubrics, and prompts are provided for story quality assessment.

Maintenance & Community

The initial version was released on August 16, 2024.
A demo video is available.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README.

Limitations & Caveats

The README does not specify hardware requirements (e.g., GPU) or detailed setup time.
It also does not mention supported operating systems or specific Python versions; it gives only general dependency installation steps.

Health Check

Last Commit: 1 year ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0

Star History

3 stars in the last 30 days

Explore Similar Projects

TypeTale by TypeTale

AIGC video generation toolkit for content creators

Created 10 months ago

Updated 1 month ago

MoneyPrinterAICreate by q1uki

AI video generation and editing suite

Created 10 months ago

Updated 9 months ago

jianying-editor-skill by luoluoluo22

AI agent skill for automated video editing

Created 1 month ago

Updated 2 weeks ago

univa by univa-agent

AI-powered system for universal video creation and direction

Created 3 months ago

Updated 4 weeks ago

ai_story by xhongc

AI video generation platform for automated story creation

Created 3 months ago

Updated 1 week ago

StoryGen-Atelier by 0xsline

AI-driven tool for automated storyboard and video generation

Created 2 months ago

Updated 2 months ago

Starred by Elvis Saravia (Founder of DAIR.AI).

Mora by lichao-sun

Multi-agent framework for generalist video generation

Created 1 year ago

Updated 1 year ago

director_ai by freestylefly

AI video generation app for comic dramas

Created 1 month ago

Updated 3 weeks ago

Pixelle-Video by AIDC-AI

AI engine for fully automated short video creation

Created 3 months ago

Updated 3 weeks ago

ViMax by HKUDS

Agentic video creation powered by multi-modal AI agents

Created 11 months ago

Updated 2 months ago

jaaz by 11cafe

Open-source AI design agent

Created 8 months ago

Updated 3 months ago

MoneyPrinterPlus by ddean2009

AI tool for one-click short video generation and multi-platform publishing

Created 1 year ago

Updated 11 months ago
