motionagent  by modelscope

AI assistant for converting ideas into motion pictures

Created 2 years ago
304 stars

Top 88.0% on SourcePulse

GitHubView on GitHub
Project Summary

MotionAgent is an AI-powered tool for generating videos from user-provided scripts, targeting creators and developers who want to automate video production. It offers a comprehensive workflow from scriptwriting to final video and music generation, leveraging large language models and diffusion models.

How It Works

MotionAgent integrates multiple AI models to achieve its functionality. It uses LLMs like Qwen-7B-Chat for script generation based on themes, diffusion models (SDXL 1.0) for generating movie stills from scripts, image-to-video models (I2VGen-XL) for video creation, and MusicGen for background music composition. This modular approach allows for a complete pipeline from text to multimedia output.

Quick Start & Requirements

  • Install: Clone the repository and install dependencies via pip3 install -r requirements.txt. Run the application with python3 app.py.
  • Prerequisites: Python 3.8, PyTorch 2.0.1, CUDA 11.7, Ubuntu 20.04.
  • Hardware: Requires an Nvidia A100 40GB GPU (minimum 36GB VRAM) and over 50GB disk space.
  • Docs: ModelScope library

Highlighted Details

  • Script generation using LLMs (e.g., Qwen-7B-Chat).
  • Movie still generation via SDXL 1.0.
  • Video generation from images using I2VGen-XL.
  • Custom background music generation with MusicGen.

Maintenance & Community

The project is powered by the ModelScope community, an open-source model ecosystem from Damo Academy. Further details on contributing models are available.

Licensing & Compatibility

Licensed under the Apache License (Version 2.0). This license is permissive and generally compatible with commercial use and closed-source linking.

Limitations & Caveats

The current setup is verified for a specific environment (Python 3.8, CUDA 11.7, Ubuntu 20.04) and requires significant GPU memory (36GB+). The --clear_cache flag is noted for low-disk environments, implying potential re-downloading of large models with speed impacts.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 30 days

Explore Similar Projects

Starred by John Mullan John Mullan(MTS at xAI; Cofounder of Hotshot AI) and Patrick von Platen Patrick von Platen(Author of Hugging Face Diffusers; Research Engineer at Mistral).

Pallaidium by tin2tin

0.6%
1k
Generative AI movie studio integrated into Blender
Created 2 years ago
Updated 2 months ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Jiaming Song Jiaming Song(Chief Scientist at Luma AI).

MoneyPrinterTurbo by harry0703

0.4%
40k
AI tool for one-click short video generation from text prompts
Created 1 year ago
Updated 3 months ago
Feedback? Help us improve.