generative-manim  by marcelo-earth

GPT-4o powered generative videos concept

Created 2 years ago
677 stars

Top 49.9% on SourcePulse

GitHubView on GitHub
Project Summary

Generative Manim (GM) is a suite of tools designed to empower users with no prior programming or video editing experience to create animated videos using Manim, driven by Large Language Models (LLMs) like GPT-4o and Claude. It translates natural language descriptions into Manim code, facilitating accessible video generation.

How It Works

GM leverages LLMs as the core "models" to convert textual prompts into executable Manim code. This approach capitalizes on the LLMs' natural language understanding and code generation capabilities, abstracting away the complexities of Manim scripting. The project supports various models, including fine-tuned GPT-3.5 variants for general and physics animations, and multiple Claude models, all adapted with custom system prompts for optimized video generation.

Quick Start & Requirements

  • Installation and usage details are not explicitly provided in the README.
  • Requires access to LLM APIs (e.g., OpenAI, Anthropic) and the Manim rendering engine.
  • Refer to the Generative Manim Demo and Generative Manim API for more information.

Highlighted Details

  • Supports multiple LLM backends including GPT-4o, GPT-3.5 (fine-tuned), and Claude Sonnet 3/3.5.
  • Offers fine-tuned models specifically for physics animations.
  • Aims to democratize video animation creation for non-technical users.
  • Includes a legacy Streamlit application for initial LLM explorations.

Maintenance & Community

  • Actively seeking new model suggestions and contributions via GitHub issues and Discord.
  • Sponsored by "The Astronomical Software Company".
  • Community engagement is encouraged via their Discord server.

Licensing & Compatibility

  • The repository's license is not explicitly stated in the README.
  • Compatibility for commercial use or closed-source linking is undetermined without a specified license.

Limitations & Caveats

The README does not detail specific installation instructions, system requirements, or provide benchmarks. The project appears to be in an active development phase, with a legacy Streamlit app mentioned, suggesting potential for ongoing changes and API evolution.

Health Check
Last Commit

6 days ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
22 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Elvis Saravia Elvis Saravia(Founder of DAIR.AI).

NExT-GPT by NExT-GPT

0.1%
4k
Any-to-any multimodal LLM research paper
Created 2 years ago
Updated 4 months ago
Starred by Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), Zack Li Zack Li(Cofounder of Nexa AI), and
19 more.

LLaVA by haotian-liu

0.2%
24k
Multimodal assistant with GPT-4 level capabilities
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.