Ai-movie-clip  by LumingMelody

AI system for automated video editing

Created 2 months ago
252 stars

Top 99.6% on SourcePulse

Project Summary

Summary

This project is an AI-powered automatic video editing system that analyzes video content and generates edited clips according to user-defined requirements. It targets users and developers who want to automate video post-production, reducing manual editing effort and speeding up content creation through AI analysis and generative features.

How It Works

The system uses Computer Vision (CV) and Machine Learning (ML) models to analyze video content and identify key elements and scenes. It ships with pre-defined video style templates for output formats such as social media, commercial advertisements, and educational materials. It also integrates generative AI for text generation, image synthesis, and speech synthesis to enrich video narratives. A FastAPI-based API service provides programmatic control, batch processing, and integration with the Model Context Protocol (MCP). Through Coze integration, the system additionally supports digital human performances and context-aware image insertion into videos.

Quick Start & Requirements

  • Installation: Begin by cloning the project repository (git clone https://github.com/LumingMelody/Ai-movie-clip.git), navigating into the directory (cd Ai-movie-clip), and installing Python dependencies (pip install -r requirements.txt).
  • Prerequisites: Requires Python 3.8 or higher, FFmpeg for video processing, and optionally CUDA for GPU acceleration.
  • Configuration: Essential setup involves creating and populating an .env file with API keys for services like DashScope, OpenAI, Alibaba Cloud OSS, and Coze (including a Coze Workflow ID). A specific resource package must also be downloaded and extracted to the resources/ directory.
  • Documentation: Comprehensive API documentation, frontend integration guides, and system architecture design documents are provided.
  • Resource Package Download: https://pan.quark.cn/s/5a16054e18eb
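
The prerequisites and configuration steps above can be checked before the first run with a small preflight script. The following is a minimal sketch, not part of the project: the environment variable names in ASSUMED_ENV_KEYS are hypothetical (the summary only says the .env file holds keys for DashScope, OpenAI, Alibaba Cloud OSS, and Coze, plus a Coze Workflow ID), and the parser handles only simple KEY=VALUE lines.

```python
import shutil
import sys
from pathlib import Path

# Hypothetical key names: the summary names the services but not the variables.
# Replace these with the names the project's own .env template uses.
ASSUMED_ENV_KEYS = (
    "DASHSCOPE_API_KEY",
    "OPENAI_API_KEY",
    "OSS_ACCESS_KEY_ID",
    "OSS_ACCESS_KEY_SECRET",
    "COZE_API_KEY",
    "COZE_WORKFLOW_ID",
)


def parse_env_text(text):
    """Parse simple KEY=VALUE lines, skipping blanks, '#' comments, and lines without '='."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(env_values, required=ASSUMED_ENV_KEYS):
    """Return the required keys that are absent or empty."""
    return [key for key in required if not env_values.get(key)]


def preflight(project_dir="."):
    """Collect setup problems as strings; an empty list means every check passed."""
    problems = []
    if sys.version_info < (3, 8):
        problems.append("Python 3.8 or higher is required")
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    root = Path(project_dir)
    env_file = root / ".env"
    if not env_file.is_file():
        problems.append(".env file is missing")
    else:
        env_values = parse_env_text(env_file.read_text(encoding="utf-8"))
        for key in missing_keys(env_values):
            problems.append(f"{key} is not set in .env")
    if not (root / "resources").is_dir():
        problems.append("resources/ directory is missing")
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("WARN:", problem)
```

Run from the repository root, the script prints one warning per unmet requirement and nothing when all checks pass. It does not test for CUDA, since GPU acceleration is optional.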

Highlighted Details

  • Automated Video Analysis: Employs CV/ML models to dissect video content, enabling intelligent scene understanding and segmentation.
  • AI Content Generation Suite: Integrates text generation, image synthesis, and speech synthesis to dynamically create and insert new content into videos.
  • Coze Digital Human & Image Insertion: Generates text-driven digital human performances and inserts images into video scenes with optimized placement.
  • Extensible API Service: A FastAPI backend provides a flexible interface for remote control, batch job execution, and integration into larger application ecosystems.
  • Template-Driven Editing: Offers a variety of stylistic templates and allows users to define custom templates for tailored video outputs.

Maintenance & Community

The project is maintained by its author, LumingMelody. The README does not list dedicated community channels such as Discord or Slack, and it does not include a public roadmap. The project acknowledges contributions and support from Alibaba Cloud DashScope and OpenAI.

Licensing & Compatibility

Released under the permissive MIT License, the project can be used commercially and integrated into closed-source applications, provided the copyright and license notice are retained.

Limitations & Caveats

Initial setup is complex: it requires obtaining and configuring multiple third-party API keys and services. The system supports automatic sharding of large video files, but good performance may require GPU acceleration or careful tuning of concurrency settings. Reliance on external AI models and cloud services brings potential operating costs and a dependency on provider availability and policies.
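
To illustrate the sharding and concurrency trade-off mentioned above, here is a minimal sketch of splitting a long video into fixed-length time windows and processing them with a bounded worker pool. It is an illustration of the general technique only, not the project's implementation: plan_shards, process_shards, describe_shard, and the max_workers default are all hypothetical names and values.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def plan_shards(total_seconds, shard_seconds):
    """Split [0, total_seconds) into consecutive (start, end) windows of at most shard_seconds."""
    if total_seconds < 0 or shard_seconds <= 0:
        raise ValueError("total_seconds must be >= 0 and shard_seconds must be > 0")
    count = math.ceil(total_seconds / shard_seconds)
    return [
        (i * shard_seconds, min((i + 1) * shard_seconds, total_seconds))
        for i in range(count)
    ]


def process_shards(shards, worker, max_workers=2):
    """Call worker(start, end) for each shard with at most max_workers running at once.

    Results are returned in shard order, regardless of completion order.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda shard: worker(*shard), shards))


def describe_shard(start, end):
    """Placeholder worker; a real one would analyze or render this time window."""
    return f"{start}-{end}s"


if __name__ == "__main__":
    shards = plan_shards(600, 240)  # a 10-minute video in 4-minute windows
    print(process_shards(shards, describe_shard, max_workers=2))
```

Raising max_workers increases throughput only until the GPU, CPU, or a remote API rate limit becomes the bottleneck, which is why the concurrency setting needs tuning per deployment.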

Health Check

  • Last Commit: 2 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 30 stars in the last 30 days
