AIComicBuilder by twwch

AI comic generation from script to animated video

Created 4 weeks ago

804 stars

Top 43.8% on SourcePulse

Project Summary

AI Comic Builder is an AI-powered, automated pipeline for generating animated comic videos from scripts. It targets creators and developers looking to streamline the animation production process, offering AI-driven character design, storyboarding, and video synthesis to transform text-based narratives into visual content.

How It Works

The project runs a multi-stage AI pipeline: script input, character extraction and three-view generation, storyboarding with shot lists, reference frame creation, video prompt generation, and finally AI video generation and synthesis. It combines text, image, and video models with FFmpeg for video processing, so video creation is automated while users keep granular control over each stage of the production flow.
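
The stage ordering described above can be sketched as a small runner. This is a minimal illustration, not the project's actual code: the stage names, the `Context` and `Handler` types, and `runStages` are all hypothetical, chosen only to mirror the stages listed in this section.

```typescript
// Hypothetical stage names that mirror the stages listed above; the real
// project's identifiers may differ.
const STAGES = [
  "script",
  "characters",
  "storyboard",
  "referenceFrames",
  "videoPrompts",
  "videoSynthesis",
] as const;

type Stage = (typeof STAGES)[number];

// Each handler receives the accumulated context and returns additions to it.
type Context = Record<string, unknown>;
type Handler = (ctx: Context) => Promise<Context>;

// Runs the selected stages in pipeline order, regardless of the order in
// which they were requested, so one stage or a batch uses the same call.
async function runStages(
  handlers: Record<Stage, Handler>,
  selected: readonly Stage[],
  initial: Context = {},
): Promise<{ ctx: Context; ran: Stage[] }> {
  let ctx: Context = { ...initial };
  const ran: Stage[] = [];
  for (const stage of STAGES) {
    if (!selected.includes(stage)) continue;
    ctx = { ...ctx, ...(await handlers[stage](ctx)) };
    ran.push(stage);
  }
  return { ctx, ran };
}

// Stub handlers that only record that they ran. Real handlers would call
// text, image, or video models and FFmpeg.
function stubHandlers(): Record<Stage, Handler> {
  const handlers = {} as Record<Stage, Handler>;
  for (const stage of STAGES) {
    handlers[stage] = async () => ({ [stage]: "done" });
  }
  return handlers;
}
```

Calling `runStages(stubHandlers(), ["storyboard", "characters"])` runs the character stage before the storyboard stage, because execution follows the fixed pipeline order and not the request order.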

Quick Start & Requirements

  • Installation: Requires Node.js 18+ and pnpm. Install dependencies with pnpm install, initialize the database with pnpm drizzle-kit push, and run with pnpm dev.
  • Docker: Available via docker run or docker compose up -d.
  • Prerequisites: FFmpeg is necessary for video synthesis.
  • Configuration: AI model providers (OpenAI, Gemini, Seedance) must be configured in the settings.
  • Data Persistence: Volumes are used for ./data (SQLite DB) and ./uploads (generated assets).
  • Demos (Bilibili video IDs): BV19rwVzUEeD, BV1RrwVzUE3x, BV15rwVzSEKZ, BV15kwiz7E6Q, BV1hTw1zAEgY
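
The two local prerequisites above (Node.js 18+ and FFmpeg) can be checked before starting the app. The sketch below is an assumption about how such a check might look and is not part of the project. The helper names `meetsNodeRequirement`, `hasFfmpeg`, and `preflight` are hypothetical; `spawnSync` is the standard Node.js `child_process` call.

```typescript
import { spawnSync } from "node:child_process";

// Returns true when a version string such as "18.17.0" or "v20.1.0" meets
// the minimum major version (18, per the requirements above).
function meetsNodeRequirement(version: string, minMajor = 18): boolean {
  // parseInt stops at the first non-digit, so "18.17.0" parses as 18.
  const major = Number.parseInt(version.replace(/^v/, ""), 10);
  return Number.isInteger(major) && major >= minMajor;
}

// Returns true when the given command can be launched from PATH and
// `-version` exits successfully.
function hasFfmpeg(command = "ffmpeg"): boolean {
  const result = spawnSync(command, ["-version"], { stdio: "ignore" });
  return result.error === undefined && result.status === 0;
}

// Collects human-readable problems; an empty array means both checks passed.
function preflight(nodeVersion: string): string[] {
  const problems: string[] = [];
  if (!meetsNodeRequirement(nodeVersion)) {
    problems.push(`Node.js 18+ required, found ${nodeVersion}`);
  }
  if (!hasFfmpeg()) {
    problems.push("FFmpeg not found on PATH");
  }
  return problems;
}

// Typical use: preflight(process.versions.node)
```

Returning a list of problems rather than throwing on the first failure lets a caller report every missing prerequisite at once.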

Highlighted Details

  • Supports multiple languages: Chinese, English, Japanese, Korean.
  • Integrates with various AI providers including OpenAI, Gemini, and Seedance for text, image, and video generation.
  • Offers multiple collaboration views for storyboarding: drawer, inline panel, and Kanban board.
  • Provides fine-grained control over the generation pipeline, allowing individual or batch triggering of stages.
  • Data and generated assets are persisted via Docker volumes.
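
Because generation depends on providers being configured for text, image, and video, a settings check along the following lines can surface gaps early. This is a hedged sketch: the `ProviderSetting` shape, its field names, and `missingModalities` are illustrative assumptions and are not taken from the project. The provider assignments in the example are arbitrary placeholders.

```typescript
// The three generation modalities named above.
const MODALITIES = ["text", "image", "video"] as const;
type Modality = (typeof MODALITIES)[number];

// Hypothetical settings shape: one optional entry per modality.
interface ProviderSetting {
  provider: string; // for example "OpenAI", "Gemini", or "Seedance"
  apiKey: string;
}
type ProviderSettings = Partial<Record<Modality, ProviderSetting>>;

// Lists the modalities that still lack a usable provider entry.
function missingModalities(settings: ProviderSettings): Modality[] {
  return MODALITIES.filter((modality) => {
    const entry = settings[modality];
    return (
      entry === undefined ||
      entry.provider.trim() === "" ||
      entry.apiKey.trim() === ""
    );
  });
}

// Example with placeholder values; which provider serves which modality
// is an arbitrary choice here.
const example: ProviderSettings = {
  text: { provider: "OpenAI", apiKey: "placeholder-key" },
  image: { provider: "Gemini", apiKey: "placeholder-key" },
};
const gaps = missingModalities(example); // only "video" is unconfigured
```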

Maintenance & Community

A Feishu (Lark) group is mentioned for community interaction, but direct links to community channels, roadmaps, or notable contributors are not provided in the README.

Licensing & Compatibility

The project is licensed under the Apache License 2.0, which is generally permissive for commercial use and integration into closed-source projects.

Limitations & Caveats

At version v0.1.0, the project is in an early stage of development. Full functionality depends on access to, and configuration of, external AI model APIs, and FFmpeg is a required dependency for video processing. The README is written primarily in Chinese, which suggests that development is oriented mainly toward Chinese-speaking users.

Health Check

  • Last Commit: 3 days ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 2
  • Issues (30d): 3
  • Star History: 811 stars in the last 28 days
