ai-comic-factory by jbilcke-hf

AI comic panel generator using LLM + SDXL

Created 2 years ago

1,281 stars

Top 30.7% on SourcePulse

Project Summary

This project provides a web interface for generating comic panels using a combination of Large Language Models (LLMs) for story and dialogue generation and Stable Diffusion XL (SDXL) for image rendering. It targets users who want to create custom AI-generated comics with a single prompt, offering flexibility in choosing LLM and rendering backends.

How It Works

The application orchestrates a multi-stage generation process. An LLM, configurable to use Hugging Face Inference API, Inference Endpoints, OpenAI, Groq, or Anthropic, generates a narrative and dialogue based on user input. This output then feeds into an SDXL model, which can be accessed via Hugging Face Inference API, Replicate, or a custom endpoint, to render the visual comic panels. This modular design allows users to leverage different AI providers for both text and image generation.

Quick Start & Requirements

Install/Run: Deployable via Docker (app_port: 3000). Local setup requires creating a .env.local file for configuration.
Prerequisites: API keys for chosen LLM and rendering services (Hugging Face, OpenAI, Groq, Anthropic, Replicate).
Resources: Requires configuration for LLM and rendering engines. Specific resource needs depend on the chosen backend models.
Links: Official website: aicomicfactory.app

Highlighted Details

Supports multiple LLM backends including Hugging Face Inference API/Endpoints, OpenAI, Groq, and Anthropic.
Offers flexibility in SDXL rendering via Hugging Face Inference API, Replicate, or custom endpoints.
Allows customization of LLM and SDXL models used.
Includes experimental support for Groq and Anthropic LLMs.

Maintenance & Community

Developed by jbilcke-hf, with mentions of community sharing features (though noted as not required for local deployment).
Funding is accepted via tips.
The project is open-source with multiple repositories linked.

Licensing & Compatibility

The README states the project is open-source but does not explicitly list a license.
Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is described as not being a monolithic, immediately runnable Space, requiring significant configuration for local deployment. Documentation for the custom "VideoChain" rendering API is not yet available.

ai-comic-factory by jbilcke-hf

Explore Similar Projects

OmniGen2 by VectorSpaceLab

LLMGA by JIA-Lab-research

UltraPixel by catcathh

NextCreator by MoonWeSif

FableForge by e-johnstonn

make-comics by Nutlope

DiffSensei by jianzongwu

RPG-DiffusionMaster by YangLing0818

OmniGen by VectorSpaceLab

story-flicks by alecm20

Omost by lllyasviel

presenton by presenton