ai-comic-factory  by jbilcke-hf

AI comic panel generator using LLM + SDXL

created 1 year ago
1,221 stars

Top 32.9% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a web interface for generating comic panels using a combination of Large Language Models (LLMs) for story and dialogue generation and Stable Diffusion XL (SDXL) for image rendering. It targets users who want to create custom AI-generated comics with a single prompt, offering flexibility in choosing LLM and rendering backends.

How It Works

The application orchestrates a multi-stage generation process. An LLM, configurable to use Hugging Face Inference API, Inference Endpoints, OpenAI, Groq, or Anthropic, generates a narrative and dialogue based on user input. This output then feeds into an SDXL model, which can be accessed via Hugging Face Inference API, Replicate, or a custom endpoint, to render the visual comic panels. This modular design allows users to leverage different AI providers for both text and image generation.

Quick Start & Requirements

  • Install/Run: Deployable via Docker (app_port: 3000). Local setup requires creating a .env.local file for configuration.
  • Prerequisites: API keys for chosen LLM and rendering services (Hugging Face, OpenAI, Groq, Anthropic, Replicate).
  • Resources: Requires configuration for LLM and rendering engines. Specific resource needs depend on the chosen backend models.
  • Links: Official website: aicomicfactory.app

Highlighted Details

  • Supports multiple LLM backends including Hugging Face Inference API/Endpoints, OpenAI, Groq, and Anthropic.
  • Offers flexibility in SDXL rendering via Hugging Face Inference API, Replicate, or custom endpoints.
  • Allows customization of LLM and SDXL models used.
  • Includes experimental support for Groq and Anthropic LLMs.

Maintenance & Community

  • Developed by jbilcke-hf, with mentions of community sharing features (though noted as not required for local deployment).
  • Funding is accepted via tips.
  • The project is open-source with multiple repositories linked.

Licensing & Compatibility

  • The README states the project is open-source but does not explicitly list a license.
  • Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is described as not being a monolithic, immediately runnable Space, requiring significant configuration for local deployment. Documentation for the custom "VideoChain" rendering API is not yet available.

Health Check
Last commit

5 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
43 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.