Qwen-Image-Lightning  by ModelTC

Distilled image generation model for faster inference

Created 2 months ago
899 stars

Top 40.4% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

Qwen-Image-Lightning offers distilled versions of the Qwen-Image model, designed to significantly accelerate image generation while retaining complex text rendering capabilities. This project is targeted at researchers and developers seeking faster, yet high-fidelity image synthesis.

How It Works

This project leverages knowledge distillation to create faster, lower-step versions of the Qwen-Image model. By training smaller, faster models to mimic the output of a larger teacher model, it achieves substantial speedups with minimal degradation in quality, particularly for text rendering tasks.

Quick Start & Requirements

  • Installation: Install huggingface_hub and download models using huggingface-cli download lightx2v/Qwen-Image-Lightning --local-dir ./Qwen-Image-Lightning.
  • Prerequisites: Requires Python environment and base Qwen-Image model setup as per the Qwen-Image project.
  • Usage: Evaluation scripts and ComfyUI workflows are provided. Example run commands for 8-step, 4-step, and base models are included.
  • ComfyUI: Workflows qwen-image-8steps.json and qwen-image-4steps.json are available, requiring the base model and LoRA weights in the models/loras/ directory.
  • Links: Qwen-Image (for base model setup), ComfyUI

Highlighted Details

  • Distilled models offer 8-step and 4-step generation options.
  • Preserves complex text rendering capabilities.
  • Includes evaluation scripts for comparing distilled vs. base models.
  • Provides verified ComfyUI workflows for seamless integration.

Maintenance & Community

The project has seen recent releases and updates, indicating active development. Links to community resources are not explicitly provided in the README.

Licensing & Compatibility

Models are licensed under the Apache 2.0 License. This license permits commercial use and linking with closed-source projects, with the caveat that users are responsible for ensuring their generated content complies with applicable laws and ethical guidelines.

Limitations & Caveats

The project acknowledges that neither the distilled nor the base Qwen-Image models consistently generate perfect results, and performance can vary across different prompts and resolutions. A "badcase" example of the distilled model's performance is noted.

Health Check
Last Commit

3 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
16
Star History
153 stars in the last 30 days

Explore Similar Projects

Starred by Jiaming Song Jiaming Song(Chief Scientist at Luma AI) and Yineng Zhang Yineng Zhang(Inference Lead at SGLang; Research Scientist at Together AI).

LightX2V by ModelTC

6.3%
741
Video generation inference framework for efficient synthesis
Created 7 months ago
Updated 9 hours ago
Feedback? Help us improve.