SkyPaint-AI-Diffusion by SkyWorkAIGC

Text-to-image model optimized from Stable Diffusion

Created 3 years ago

648 stars

Top 51.6% on SourcePulse

Project Summary

SkyPaint-AI-Diffusion offers an optimized text-to-image generation model based on Stable Diffusion, capable of producing high-quality images in modern art styles from both Chinese and English text prompts. It targets users who want AI art generation from bilingual prompts.

How It Works

The project comprises two main components: an optimized text encoder and a diffusion model. The text encoder, SkyCLIP, is a bilingual (Chinese/English) CLIP model obtained by distillation and trained on text data, which substantially reduces the data and compute needed to reproduce or fine-tune it. The diffusion model is fine-tuned from stable-diffusion-v1.5, with the tag 'sai-v1 art' added to training prompts so that the model associates the tag with the target styles and quality.
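The distillation idea can be pictured with a toy objective. The sketch below is an illustration under assumptions, not the SkyCLIP training code: it supposes a frozen teacher text encoder and a bilingual student that both map a sentence to a vector, and it uses mean squared error as the matching loss. The actual loss, encoders, and data pairing are not stated on this page, and the names `mse` and `distillation_loss` are hypothetical.

```python
from typing import Sequence

Vector = Sequence[float]


def mse(student: Vector, teacher: Vector) -> float:
    """Mean squared error between two embedding vectors of equal length."""
    if len(student) != len(teacher) or len(teacher) == 0:
        raise ValueError("embeddings must be non-empty and share a dimension")
    return sum((s - t) ** 2 for s, t in zip(student, teacher)) / len(teacher)


def distillation_loss(student_batch: Sequence[Vector],
                      teacher_batch: Sequence[Vector]) -> float:
    """Average per-sentence MSE over a batch of text embeddings.

    Assumption: row i of both batches embeds the same sentence (or a
    translation pair), so the student is pulled toward the teacher's space.
    """
    if len(student_batch) != len(teacher_batch) or len(teacher_batch) == 0:
        raise ValueError("batches must be non-empty and the same size")
    total = sum(mse(s, t) for s, t in zip(student_batch, teacher_batch))
    return total / len(teacher_batch)


# Toy 2-dimensional embeddings standing in for real encoder outputs.
teacher = [[1.0, 0.0], [0.0, 1.0]]
student = [[0.5, 0.0], [0.0, 1.0]]
loss = distillation_loss(student, teacher)
```

Because only text passes through both encoders, an objective of this shape needs no images, which is consistent with the reduced data requirement described above.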

Quick Start & Requirements

Install the diffusers library and load the model through it.
Requires a CUDA-enabled GPU (training used 16 A100 GPUs).
Example usage is provided in the README.
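A minimal sketch of what loading the model through diffusers might look like follows. It assumes the weights are available as a standard Stable Diffusion pipeline directory. The model path is a placeholder, and prefixing the prompt with 'sai-v1 art' at inference time is an assumption carried over from the training-prompt tag described above. `build_prompt` and `generate` are hypothetical helper names, so the project's README remains the reference for the exact usage.

```python
STYLE_TAG = "sai-v1 art"  # style tag named in the project description


def build_prompt(prompt: str, tag: str = STYLE_TAG) -> str:
    """Prefix the prompt with the style tag unless it already starts with it."""
    text = prompt.strip()
    if text.lower().startswith(tag.lower()):
        return text
    return f"{tag}, {text}"


def generate(prompt: str, model_path: str, device: str = "cuda"):
    """Load a Stable Diffusion pipeline and render one image for the prompt.

    model_path is a placeholder for wherever the SkyPaint weights live
    (a local directory or a hub identifier); it is not filled in here.
    """
    # Imported lazily so build_prompt works without diffusers installed.
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(model_path).to(device)
    return pipe(build_prompt(prompt)).images[0]
```

With the weights in place, a call such as `generate("城堡 大海 夕阳", "path/to/skypaint").save("castle.png")` would write one image, where `path/to/skypaint` stands in for the real location.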

Highlighted Details

Supports Chinese, English, and mixed-language prompts.
Generates images in modern art styles.
Compatible with stable_diffusion_1.x official models and fine-tuned variants.
SkyCLIP provides an efficient training method for bilingual CLIP models and is evaluated on Flickr30K-CN.

Maintenance & Community

Project is under continuous development and optimization.
WeChat QR code provided for joining a developer group.

Licensing & Compatibility

License: CreativeML Open RAIL-M.
The license permits commercial use, subject to the use-based restrictions it lists.

Limitations & Caveats

The authors describe the model as still being optimized and expect later releases to be more stable.

Health Check

Last commit: 2 years ago
Responsiveness: Inactive
Pull requests (30d): 0
Issues (30d): 0

Star History

1 star in the last 30 days
