catvton-flux by nftblackmagic

Virtual try-on solution combining diffusion models with inpainting

Created 1 year ago

607 stars

Top 53.9% on SourcePulse

Project Summary

This repository provides CatVTON-Flux, a state-of-the-art virtual try-on solution combining CATVTON with the Flux inpainting model for realistic clothing transfer. It targets researchers and developers in computer vision and fashion technology, offering improved garment detail and text rendering.

How It Works

CatVTON-Flux leverages a concatenation-based approach, inspired by In-Context LoRA, for prompt engineering. It utilizes the Flux fill inpainting model for enhanced realism and accuracy in clothing transfer. The system can also extract and reconstruct garment fronts from images of people wearing them (try-off functionality).

Quick Start & Requirements

Install: pip install -r requirements.txt within a conda environment (Python 3.10 recommended).
Prerequisites: VRAM >= 40GB is required; 80GB recommended for training. The model is trained on the VITON-HD dataset.
Demo: Run python app.py for Gradio demo with LoRA weights.
Docs: CATVTON-FLUX-TRY-ON, CAT-TRYOFF-FLUX

Highlighted Details

Achieved SOTA FID: 5.59 on VITON-HD dataset.
Released LoRA weights with FID: 6.07 on VITON-HD.
Offers both try-on and try-off (garment reconstruction) capabilities.
Training code for Flux inpainting full parameter fine-tuning is available.

Maintenance & Community

The project has seen recent updates, including new try-on and try-off models, training notes, and ComfyUI support. Links to Hugging Face spaces for demos are provided.

catvton-flux by nftblackmagic

Explore Similar Projects

LinFusion by Huage001

ComfyUI-OmniGen by 1038lab

Lumina-Image-2.0 by Alpha-VLLM

flymyai-lora-trainer by FlyMyAI

BLIP3o by JiuhaiChen

ACE_plus by ali-vilab

CatVTON by Zheng-Chong

DemoFusion by PRIS-CV

img2img-turbo by GaParmar

sd-webui-EasyPhoto by aigc-apps

Auto-Photoshop-StableDiffusion-Plugin by AbdullahAlfaraj

guided-diffusion by openai