Virtual try-on solution combining diffusion models with inpainting
Top 56.4% on SourcePulse
This repository provides CatVTON-Flux, a state-of-the-art virtual try-on solution combining CATVTON with the Flux inpainting model for realistic clothing transfer. It targets researchers and developers in computer vision and fashion technology, offering improved garment detail and text rendering.
How It Works
CatVTON-Flux leverages a concatenation-based approach, inspired by In-Context LoRA, for prompt engineering. It utilizes the Flux fill inpainting model for enhanced realism and accuracy in clothing transfer. The system can also extract and reconstruct garment fronts from images of people wearing them (try-off functionality).
Quick Start & Requirements
pip install -r requirements.txt
within a conda
environment (Python 3.10 recommended).python app.py
for Gradio demo with LoRA weights.Highlighted Details
Maintenance & Community
The project has seen recent updates, including new try-on and try-off models, training notes, and ComfyUI support. Links to Hugging Face spaces for demos are provided.
Licensing & Compatibility
The code is licensed under the MIT License. Model weights inherit licenses from Flux Fill and VITON-HD.
Limitations & Caveats
High VRAM requirements (>= 40GB) are noted as a potential barrier. Training requires significant resources (2xH100 80GB).
5 months ago
Inactive