Discover and explore top open-source AI tools and projects—updated daily.
aimagelabAI model for fashion image editing via multimodal prompts
Top 67.8% on SourcePulse
This repository provides the official implementation for "Multimodal Garment Designer," a novel approach to fashion image editing using human-centric latent diffusion models. It enables fashion image generation and editing guided by multimodal prompts like text, body poses, and garment sketches, targeting fashion designers and researchers in computer vision and AI for fashion.
How It Works
The project leverages latent diffusion models, a technique not previously applied to fashion image editing. It introduces a new architecture designed to handle multimodal conditioning, allowing for precise control over generated fashion imagery. The approach is advantageous for its ability to integrate diverse input modalities for realistic and coherent fashion image manipulation.
Quick Start & Requirements
git clone https://github.com/aimagelab/multimodal-garment-designer followed by conda env create -n mgd -f environment.yml and conda activate mgd.python src/eval.py --dataset_path <path> --batch_size <int> --mixed_precision fp16 --output_dir <path> --save_name <string> --num_workers_test <int> --sketch_cond_rate 0.2 --dataset <dresscode|vitonhd> --start_cond_rate 0.0 --test_order <paired|unpaired>Highlighted Details
torch.hub.MGDPipe integrating the MGD denoising UNet with standard diffusers components.Maintenance & Community
The project is associated with ICCV 2023. Further work on multimodal fashion image editing is available in a more recent publication. No specific community channels (Discord/Slack) or roadmap are mentioned in the README.
Licensing & Compatibility
Licensed under Creative Commons BY-NC 4.0. This license permits redistribution and adaptation for non-commercial purposes only, requiring appropriate credit and indication of changes.
Limitations & Caveats
The repository explicitly states that training code is a future TODO item. The license restricts commercial use, which may limit adoption in commercial product development.
1 year ago
Inactive
orpatashnik