PyTorch image-to-image translation for multiple domains (CVPR 2020)
Top 13.7% on sourcepulse
StarGAN v2 provides a PyTorch implementation for diverse image synthesis across multiple domains, addressing limitations in existing image-to-image translation models. It's designed for researchers and practitioners in computer vision and generative AI who need scalable and high-quality image translation capabilities.
How It Works
StarGAN v2 employs a single generator and discriminator architecture capable of translating images between multiple domains. It utilizes a mapping network to generate latent codes that control style variations and a style encoder to extract style information from reference images. This approach enables diverse image generation and efficient scalability across numerous domains within a unified framework.
Quick Start & Requirements
download.sh
.Highlighted Details
Maintenance & Community
The project is an official implementation from Clova AI (NAVER AI Lab). While there's no explicit mention of ongoing maintenance or community channels like Discord/Slack, the project is associated with a CVPR 2020 paper.
Licensing & Compatibility
The source code, pre-trained models, and dataset are released under the Creative Commons BY-NC 4.0 license. This license permits non-commercial use, modification, and distribution, provided appropriate credit is given and changes are indicated. Commercial use requires contacting clova-jobs@navercorp.com.
Limitations & Caveats
The project requires specific older versions of PyTorch (1.4.0) and CUDA (10.0), which may pose compatibility challenges with newer hardware and software stacks. The non-commercial license restricts its use in commercial products.
2 years ago
Inactive