informative-drawings  by carolineec

Line drawing generation via unpaired image translation (CVPR 2022)

created 3 years ago
389 stars

Top 74.9% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a PyTorch implementation for generating line drawings from unpaired image data, focusing on conveying geometry and semantics. It is targeted at researchers and practitioners in computer vision and graphics interested in controllable image synthesis and style transfer. The key benefit is the ability to create informative line drawings that capture essential structural and semantic information from input images.

How It Works

The approach leverages a Generative Adversarial Network (GAN) architecture, specifically adapted from pix2pixHD and CycleGAN. It learns to map image features to geometric representations (depth maps) and then uses these to generate line drawings. This method allows for unpaired training, meaning the model doesn't require perfectly aligned pairs of photos and drawings, making it more flexible and scalable.

Quick Start & Requirements

  • Install dependencies using conda env create -f environment.yml and activate with conda activate drawings.
  • CLIP installation for training: pip install git+https://github.com/openai/CLIP.git.
  • PyTorch 1.7.1 is required.
  • Pre-trained model weights should be placed in the checkpoints directory.
  • To run the pre-trained model: python test.py --name anime_style --dataroot examples/test.
  • Official project page, paper, video, and demo links are provided in the README.

Highlighted Details

  • Generates line drawings that convey geometry and semantics.
  • Utilizes an unpaired image-to-image translation framework.
  • Adapted from established GAN architectures (pix2pixHD, CycleGAN).
  • Requires depth maps for training, which can be generated using external models.

Maintenance & Community

The project is associated with CVPR 2022 and cites academic work, indicating a research-oriented origin. No specific community channels or active maintenance indicators are present in the README.

Licensing & Compatibility

The README does not explicitly state a license. The code is adapted from pix2pixHD and pytorch-CycleGAN-and-pix2pix, which are typically released under permissive licenses like MIT. However, users should verify the exact licensing terms.

Limitations & Caveats

The project requires specific versions of PyTorch (1.7.1) and may have compatibility issues with newer versions. Generating high-quality depth maps for training is a prerequisite, which adds an extra step to the workflow.

Health Check
Last commit

11 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
13 stars in the last 90 days

Explore Similar Projects

Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley), and
2 more.

glide-text2im by openai

0.1%
4k
Text-conditional image synthesis model from research paper
created 3 years ago
updated 1 year ago
Feedback? Help us improve.