informative-drawings by carolineec

Line drawing generation via unpaired image translation (CVPR 2022)

Created 3 years ago

421 stars

Top 69.8% on SourcePulse

Project Summary

This project provides a PyTorch implementation for generating line drawings from unpaired image data, focusing on conveying geometry and semantics. It is targeted at researchers and practitioners in computer vision and graphics interested in controllable image synthesis and style transfer. The key benefit is the ability to create informative line drawings that capture essential structural and semantic information from input images.

How It Works

The approach leverages a Generative Adversarial Network (GAN) architecture, specifically adapted from pix2pixHD and CycleGAN. It learns to map image features to geometric representations (depth maps) and then uses these to generate line drawings. This method allows for unpaired training, meaning the model doesn't require perfectly aligned pairs of photos and drawings, making it more flexible and scalable.

Quick Start & Requirements

Install dependencies using conda env create -f environment.yml and activate with conda activate drawings.
CLIP installation for training: pip install git+https://github.com/openai/CLIP.git.
PyTorch 1.7.1 is required.
Pre-trained model weights should be placed in the checkpoints directory.
To run the pre-trained model: python test.py --name anime_style --dataroot examples/test.
Official project page, paper, video, and demo links are provided in the README.

Highlighted Details

Generates line drawings that convey geometry and semantics.
Utilizes an unpaired image-to-image translation framework.
Adapted from established GAN architectures (pix2pixHD, CycleGAN).
Requires depth maps for training, which can be generated using external models.

Maintenance & Community

The project is associated with CVPR 2022 and cites academic work, indicating a research-oriented origin. No specific community channels or active maintenance indicators are present in the README.

Licensing & Compatibility

The README does not explicitly state a license. The code is adapted from pix2pixHD and pytorch-CycleGAN-and-pix2pix, which are typically released under permissive licenses like MIT. However, users should verify the exact licensing terms.

Limitations & Caveats

The project requires specific versions of PyTorch (1.7.1) and may have compatibility issues with newer versions. Generating high-quality depth maps for training is a prerequisite, which adds an extra step to the workflow.

informative-drawings by carolineec

Explore Similar Projects

PointCLIP_V2 by yangyangyang127

ShapeLLM-Omni by JAMESYJL

LayoutGPT by weixi-feng

qwen2vl-flux by erwold

Clip-Forge by AutodeskAILab

FlipSketch by hmrishavbandy

Fantasia3D by Gorilla-Lab-SCUT

Awesome-Sketch-Synthesis by MarkMoHR

MiniGPT-5 by eric-ai-lab

Coherent-Line-Drawing by SSARCandy

CLIPasso by yael-vinker

Qwen-Image by QwenLM