SVGDreamer by ximinng

Research paper for text-guided SVG generation using diffusion

Created 2 years ago

434 stars

Top 68.6% on SourcePulse

1 Expert Loves This Project

bcherny

Creator of Claude Code; MTS at Anthropic

Project Summary

SVGDreamer is a CVPR 2024 paper implementing a diffusion-based approach for text-guided SVG generation. It targets researchers and artists seeking to synthesize high-quality vector graphics from textual descriptions, offering control over style and editing capabilities.

How It Works

SVGDreamer utilizes a diffusion model to generate SVG paths. It employs a two-stage process: first, a Sketch-Inference-and-Vector-Editing (SIVE) stage for initial shape generation and refinement, followed by a Vector-SVG-Path-Diffusion (VPSD) stage to produce the final SVG output. This approach aims to balance synthesis quality with vector graphic editing potential.

Quick Start & Requirements

Installation: Run bash script/install.sh or use the provided Docker script bash script/run_svgdreamer_docker.sh.
Prerequisites: Requires a pretrained Stable Diffusion model (e.g., Stable Diffusion 2.1 Base). The model can be auto-downloaded by setting diffuser.download=True in conf/config.yaml.
Resources: enable_xformers=True is recommended for faster optimization. state.mprec='fp16' can reduce GPU memory usage.
Documentation: Examples.md

Highlighted Details

Supports multiple generation styles including iconography, painting, pixel art, low-poly, sketch, and ink/wash.
Offers control over generation through various parameters like skip_sive, token_ind, result_path, and x.vpsd.t_schedule.
Includes a newer version, SVGDreamer++, with enhanced visual representation and editing capabilities.

Maintenance & Community

The project is associated with the CVPR 2024 paper. Links to community channels or roadmaps are not explicitly provided in the README.

Licensing & Compatibility

License: MIT License.
Compatibility: Permissive license suitable for commercial use and integration with closed-source projects.

Limitations & Caveats

The README mentions a "TODO" list, indicating ongoing development. Specific limitations or known bugs are not detailed.

Health Check

Last Commit

10 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

3 stars in the last 30 days

Explore Similar Projects

BoxDiff by showlab

Text-to-image synthesis research paper using box-constrained diffusion

Created 2 years ago

Updated 1 year ago

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind) and

Patrick von Platen

Patrick von Platen(Author of Hugging Face Diffusers; Research Engineer at Mistral).

mixture-of-diffusers by albarji

Image generation method for scene composition using multiple diffusion processes

Created 3 years ago

Updated 2 years ago

PyTorch-SVGRender by ximinng

PyTorch library for differentiable SVG rendering methods

Created 2 years ago

Updated 1 year ago

AutoFigure-Edit by ResearAI

Generating and refining publication-ready scientific illustrations

Created 3 weeks ago

Updated 6 days ago

semantic-draw by ironjr

Interactive content creation from image diffusion models

Created 2 years ago

Updated 8 months ago

Starred by

Chaoyu Yang

Chaoyu Yang(Founder of Bento),

Georgios Konstantopoulos

Georgios Konstantopoulos(CTO, General Partner at Paradigm), and

1 more.

long_stable_diffusion by sharonzhou

AI pipeline for long-form text-to-image generation

Created 3 years ago

Updated 3 years ago

Starred by

Patrick von Platen

Patrick von Platen(Author of Hugging Face Diffusers; Research Engineer at Mistral),

Chenlin Meng

Chenlin Meng(Cofounder of Pika), and

2 more.

clip-guided-diffusion by afiaka87

CLI tool for text-to-image generation using CLIP-guided diffusion

Created 4 years ago

Updated 1 month ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind).

RPG-DiffusionMaster by YangLing0818

Training-free paradigm for text-to-image generation/editing

Created 2 years ago

Updated 1 year ago

pytorch-stable-diffusion by hkproj

PyTorch code for Stable Diffusion image generation

Created 2 years ago

Updated 1 year ago

Starred by

Alberto Taiuti

Alberto Taiuti(Cofounder of Luma AI) and

Saining Xie

Saining Xie(Professor at NYU).

zero123 by cvlab-columbia

Research paper for zero-shot one image to 3D object generation

Created 2 years ago

Updated 2 years ago

Starred by

Stella Rose Biderman

Stella Rose Biderman(Executive Director at EleutherAI),

Travis Fischer

Travis Fischer(Founder of Agentic), and

2 more.

StableCascade by Stability-AI

Image generation model using cascaded diffusion

Created 2 years ago

Updated 1 year ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity),

Benjamin Bolte

Benjamin Bolte(Cofounder of K-Scale Labs), and

13 more.

latent-diffusion by CompVis

Image synthesis research paper using latent diffusion models

Created 4 years ago

Updated 2 years ago

Feedback? Help us improve.