glide-text2im  by openai

Text-conditional image synthesis model from research paper

created 3 years ago
3,651 stars

Top 13.5% on sourcepulse

GitHubView on GitHub
Project Summary

GLIDE is an open-source implementation of a diffusion-based text-to-image synthesis model, offering capabilities for generating photorealistic images from text prompts and performing image inpainting. It is targeted at researchers and developers interested in state-of-the-art generative AI models.

How It Works

GLIDE utilizes a diffusion model architecture, a class of generative models that learn to reverse a noise-adding process. This approach allows for high-quality image generation by progressively denoising a random noise input, guided by text conditioning. The model employs classifier-free guidance, a technique that enhances the adherence of generated images to the input text prompts without requiring a separate classifier.

Quick Start & Requirements

  • Install via pip install -e . after cloning the repository.
  • Requires Python and PyTorch. Specific version requirements are not detailed in the README.
  • Refer to the notebooks directory for usage examples, including text-to-image generation and inpainting.

Highlighted Details

  • Implements the small, filtered-data GLIDE model.
  • Supports text-guided image generation and image inpainting.
  • Includes a CLIP-guided generation notebook.

Maintenance & Community

This repository is maintained by OpenAI. No specific community channels or roadmap details are provided in the README.

Licensing & Compatibility

The repository's license is not specified in the provided README text.

Limitations & Caveats

The README focuses on the "small, filtered-data" version of GLIDE, implying larger or differently trained versions may exist with different capabilities or requirements. Specific hardware or software dependencies beyond Python and PyTorch are not detailed.

Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
36 stars in the last 90 days

Explore Similar Projects

Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), and
3 more.

guided-diffusion by openai

0.2%
7k
Image synthesis codebase for diffusion models
created 4 years ago
updated 1 year ago
Feedback? Help us improve.