CLI tool for text-to-image generation
Big Sleep is a command-line tool and Python library for generating images from text prompts using OpenAI's CLIP and a BigGAN. It's designed for users with GPUs who want to experiment with text-to-image synthesis through a simple interface.
How It Works
The tool uses CLIP to interpret the text prompt and steer a BigGAN generator toward images that match it. The result is a form of "dreaming": visuals emerge from a natural language description, which makes the tool a simple way to explore AI-generated art.
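The guidance idea can be illustrated with a toy sketch in plain Python. This is not Big Sleep's code: `toy_image_embedding` is a hypothetical fixed linear map standing in for "generate an image with BigGAN, then embed it with CLIP", the vectors are made up, and the gradient is estimated with finite differences, whereas a real implementation would presumably rely on automatic differentiation. It only shows the loop of scoring a candidate against the text embedding and nudging the latent to raise that score.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def toy_image_embedding(latent, weights):
    """Hypothetical stand-in for generator + image encoder: a fixed linear map."""
    return [sum(w * z for w, z in zip(row, latent)) for row in weights]


def optimize_latent(text_embedding, weights, latent, steps=100, lr=0.2, eps=1e-6):
    """Gradient ascent on the similarity score.

    Returns the final latent and the score recorded before each step,
    plus the score after the last step.
    """
    latent = list(latent)
    history = []
    for _ in range(steps):
        score = cosine(toy_image_embedding(latent, weights), text_embedding)
        history.append(score)
        # Forward finite-difference estimate of d(score)/d(latent[i]).
        grad = []
        for i in range(len(latent)):
            bumped = list(latent)
            bumped[i] += eps
            bumped_score = cosine(toy_image_embedding(bumped, weights), text_embedding)
            grad.append((bumped_score - score) / eps)
        # Move the latent in the direction that raises the score.
        latent = [z + lr * g for z, g in zip(latent, grad)]
    history.append(cosine(toy_image_embedding(latent, weights), text_embedding))
    return latent, history


# Made-up numbers for illustration only (4-d latent, 3-d embedding).
WEIGHTS = [
    [1.0, 0.0, 0.0, 0.5],
    [0.0, 1.0, 0.0, -0.5],
    [0.0, 0.0, 1.0, 0.5],
]
TEXT_EMBEDDING = [0.0, 1.0, 0.0]
START_LATENT = [1.0, 0.0, 0.0, 0.0]

if __name__ == "__main__":
    _, history = optimize_latent(TEXT_EMBEDDING, WEIGHTS, START_LATENT)
    print(f"similarity: {history[0]:.3f} -> {history[-1]:.3f}")
```

The score starts at zero because the starting embedding is orthogonal to the target, then climbs as the latent is updated. In the real tool the latent would feed a full image generator rather than a small matrix.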
Quick Start & Requirements
pip install big-sleep
dream "a pyramid made of ice"
Highlighted Details
A larger model option (--larger-model) for potentially improved generations.
An option to keep the best result (--save-best).
Maintenance & Community
The project is based on work by Ryan Murdock and is available on GitHub. Links to original and simplified notebooks are provided.
Licensing & Compatibility
The repository does not explicitly state a license in the provided README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The README mentions that Big Sleep can sometimes steer off-manifold into noise due to the class-conditioned nature of the GAN. The --max-classes flag is suggested for stability at the cost of expressivity.
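For scripted runs, the flags named on this page can be assembled programmatically. The helper below is a minimal sketch: `build_dream_command` is a hypothetical name, and it assumes that --max-classes takes an integer value while --larger-model and --save-best are plain switches. It only builds the argument list and does not invoke the tool.

```python
import shlex


def build_dream_command(prompt, larger_model=False, save_best=False, max_classes=None):
    """Assemble an argv list for the `dream` CLI (hypothetical helper)."""
    argv = ["dream", prompt]
    if larger_model:
        argv.append("--larger-model")
    if save_best:
        argv.append("--save-best")
    if max_classes is not None:
        # Assumption: the flag expects an integer number of classes.
        argv += ["--max-classes", str(int(max_classes))]
    return argv


if __name__ == "__main__":
    cmd = build_dream_command("a pyramid made of ice", save_best=True, max_classes=15)
    # Print a shell-safe version of the command for inspection.
    print(shlex.join(cmd))
```

The returned list can be handed to `subprocess.run(cmd, check=True)` on a machine where the package is installed. The value 15 is an arbitrary example, not a recommended setting.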
3 years ago
Inactive