UltraPixel  by catcathh

Research paper implementation for ultra-high-resolution image synthesis

created 1 year ago
601 stars

Top 55.2% on sourcepulse

GitHubView on GitHub
Project Summary

UltraPixel implements a novel approach to ultra-high-resolution image synthesis, targeting researchers and practitioners in generative AI who need to produce highly detailed images. It builds upon StableCascade to achieve state-of-the-art quality and resolution.

How It Works

UltraPixel leverages a cascaded diffusion model architecture, similar to StableCascade, but introduces specific optimizations for ultra-high-resolution synthesis. The system likely employs a multi-stage generation process where lower-resolution outputs are progressively upscaled and refined, enabling the creation of images with exceptional detail and coherence at resolutions exceeding typical diffusion model capabilities. This staged approach allows for efficient memory management and computational scaling.

Quick Start & Requirements

  • Install dependencies: pip install -r requirements.txt
  • Download pre-trained models from StableCascade and UltraPixel.
  • Requires PyTorch and Torchvision.
  • Inference examples provided via app.py (Gradio) and inference/test_t2i.py.
  • Tiled decoding is recommended for memory saving on lower-VRAM GPUs.
  • See Project Page for more details and demos.

Highlighted Details

  • Accepted to NeurIPS 2024.
  • Achieves ~60 seconds for 2560x5120 generation on RTX 4090.
  • Supports text-to-image, personalized image generation (cat example), and ControlNet integration.
  • Detailed memory and performance benchmarks provided for various GPUs (A100, V100, RTX 4090) and resolutions.

Maintenance & Community

  • Project is built upon StableCascade and Trans-inr.
  • Contact information for authors is available on the project page.

Licensing & Compatibility

  • The repository does not explicitly state a license. It is built upon StableCascade, which is typically released under a permissive license, but this should be verified for UltraPixel specifically.

Limitations & Caveats

  • ControlNet integration is limited to 4K resolution.
  • Performance and memory requirements vary significantly with resolution and GPU, with some configurations potentially leading to Out-Of-Memory (OOM) errors without tiled decoding.
Health Check
Last commit

10 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
10 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.