High-resolution image generation research paper
DemoFusion provides a framework for generating high-resolution images using existing Latent Diffusion Models (LDMs), aiming to democratize access to advanced AI image generation. It is designed for researchers and users interested in pushing the boundaries of image resolution without requiring extensive computational resources for training.
How It Works
DemoFusion extends LDMs with three core mechanisms: Progressive Upscaling, Skip Residual, and Dilated Sampling. Together they produce higher-resolution outputs by iteratively refining the image at progressively larger scales. Because generation is progressive, the lower-resolution intermediate results also serve as "previews", which enables rapid prompt iteration.
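The three mechanisms can be illustrated with a small, dependency-free sketch. It is not the project's implementation: the function names, the integer-multiple resolution schedule, and the cosine-decay weight are assumptions made for illustration, and plain nested lists stand in for latent tensors.

```python
import math


def resolution_schedule(base, scale):
    """Progressive Upscaling (assumed schedule): visit base, 2*base, ..., scale*base."""
    return [base * s for s in range(1, scale + 1)]


def skip_residual_weight(step, total_steps, alpha=1.0):
    """Skip Residual (assumed form): a weight that decays from 1 to 0 along a cosine.

    Early steps lean on the upsampled lower-resolution result; later steps
    lean on the freshly denoised latent.
    """
    return ((1 + math.cos(math.pi * step / total_steps)) / 2) ** alpha


def dilated_views(grid, stride):
    """Dilated Sampling: split a 2D grid into stride*stride strided sub-grids.

    Each sub-grid covers the whole image at a coarser spacing, so it acts as
    a global view. Assumes both grid dimensions are divisible by stride.
    """
    return {
        (dy, dx): [row[dx::stride] for row in grid[dy::stride]]
        for dy in range(stride)
        for dx in range(stride)
    }


def merge_views(views, stride):
    """Inverse of dilated_views: interleave the sub-grids back into one grid."""
    sub_h = len(views[(0, 0)])
    sub_w = len(views[(0, 0)][0])
    out = [[None] * (sub_w * stride) for _ in range(sub_h * stride)]
    for (dy, dx), sub in views.items():
        for i, row in enumerate(sub):
            for j, value in enumerate(row):
                out[dy + i * stride][dx + j * stride] = value
    return out


if __name__ == "__main__":
    grid = [[r * 4 + c for c in range(4)] for r in range(4)]
    views = dilated_views(grid, 2)
    print(resolution_schedule(1024, 3))
    print(views[(0, 0)])
    print(merge_views(views, 2) == grid)
```

In a real pipeline each dilated view would be denoised by the LDM before merging, and the merged result would be blended with the upsampled previous-scale latent using a weight like the one above.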
Quick Start & Requirements
pip install -r requirements.txt
Highlighted Details
Maintenance & Community
The project was accepted to CVPR 2024. Community contributions have led to ComfyUI and Replicate integrations, as well as ControlNet and low-VRAM implementations.
Licensing & Compatibility
The repository does not explicitly state a license in the README. Users should verify licensing for commercial or closed-source use.
Limitations & Caveats
The Image2Image functionality is reported to be strongly biased toward SDXL's training data. The default hyper-parameters are recommended as a starting point but may not be optimal for every use case.