Research paper for real-world image super-resolution using diffusion prior
Top 19.1% on sourcepulse
StableSR addresses real-world image super-resolution by leveraging diffusion models, offering a powerful solution for researchers and users seeking high-fidelity upscaling. It aims to improve image quality beyond traditional methods by integrating diffusion priors into the super-resolution process.
How It Works
StableSR utilizes a diffusion model as a prior to guide the super-resolution process. It works by generating high-resolution images conditioned on low-resolution inputs, effectively "hallucinating" plausible details. This approach allows for more realistic and artifact-free results compared to methods relying solely on interpolation or single-stage generative models. The use of diffusion models enables arbitrary upscaling factors and supports advanced features like negative prompts for finer control.
Quick Start & Requirements
environment.yaml
, install xformers
, taming-transformers
, clip
, and the package itself.xformers
(0.0.16) is optional but recommended.Highlighted Details
Maintenance & Community
The project is actively maintained, with recent updates including SD-Turbo support and ComfyUI integration. Links to demos on Hugging Face, OpenXLab, and Replicate are provided. Contact is available via email.
Licensing & Compatibility
Licensed under NTU S-Lab License 1.0. Redistribution and use must follow this license.
Limitations & Caveats
Testing on arbitrary sizes without tiling requires over 10GB GPU memory. Tiled inference for arbitrary sizes requires at least 18GB, with potential border artifacts depending on tile size and stride. FaceSR requires pre-generated reference images from models like CodeFormer.
1 year ago
1 day