DyPE by guyyariv

Diffusion models for ultra-high resolution image synthesis

Created 4 months ago

339 stars

Top 81.7% on SourcePulse

Project Summary

DyPE (Dynamic Position Extrapolation) enables pre-trained diffusion transformers to generate ultra-high-resolution images far beyond their training scale. It dynamically adjusts positional encodings during denoising to match evolving frequency content, achieving faithful 4K × 4K results without retraining or extra sampling cost. This is beneficial for users needing to scale image generation beyond standard resolutions efficiently.

How It Works

The core approach involves dynamically adjusting positional encodings during the diffusion model's denoising process. This dynamic adjustment allows the model to adapt to evolving frequency content, enabling it to extrapolate to resolutions far exceeding its training data scale. This method is advantageous as it avoids the need for retraining or additional sampling steps, making high-resolution generation efficient.

Quick Start & Requirements

Installation: Create a conda environment (conda create -n dype python=3.10, conda activate dype) and install dependencies (pip install -r requirements.txt).
Prerequisites: Python 3.10.
Usage: python run_dype.py --prompt "Your text prompt here". Key arguments include --height, --width, --steps, --seed, --method (yarn, ntk, or base), and --no_dype.
Links: Project Page: https://noamissachar.github.io/DyPE/, arXiv Paper: https://arxiv.org/abs/2510.20766.

Highlighted Details

Enables generation of ultra-high-resolution images (e.g., 4K × 4K) from pre-trained diffusion transformers.
Achieves high-resolution output without retraining or extra sampling cost.
Dynamically adjusts positional encodings during denoising to match evolving frequency content.
Supports different position encoding methods: yarn, ntk, and base.

Maintenance & Community

No specific details on contributors, community channels, or roadmap are provided in the README.

Licensing & Compatibility

The work is patent pending. For commercial use or licensing inquiries, users must contact the authors. This implies that standard open-source licensing does not apply, and commercial use requires explicit permission.

Limitations & Caveats

The primary caveat is the patent-pending status and the requirement to contact authors for commercial use, which could be an adoption blocker for commercial projects. The README does not explicitly state other limitations like unsupported platforms or known bugs.

DyPE by guyyariv

Explore Similar Projects

DC-Gen by dc-ai-projects

diffusion-4k by zhang0jhon

DatasetDM by showlab

LinFusion by Huage001

TaylorSeer by Shenyi-Z

ComfyUI-DyPE by wildminder

DiffPIR by yuanzhi-zhu

HashNeRF-pytorch by yashbhalgat

pytorch-pretrained-BigGAN by huggingface

Sana by NVlabs

consistency_models by openai

guided-diffusion by openai