Fine-tuning method for text-to-image diffusion models
Top 91.2% on sourcepulse
AlignProp offers a more efficient method for fine-tuning text-to-image diffusion models to align with specific reward functions, such as aesthetic quality or semantic accuracy. Targeting researchers and practitioners working with large diffusion models, it provides a computationally and sample-efficient alternative to reinforcement learning approaches like PPO.
How It Works
AlignProp utilizes direct reward backpropagation through the diffusion model's denoising process. To manage memory constraints, it fine-tunes low-rank adapter weight modules and employs gradient checkpointing. This approach allows for end-to-end optimization against differentiable reward functions, simplifying the alignment process compared to RL methods.
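The core idea, backpropagating a differentiable reward through an unrolled denoising chain, can be illustrated with a minimal sketch. This is not the repository's actual code: the toy denoiser stands in for a LoRA-adapted UNet, the reward is a placeholder for an aesthetic or HPSv2 scorer, and the update rule is a simplified scheduler step.

```python
# Toy sketch of AlignProp-style reward backpropagation (assumptions:
# ToyDenoiser replaces the real UNet, reward() replaces an aesthetic model).
import torch
import torch.nn as nn

class ToyDenoiser(nn.Module):
    """Stand-in for a (LoRA-adapted) UNet: predicts a denoising update for x_t."""
    def __init__(self, dim=8):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(), nn.Linear(32, dim))

    def forward(self, x):
        return self.net(x)

def reward(x):
    # Differentiable surrogate reward; here we simply reward samples
    # close to the origin, in place of a learned aesthetic score.
    return -x.pow(2).mean()

torch.manual_seed(0)
model = ToyDenoiser()
opt = torch.optim.Adam(model.parameters(), lr=1e-2)

K = 4  # number of denoising steps to backprop through (K=1 saves memory)
losses = []
for step in range(50):
    x = torch.randn(16, 8)          # start from pure noise
    for _ in range(K):              # unrolled denoising chain
        x = x - 0.1 * model(x)      # simplified update; real code uses a scheduler
    loss = -reward(x)               # maximize reward end-to-end
    opt.zero_grad()
    loss.backward()                 # gradients flow through all K steps
    opt.step()
    losses.append(loss.item())
```

In the real setting the chain is far longer, which is why the method relies on low-rank adapters and gradient checkpointing (or truncating to K=1) to keep the backward pass within memory.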
Quick Start & Requirements
Create a conda environment (`conda create -n alignprop python=3.10`) and install dependencies (`pip install -r requirements.txt`). For memory-constrained setups, reduce `train_batch_size` or use K=1. Launch scripts `aesthetic.sh` and `hps.sh` are provided for the aesthetic and HPSv2 reward models, respectively; variants for memory-constrained environments (`_k1.sh`) are also available.
Highlighted Details
Maintenance & Community
The codebase is built upon DDPO. No specific community channels or roadmap are detailed in the README.
Licensing & Compatibility
The repository does not explicitly state a license. Without one, default copyright applies, which may restrict commercial use or closed-source redistribution.
Limitations & Caveats
The project is presented as an official implementation of a research paper, suggesting it may be experimental. Specific details on long-term maintenance or community support are not provided.