MixGRPO by Tencent-Hunyuan

Enhancing generative model efficiency with mixed ODE-SDE

Created 7 months ago

1,104 stars

Top 34.4% on SourcePulse

Project Summary

Summary

MixGRPO enhances flow-based Generative Reward Policy Optimization (GRPO) efficiency using a novel mixed Ordinary Differential Equation (ODE) and Stochastic Differential Equation (SDE) approach. Targeting researchers and practitioners in generative AI, it aims to improve performance and unlock new capabilities, particularly in text-to-image generation tasks.

How It Works

The project employs a hybrid ODE-SDE formulation to optimize flow-based GRPO. This strategy combines deterministic ODE modeling with stochastic SDEs, aiming for more effective and faster policy optimization in generative models. The specific architecture and data flow are detailed in the associated paper.

Quick Start & Requirements

Installation: Python 3.12 via Conda (conda create -n MixGRPO python=3.12). System dependencies include pdsh, pssh, mesa-libGL (CentOS), and env_setup.sh.
Prerequisites: Hugging Face CLI (login required), Weights & Biases (WandB) key. Requires downloading FLUX.1-dev, HPS-v2.1, ImageReward, Pick Score, and CLIP Score reward models.
Hardware: Training supports multi-node setups (e.g., 4 nodes, 32 GPUs) using pdsh and torchrun. Inference/evaluation use single-node scripts.
Links: Paper: https://arxiv.org/abs/2507.21802. FLUX Model: black-forest-labs/FLUX.1-dev. HPSv2 Code: https://github.com/tgxs002/HPSv2.git. MixGRPO Weights: tulvgengenr/MixGRPO.

Highlighted Details

Novel mixed ODE-SDE approach for flow-based GRPO efficiency.
Supports multi-reward fine-tuning (HPSv2, ImageReward, Pick Score) on FLUX.1 Dev.
Provides scripts for data preprocessing, multi-node training, inference, and evaluation.

Maintenance & Community

No specific maintenance details, community channels, or roadmap links are provided in the README. The project is associated with Tencent Hunyuan.

Licensing & Compatibility

License: "License Terms of MixGRPO" (details in ./License.txt). Specific terms require consulting the License.txt file.
Compatibility: No explicit notes on commercial use or closed-source linking are present; license terms will govern.

Limitations & Caveats

Project is under active development; TODOs include updating technical reports and FlowGRPO comparisons.
License terms require consulting License.txt and may impose restrictions.
Multi-node training setup is complex, requiring specific environment variables and cluster tools.
Model downloads necessitate huggingface-cli login.

Health Check

Last Commit

3 weeks ago

Responsiveness

Inactive

Pull Requests (30d)

2

Issues (30d)

0

Star History

19 stars in the last 30 days

Explore Similar Projects

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind).

TCD by jabir-zheng

Distillation method for fast, high-quality image generation

Created 2 years ago

Updated 1 year ago

awesome-flow-matching by dongzhuoyao

Generative modeling with flow matching and stochastic interpolants

Created 3 years ago

Updated 3 weeks ago

TwinFlow by inclusionAI

Taming large-scale few-step generative model training

Created 4 months ago

Updated 2 days ago

gflownet by alexhernandezgarcia

PyTorch library for training and extending Generative Flow Networks (GFlowNets)

Created 3 years ago

Updated 1 day ago

FastGen by NVlabs

Fast generation from diffusion models

Created 1 month ago

Updated 5 days ago

Generative_Models_Tutorial_with_Demo by omerbsezer

Tutorial for generative models, including code and papers

Created 7 years ago

Updated 7 years ago

Starred by

Andreas Jansson

Andreas Jansson(Cofounder of Replicate).

Generative-AI by fnzhan

Survey paper for multimodal image synthesis/editing & visual AIGC

Created 4 years ago

Updated 2 years ago

AdvancedML by sjhwang82

Curated reading list for advanced machine learning course

Created 7 years ago

Updated 3 years ago

Starred by

Chenlin Meng

Chenlin Meng(Cofounder of Pika),

Andreas Blattmann

Andreas Blattmann(Cofounder of Black Forest Labs), and

1 more.

intro_dgm by jmtomczak

Introductory examples for deep generative models research paper

Created 5 years ago

Updated 6 months ago

Starred by

Lilian Weng

Lilian Weng(Cofounder of Thinking Machines Lab),

Patrick Kidger

Patrick Kidger(Core Contributor to JAX ecosystem), and

12 more.

glow by openai

Generative flow research paper code

Created 7 years ago

Updated 1 year ago

Generative_Deep_Learning_2nd_Edition by davidADSP

Code repo for generative deep learning book

Created 3 years ago

Updated 1 year ago

Starred by

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI).

awesome-generative-ai by filipecalegario

Curated list of Generative AI tools, works, models, and references

Created 4 years ago

Updated 2 months ago

Feedback? Help us improve.