Discover and explore top open-source AI tools and projects—updated daily.
meta-pytorchAgentic RL library for scalable PyTorch experimentation
Top 49.8% on SourcePulse
Summary
Torchforge is a PyTorch-native library designed to simplify Reinforcement Learning (RL) experimentation by abstracting infrastructure concerns, allowing researchers to focus on algorithms. It offers a scalable implementation of RL abstractions, catering to both rapid research and power-user hackability. However, development has been paused, with efforts consolidated into torchtitan.
How It Works
The library provides clear RL abstractions and a single, scalable implementation. It enables fine-grained control over distributed training aspects like placement, fault handling, and communication patterns, while also allowing users to ignore infrastructure when desired. This approach supports shifting between asynchronous and synchronous training modes across thousands of GPUs.
Quick Start & Requirements
conda create -n forge python=3.12 && conda activate forge && ./scripts/install.shconda create -n forge python=3.12 && conda activate forge && ./scripts/install_rocm.sh (requires manual setting of PYTORCH_ROCM_ARCH/ROCM_VERSION for ROCm 7.x, defaults to nightly wheels; RDMA/distributed tensor features disabled).curl -fsSL https://pixi.sh/install.sh | bash then pixi run install (Conda recommended; uv support incomplete).python -m apps.grpo.main --config apps/grpo/qwen3_1_7b.yaml (requires minimum 2 GPUs).Highlighted Details
Maintenance & Community
Development in Torchforge has been paused and consolidated into torchtitan. No specific community links (Discord, Slack, etc.) or roadmap details are provided in the README.
Licensing & Compatibility
Source code is licensed under BSD 3-clause. Users must be aware of potential legal obligations related to third-party data and models linked within the repository.
Limitations & Caveats
Development is paused, with the project's future consolidated into torchtitan. ROCm builds disable RDMA and distributed tensor features due to USE_TENSOR_ENGINE=0. Pure uv installation via Pixi is not yet fully functional. Tutorials are still under development.
1 day ago
Inactive
zuoxingdong
alibaba
pytorch
inclusionAI
thu-ml