DisTrO  by NousResearch

Distributed optimizers research paper

created 11 months ago
951 stars

Top 39.4% on sourcepulse

GitHubView on GitHub
Project Summary

DisTrO is a framework for low-latency distributed optimizers designed to drastically reduce inter-GPU communication overhead in large-scale model training. It targets researchers and engineers working with distributed deep learning systems who need to optimize communication efficiency.

How It Works

DisTrO implements a family of optimizers that achieve communication reduction by three to four orders of magnitude. The core innovation lies in its approach to minimizing the data exchanged between GPUs, enabling more efficient distributed training, particularly over the internet.

Quick Start & Requirements

  • Installation: Not specified in README.
  • Prerequisites: Not specified in README.
  • Resources: Not specified in README.
  • Links:
    • Preliminary Report: [x] Aug. 26th, 2024
    • DeMo Optimization Paper: [x] Dec. 2nd, 2024
    • DeMo Optimization Code: [x] Dec. 2nd, 2024

Highlighted Details

  • Achieves 3-4 orders of magnitude reduction in inter-GPU communication.
  • Demonstrated training of a 15b model using DisTrO.
  • Related projects include Psyche Network and Nous Consilience 40b LLM.

Maintenance & Community

  • Community: Discord server available for collaboration.
  • Roadmap: Upcoming paper and code release.

Licensing & Compatibility

  • License: Not specified in README.
  • Compatibility: Not specified in README.

Limitations & Caveats

The project is presented as preliminary, with a formal paper and code release pending. Specific installation, requirements, and compatibility details are not yet available.

Health Check
Last commit

2 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
44 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jaret Burkett Jaret Burkett(Founder of Ostris), and
1 more.

nunchaku by nunchaku-tech

2.1%
3k
High-performance 4-bit diffusion model inference engine
created 8 months ago
updated 16 hours ago
Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Anton Bukov Anton Bukov(Cofounder of 1inch Network), and
16 more.

tinygrad by tinygrad

0.1%
30k
Minimalist deep learning framework for education and exploration
created 4 years ago
updated 20 hours ago
Feedback? Help us improve.