picotron  by huggingface

Minimalist distributed training framework for educational use

created 10 months ago
1,619 stars

Top 26.5% on sourcepulse

GitHubView on GitHub
Project Summary

Picotron is a minimalist, hackable distributed training framework designed for educational purposes, enabling users to learn and experiment with 4D parallelism (Data, Tensor, Pipeline, Context) for large language models. It offers a simplified codebase, making complex distributed training concepts accessible to researchers and students.

How It Works

Picotron implements 4D parallelism by breaking down model training across data, tensor, pipeline, and context dimensions. This approach allows for efficient distribution of large models and datasets across multiple GPUs, facilitating training of models that would otherwise be too large for single devices. The framework prioritizes code readability and simplicity, with core components like train.py and parallelism strategies under 300 lines each.

Quick Start & Requirements

Health Check
Last commit

3 weeks ago

Responsiveness

1 week

Pull Requests (30d)
1
Issues (30d)
0
Star History
221 stars in the last 90 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera) and Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

InternEvo by InternLM

1.0%
402
Lightweight training framework for model pre-training
created 1 year ago
updated 1 week ago
Starred by Peter Norvig Peter Norvig(Author of Artificial Intelligence: A Modern Approach; Research Director at Google), Didier Lopes Didier Lopes(Founder of OpenBB), and
15 more.

llm.c by karpathy

0.2%
27k
LLM training in pure C/CUDA, no PyTorch needed
created 1 year ago
updated 1 month ago
Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Anton Bukov Anton Bukov(Cofounder of 1inch Network), and
16 more.

tinygrad by tinygrad

0.1%
30k
Minimalist deep learning framework for education and exploration
created 4 years ago
updated 23 hours ago
Feedback? Help us improve.