llm.mojo by dorjeduck

Mojo port of Karpathy's llm.c for GPT-2 training

Created 1 year ago

362 stars

Top 77.6% on SourcePulse

3 Experts Love This Project

jph00

Cofounder of fast.ai

iamtimdavis

Cofounder of Modular

hammer

Jeff Hammerbacher

Cofounder of Cloudera

Project Summary

This project ports Andrej Karpathy's llm.c to Mojo to demonstrate Mojo's performance and low-level capabilities for C-like applications. It targets developers interested in high-performance AI model implementation and systems programming, and its benchmarks suggest a slight speed advantage over C with OpenMP.

How It Works

The project directly translates the C implementation of a GPT-2 training loop into Mojo. It relies on Mojo's Python-like syntax, static typing, and low-level memory management to reach performance comparable to, or slightly better than, optimized C code. The project highlights the vectorize and unroll_factor optimizations as the main contributors to that performance.
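The idea behind width-based vectorization can be illustrated with a conceptual sketch in plain Python. This is not Mojo code and not the project's implementation: the function name chunked_add and the width parameter are hypothetical, and the list slices merely stand in for SIMD operations. It shows the usual pattern of a main loop that handles fixed-width chunks followed by a scalar tail loop for the remainder.

```python
def chunked_add(a, b, width=4):
    """Element-wise add, processed in fixed-width chunks plus a scalar tail.

    Hypothetical illustration only: each chunk stands in for one SIMD operation.
    """
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    n = len(a)
    out = [0.0] * n
    main_end = n - (n % width)  # largest multiple of `width` not exceeding n

    # Main loop: each iteration handles `width` elements at once.
    for i in range(0, main_end, width):
        out[i:i + width] = [x + y for x, y in zip(a[i:i + width], b[i:i + width])]

    # Tail loop: leftover elements that do not fill a whole chunk.
    for i in range(main_end, n):
        out[i] = a[i] + b[i]
    return out
```

In a compiled language the main loop body would map to a single vector instruction per chunk, and an unroll factor would repeat that body several times per iteration to reduce loop overhead.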

Quick Start & Requirements

Install dependencies: pip install -r requirements.txt
Run preparatory scripts: python prepro_tinyshakespeare.py and python train_gpt2.py
Requires Modular's magic CLI tool.
Run training: magic shell then mojo train_gpt2.mojo
Detailed usage: https://github.com/dorjeduck/llm.mojo/blob/main/usage.md

Highlighted Details

Benchmarks on an M2 MacBook Pro show train_gpt2.mojo achieving a loop time of 1819 ms, slightly faster than train_gpt2.c with OpenMP (1849 ms) and roughly four times faster than train_gpt2.c without OpenMP (7473 ms).
Includes a ported test suite (test_gpt2.mojo) for validation.
Actively updated to track Mojo language releases.
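The relative speedups implied by the benchmark figures above can be checked with a few lines of Python. The loop times are the ones reported in this summary; the speedup helper and the dictionary keys are illustrative names, not part of the project.

```python
# Loop times in milliseconds, as reported for an M2 MacBook Pro.
loop_ms = {
    "train_gpt2.mojo": 1819,
    "train_gpt2.c (OpenMP)": 1849,
    "train_gpt2.c (no OpenMP)": 7473,
}

def speedup(baseline_ms, candidate_ms):
    """Return how many times faster the candidate is than the baseline."""
    return baseline_ms / candidate_ms

mojo = loop_ms["train_gpt2.mojo"]
for name, ms in loop_ms.items():
    if name != "train_gpt2.mojo":
        print(f"Mojo vs {name}: {speedup(ms, mojo):.2f}x")
```

The ratios work out to about 1.02x against C with OpenMP and about 4.11x against C without OpenMP, so the Mojo port is essentially on par with the parallel C build on this machine.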

Maintenance & Community

The project is primarily a proof of concept, with development focused on keeping pace with Mojo updates. The author is open to collaboration.

Licensing & Compatibility

License: MIT
Compatible with commercial and closed-source applications.

Limitations & Caveats

The project is currently in beta and serves as a proof of concept, with no further development planned beyond Mojo version compatibility.

Health Check

Last Commit

5 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

3 stars in the last 30 days

Explore Similar Projects

Starred by

Julien Chaumond (Cofounder of Hugging Face).

gigax by GigaxGames

Runtime for LLM-powered game NPCs

Created 1 year ago

Updated 1 year ago

gigaGPT by Cerebras

Simple codebase for training large language models

Created 2 years ago

Updated 8 months ago

Starred by

Meng Zhang (Cofounder of TabbyML).

crabml by crabml

Llama.cpp compatible inference engine in Rust

Created 2 years ago

Updated 1 year ago

Starred by

Jeff Hammerbacher (Cofounder of Cloudera).

MAmmoTH by TIGER-AI-Lab

LLM for math problem-solving, targeting generalizability

Created 2 years ago

Updated 1 year ago

Starred by

Vincent Weisser (Cofounder of Prime Intellect).

nano-aha-moment by McGill-NLP

Single-file library for "RL for LLMs" training

Created 10 months ago

Updated 3 months ago

CoLLiE by OpenMOSS

LLM training toolkit for efficient collaborative tuning

Created 2 years ago

Updated 1 year ago

Starred by

Luis Capelo (Cofounder of Lightning AI), Zhiqiang Xie (Coauthor of SGLang), and 2 more.

KernelBench by ScalingIntelligence

Benchmark for LLMs generating GPU kernels from PyTorch ops

Created 1 year ago

Updated 1 day ago

Starred by

Andreas Jansson (Cofounder of Replicate) and Jeff Hammerbacher (Cofounder of Cloudera).

llama2.mojo by tairov

Mojo code for Llama 2 inference

Created 2 years ago

Updated 1 month ago

Starred by

Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI).

FlagGems by flagos-ai

Operator library for LLM training/inference, implemented in Triton

Created 1 year ago

Updated 2 days ago

Starred by

Tobi Lutke (Cofounder of Shopify), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 6 more.

xTuring by stochasticai

SDK for fine-tuning and customizing open-source LLMs

Created 2 years ago

Updated 1 week ago

Starred by

Omar Sanseviero (DevRel at Google DeepMind) and Yaowei Zheng (Author of LLaMA-Factory).

Baichuan-7B by baichuan-inc

7B-parameter LLM for commercial use

Created 2 years ago

Updated 1 year ago

llm-action by liguodongiot

LLM resource for techniques and deployment

Created 2 years ago

Updated 1 week ago
