cookbook  by EleutherAI

Deep learning resource for practical model work

Created 1 year ago
813 stars

Top 43.6% on SourcePulse

GitHubView on GitHub
Project Summary

This repository serves as a comprehensive guide and collection of practical resources for deep learning practitioners, particularly those working with large language models (LLMs). It aims to demystify complex training and inference processes by providing detailed explanations, utilities, and curated links to external tools and papers, targeting beginners and experienced engineers alike.

How It Works

The Cookbook organizes information into distinct sections covering LLM calculations, benchmarks, foundational concepts, distributed training, best practices, and educational code examples. It leverages a curated approach, linking to external calculators, visualizations, and annotated papers to explain core concepts like FLOPs, VRAM estimation, and parallelism strategies. The project also includes minimal code repositories for pedagogical purposes, demonstrating key LLM architectures.

Quick Start & Requirements

This repository is primarily a collection of curated links and explanations, not a runnable application. No direct installation or execution commands are provided. Users will need to follow links to external resources for specific tools or code examples.

Highlighted Details

  • Provides utilities for calculating FLOPs, memory overhead, and parameter counts for LLMs.
  • Includes benchmarks for communication and transformer sizing.
  • Features a reading list with annotated papers and visualizations for understanding transformer architecture.
  • Offers minimal code repositories for pedagogical purposes, covering various LLM architectures.

Maintenance & Community

The project is associated with EleutherAI, a prominent research collective in the AI space. Contributions are welcomed via GitHub Issues and Pull Requests.

Licensing & Compatibility

The repository itself is not directly licensed for use as a software package. The content is intended for educational purposes. Specific licenses for linked external code or papers would need to be consulted individually.

Limitations & Caveats

As a curated collection of resources, this repository does not provide a unified, runnable framework. Users must navigate and integrate various external tools and codebases, each with its own dependencies and setup requirements.

Health Check
Last Commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
5 stars in the last 30 days

Explore Similar Projects

Starred by Théophile Gervet Théophile Gervet(Cofounder of Genesis AI), Jason Knight Jason Knight(Director AI Compilers at NVIDIA; Cofounder of OctoML), and
6 more.

lingua by facebookresearch

0.1%
5k
LLM research codebase for training and inference
Created 11 months ago
Updated 2 months ago
Starred by Yineng Zhang Yineng Zhang(Inference Lead at SGLang; Research Scientist at Together AI), Lewis Tunstall Lewis Tunstall(Research Engineer at Hugging Face), and
15 more.

torchtune by pytorch

0.2%
5k
PyTorch library for LLM post-training and experimentation
Created 1 year ago
Updated 1 day ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Roy Frostig Roy Frostig(Coauthor of JAX; Research Scientist at Google DeepMind), and
19 more.

ml-engineering by stas00

0.4%
15k
Open book for LLM/VLM training engineers
Created 5 years ago
Updated 22 hours ago
Feedback? Help us improve.