stanford-cme-295-transformers-large-language-models by afshinea

Cheatsheet for Stanford's Transformers & LLMs course

created 4 months ago
2,267 stars

Top 20.4% on sourcepulse

Project Summary

This repository provides a comprehensive cheatsheet for Stanford's CME 295 course on Transformers and Large Language Models. It aims to consolidate key concepts for students and practitioners in NLP and deep learning, offering a structured overview of essential topics.

How It Works

The cheatsheet summarizes core concepts from the "Super Study Guide: Transformers & Large Language Models" book, which features extensive illustrations. It covers transformer architectures, attention mechanisms, optimization techniques, LLM fine-tuning methods, and applications like RAG and agents.
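
To give a taste of the material the guide condenses, here is a minimal single-head scaled dot-product attention sketch in NumPy. This is an illustrative reduction of the standard formula softmax(QKᵀ/√d_k)V, not code taken from the cheatsheet or the book:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Single-head attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of each query to each key
    # Numerically stable softmax over the key axis
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V  # each output row is a weighted sum of value rows

# Toy example: 3 tokens, model dimension 4
rng = np.random.default_rng(0)
Q = K = V = rng.normal(size=(3, 4))
print(scaled_dot_product_attention(Q, K, V).shape)  # (3, 4)
```

The attention optimizations listed below (sparse, low-rank, FlashAttention) compute this same quantity with better time or memory behavior.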

Highlighted Details

  • Covers transformer variants and attention optimizations such as sparse attention, low-rank approximations, and FlashAttention.
  • Details LLM fine-tuning methods including SFT, LoRA, and preference tuning (see the sketch after this list).
  • Explains model-efficiency techniques such as Mixture of Experts, distillation, and quantization.
  • Includes applications like LLM-as-a-judge, RAG, agents, and reasoning models.
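
To make the fine-tuning bullet concrete, here is a minimal LoRA sketch in PyTorch. It is an illustrative reduction rather than code from the repository: a frozen pretrained linear layer is augmented with a trainable low-rank update B·A scaled by alpha/r.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: W x + (alpha/r) * B A x."""
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # pretrained weights stay frozen
        self.base.bias.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)  # down-projection
        self.B = nn.Parameter(torch.zeros(out_features, r))        # up-projection, zero-init
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(768, 768)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # only the low-rank factors A and B are trained
```

Only A and B receive gradients, which is why LoRA cuts the trainable parameter count per adapted matrix from d² to 2rd (here 589,824 down to 12,288).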

Maintenance & Community

The project is authored by Afshine Amidi and Shervine Amidi, associated with Stanford University. Further details can be found on the course website: cme295.stanford.edu.

Licensing & Compatibility

The repository does not explicitly state a license, so reuse and redistribution terms are undefined.

Limitations & Caveats

This repository serves as a summary and reference guide, not a runnable codebase. It points to external resources for in-depth study.

Health Check

  • Last commit: 6 days ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 870 stars in the last 90 days

Explore Similar Projects

Starred by Stas Bekman (author of the Machine Learning Engineering Open Book; research engineer at Snowflake), Nathan Lambert (AI researcher at AI2), and 4 more.

large_language_model_training_playbook by huggingface — 478 stars
Tips for training large language models
Created 2 years ago; last updated 2 years ago