Foundations-of-LLMs  by ZJU-LLMs

LLM textbook for systematically explaining foundational knowledge

Created 1 year ago
11,785 stars

Top 4.3% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

This repository provides a comprehensive textbook on the foundations of Large Language Models (LLMs), aimed at students, researchers, and practitioners interested in the field. It offers a systematic introduction to core concepts and cutting-edge techniques, with a focus on readability, rigor, and depth.

How It Works

The book systematically covers key LLM topics, including traditional language models, LLM architecture evolution, prompt engineering, parameter-efficient fine-tuning, model editing, and retrieval-augmented generation. Each chapter uses an animal analogy for illustrative purposes, enhancing accessibility. The content is derived from the authors' research and understanding, with a commitment to monthly updates and incorporating community feedback.

Quick Start & Requirements

The complete PDF version of the book is available as 大模型基础.pdf. Chapter-specific PDFs are located in the 大语言模型分章节内容 folder, and related papers are in the 大语言模型相关论文 folder.

Highlighted Details

  • Covers six core chapters: Language Model Basics, Large Language Models, Prompt Engineering, Parameter-Efficient Fine-Tuning, Model Editing, and Retrieval-Augmented Generation.
  • Includes paper lists for each chapter to track the latest advancements.
  • Features animal-themed illustrations for each chapter to aid comprehension.
  • Plans to expand coverage to include LLM inference acceleration and LLM agents in future versions.

Maintenance & Community

The project is actively maintained with a commitment to monthly updates. Feedback and suggestions are encouraged via GitHub issues. Contact is available via email at xuwenyi@zju.edu.cn.

Licensing & Compatibility

The repository content is not explicitly licensed. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The book is presented as a first edition, with ongoing development and potential for revisions based on community input. Specific technical prerequisites or compatibility notes for accessing or utilizing the content are not detailed.

Health Check
Last Commit

8 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
236 stars in the last 30 days

Explore Similar Projects

Starred by Rodrigo Nader Rodrigo Nader(Cofounder of Langflow), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
11 more.

Awesome-LLM by Hannibal046

0.3%
25k
Curated list of Large Language Model resources
Created 2 years ago
Updated 1 month ago
Starred by Peter Norvig Peter Norvig(Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

Hands-On-Large-Language-Models by HandsOnLLM

1.4%
16k
Code examples for "Hands-On Large Language Models" book
Created 1 year ago
Updated 1 month ago
Feedback? Help us improve.