LLMs-from-scratch-CN by MLNLP-World

Chinese translation of LLM tutorial from scratch

created 6 months ago
1,354 stars

Top 30.3% on sourcepulse

Project Summary

This repository provides a Chinese translation of the "LLMs-from-scratch" project, with detailed Markdown notes and Jupyter notebooks for learning how to build large language models (LLMs). It targets Chinese-speaking learners and researchers who want to understand and implement LLMs from the ground up, including the Transformer architecture, GPT, and BERT.

How It Works

The project follows the structure of the original "LLMs-from-scratch" book and repository, guiding users through the foundational concepts and practical implementation of LLMs. It covers text data processing, attention mechanisms, building a GPT model, pre-training, and fine-tuning for tasks like text classification and instruction following. The approach emphasizes hands-on coding with detailed Chinese comments in Jupyter notebooks.
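To make the "attention mechanisms" step concrete, here is a minimal sketch of causal scaled dot-product self-attention. It is not taken from the repository's notebooks: it uses plain Python lists instead of a tensor library, skips the learned query/key/value projections that a real GPT layer applies, and the function names and toy inputs are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(queries, keys, values):
    """Scaled dot-product attention where position i only sees positions 0..i.

    Assumption: inputs are lists of equal-length float vectors, one per token.
    Returns (context_vectors, attention_weights).
    """
    d_k = len(keys[0])
    seq_len = len(queries)
    contexts, all_weights = [], []
    for i, q in enumerate(queries):
        # Causal mask: only score against keys at or before position i.
        scores = [
            sum(qc * kc for qc, kc in zip(q, keys[j])) / math.sqrt(d_k)
            for j in range(i + 1)
        ]
        weights = softmax(scores)
        # Context vector: attention-weighted sum of the visible value vectors.
        context = [
            sum(w * values[j][c] for j, w in enumerate(weights))
            for c in range(len(values[0]))
        ]
        contexts.append(context)
        # Pad with zeros so every row has seq_len entries (masked positions).
        all_weights.append(weights + [0.0] * (seq_len - i - 1))
    return contexts, all_weights


# Toy input: 3 tokens with 2-dimensional embeddings (made-up numbers).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
context, weights = causal_self_attention(x, x, x)
```

Because of the causal mask, the first token can attend only to itself, so its weight row is `[1.0, 0.0, 0.0]` and its context vector equals its own embedding. In a full model, `queries`, `keys`, and `values` would come from separate learned linear projections of the token embeddings rather than from the raw input.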

Highlighted Details

  • Comprehensive Chinese translation of a popular LLM development guide.
  • Jupyter notebooks with detailed Chinese comments for practical implementation.
  • Covers foundational LLM concepts, Transformer, GPT, BERT, pre-training, and fine-tuning.
  • Includes supplementary materials and exercises for deeper learning.

Maintenance & Community

The project is a translation of Sebastian Raschka's work. Contributions to the translated content are welcome via GitHub discussions. Direct contributions to the core code are not accepted to maintain consistency with the original book.

Licensing & Compatibility

This repository is a translation and educational resource. The original code accompanies the book "Build A Large Language Model (From Scratch)" by Sebastian Raschka, published by Manning. No license is explicitly stated for the translated content, which is intended as a learning aid.

Limitations & Caveats

This repository is a translation and does not modify or extend the original content. The translation may contain inaccuracies or omissions. Changes to the core code are not accepted, so that it stays faithful to the original book.

Health Check

  • Last commit: 3 months ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star history: 556 stars in the last 90 days

Explore Similar Projects

Starred by Peter Norvig (Author of Artificial Intelligence: A Modern Approach; Research Director at Google), Bojan Tunguz (AI Scientist; Formerly at NVIDIA), and 4 more.

LLMs-from-scratch by rasbt

1.7%
61k
Educational resource for LLM construction in PyTorch
created 2 years ago
updated 1 day ago