LLMs-from-scratch-CN  by MLNLP-World

Chinese translation of LLM tutorial from scratch

Created 8 months ago
1,566 stars

Top 26.7% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a Chinese translation of the "LLMs-from-scratch" project, offering detailed Markdown notes and Jupyter notebooks for learning how to build Large Language Models. It's targeted at Chinese-speaking learners and researchers interested in understanding and implementing LLMs from the ground up, including Transformer architecture, GPT, and BERT.

How It Works

The project follows the structure of the original "LLMs-from-scratch" book and repository, guiding users through the foundational concepts and practical implementation of LLMs. It covers text data processing, attention mechanisms, building a GPT model, pre-training, and fine-tuning for tasks like text classification and instruction following. The approach emphasizes hands-on coding with detailed Chinese comments in Jupyter notebooks.

Quick Start & Requirements

Highlighted Details

  • Comprehensive Chinese translation of a popular LLM development guide.
  • Jupyter notebooks with detailed Chinese comments for practical implementation.
  • Covers foundational LLM concepts, Transformer, GPT, BERT, pre-training, and fine-tuning.
  • Includes supplementary materials and exercises for deeper learning.

Maintenance & Community

The project is a translation of Sebastian Raschka's work. Contributions to the translated content are welcome via GitHub discussions. Direct contributions to the core code are not accepted to maintain consistency with the original book.

Licensing & Compatibility

This repository is a translation and educational resource. The original code is associated with the book "Build A Large Language Model (From Scratch)" by Sebastian Raschka, published by Manning. Specific licensing for the translated content is not explicitly stated but is intended as a learning aid.

Limitations & Caveats

This repository is a translation and does not modify or extend the original content. Potential inaccuracies or omissions in translation may exist. Contributions that alter the core code are not accepted to maintain fidelity with the original book.

Health Check
Last Commit

5 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
121 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Junyang Lin Junyang Lin(Core Maintainer at Alibaba Qwen), and
3 more.

Alpaca-CoT by PhoebusSi

0.1%
3k
IFT platform for instruction collection, parameter-efficient methods, and LLMs
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.