LLMs-from-scratch-CN by MLNLP-World

Chinese translation of LLM tutorial from scratch

Created 1 year ago

2,348 stars

Top 19.0% on SourcePulse

Project Summary

This repository provides a Chinese translation of the "LLMs-from-scratch" project, offering detailed Markdown notes and Jupyter notebooks for learning how to build Large Language Models. It's targeted at Chinese-speaking learners and researchers interested in understanding and implementing LLMs from the ground up, including Transformer architecture, GPT, and BERT.

How It Works

The project follows the structure of the original "LLMs-from-scratch" book and repository, guiding users through the foundational concepts and practical implementation of LLMs. It covers text data processing, attention mechanisms, building a GPT model, pre-training, and fine-tuning for tasks like text classification and instruction following. The approach emphasizes hands-on coding with detailed Chinese comments in Jupyter notebooks.

Quick Start & Requirements

Install Python packages as per the setup directory's README.
Code is designed to run on standard laptops; GPUs are automatically utilized if available.
Official English repository: https://github.com/raschbt/LLMs-from-scratch.git
Official English book: https://www.manning.com/books/build-a-large-language-model-from-scratch

Highlighted Details

Comprehensive Chinese translation of a popular LLM development guide.
Jupyter notebooks with detailed Chinese comments for practical implementation.
Covers foundational LLM concepts, Transformer, GPT, BERT, pre-training, and fine-tuning.
Includes supplementary materials and exercises for deeper learning.

Maintenance & Community

The project is a translation of Sebastian Raschka's work. Contributions to the translated content are welcome via GitHub discussions. Direct contributions to the core code are not accepted to maintain consistency with the original book.

Licensing & Compatibility

This repository is a translation and educational resource. The original code is associated with the book "Build A Large Language Model (From Scratch)" by Sebastian Raschka, published by Manning. Specific licensing for the translated content is not explicitly stated but is intended as a learning aid.

Limitations & Caveats

This repository is a translation and does not modify or extend the original content. Potential inaccuracies or omissions in translation may exist. Contributions that alter the core code are not accepted to maintain fidelity with the original book.

Health Check

Last Commit

4 months ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

94 stars in the last 30 days