This repository provides code and resources for the Japanese-language books "Introduction to Large Language Models" (2023) and "Introduction to Large Language Models II: Implementation and Evaluation of Generative LLMs" (2024). It targets engineers and researchers interested in practical LLM implementation, offering hands-on examples for fine-tuning, evaluation, and various NLP tasks.
How It Works
All code runs in Google Colaboratory notebooks, so readers can follow along without a local GPU setup; models and datasets are hosted on the Hugging Face Hub. Coverage spans transformer architectures, fine-tuning techniques such as LoRA, and evaluation methodologies using tools such as llm-jp-eval.
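For readers who want a feel for the fine-tuning workflow before opening the notebooks, the sketch below shows the general LoRA pattern with the `peft` and `transformers` libraries. The base model id is a placeholder, not necessarily one used in the books.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

base_model_id = "gpt2"  # placeholder; the notebooks use their own (Japanese) base models
tokenizer = AutoTokenizer.from_pretrained(base_model_id)
model = AutoModelForCausalLM.from_pretrained(base_model_id)

# Wrap the base model with low-rank adapters; only the adapter weights are trained,
# which keeps memory usage within typical Colab limits.
lora_config = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05, task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # prints the small fraction of trainable parameters
```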
Quick Start & Requirements
- Code is designed to run in Google Colaboratory.
- Datasets and models are available on Hugging Face Hub.
- A data access issue with the MARC-ja dataset is noted; a workaround using the WRIME dataset is provided for the affected sections (see the sketch after this list).
- Links to specific Colab notebooks for each chapter/section are provided in a table.
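As a hedged illustration of the WRIME workaround, the snippet below loads a sentiment-labeled WRIME dataset from the Hub with the `datasets` library. The dataset id `llm-book/wrime-sentiment` is an assumption based on the books' Hugging Face organization; check the affected notebook for the exact id.

```python
from datasets import load_dataset

# Dataset id is an assumption; the affected notebooks document the exact replacement.
dataset = load_dataset("llm-book/wrime-sentiment")
print(dataset)              # available splits and column names
print(dataset["train"][0])  # one labeled example
```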
Highlighted Details
- Comprehensive coverage of LLM topics from foundational transformers to advanced techniques like instruction tuning, preference tuning, and Retrieval-Augmented Generation (RAG).
- Practical implementation examples for tasks including sentiment analysis, named entity recognition, summarization, question answering, and semantic similarity (a minimal sentiment-analysis inference sketch follows this list).
- Includes sections on distributed parallel training and evaluation benchmarks like llm-jp-eval and Japanese Vicuna QA Benchmark.
- Code is confirmed to run on Google Colaboratory, facilitating easy experimentation.
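To give a flavor of the task-level examples, here is a minimal inference sketch for sentiment analysis with the `transformers` pipeline. The model id is an assumption (a fine-tuned checkpoint in the books' Hub organization), and Japanese BERT tokenizers additionally require `fugashi` and `unidic-lite`.

```python
from transformers import pipeline

# Model id is an assumption; substitute whichever checkpoint the notebook produces.
classifier = pipeline(
    "text-classification",
    model="llm-book/bert-base-japanese-v3-wrime-sentiment",
)
print(classifier("この映画は本当に素晴らしかった。"))  # e.g. [{'label': 'positive', 'score': 0.98}]
```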
Maintenance & Community
- The repository accompanies two published books, and its notebooks are organized around the books' chapters and sections.
- Links to publisher and Amazon pages for both books are provided.
- A link to errata for the books is also available.
Licensing & Compatibility
- The repository itself does not explicitly state a license.
- Use of the code examples is likely governed by the licenses of the libraries they depend on (e.g., Hugging Face Transformers, PyTorch).
Limitations & Caveats
- The MARC-ja dataset used in some examples had a broken download link as of July 2023; a workaround using the WRIME dataset is provided.
- The repository's license is not specified, which may impact commercial use or integration into closed-source projects.