LLMsPracticalGuide by Mooler0410

Curated list of LLM practical guide resources (tree, examples, papers)

Created 3 years ago

10,199 stars

Top 5.1% on SourcePulse

View on GitHub

11 Experts Love This Project

Eugene Yan

AI Scientist at AWS

Shawn Wang

Editor of Latent Space

Gabriel Almeida

Cofounder of Langflow

Yaowei Zheng

Author of LLaMA-Factory

and 7 more!

Project Summary

This repository provides a curated collection of practical resources for Large Language Models (LLMs), aimed at practitioners and researchers. It offers a structured overview of LLM development, applications, data, and ethical considerations, drawing from a comprehensive survey paper.

How It Works

The project organizes information into practical guides covering LLM architectures (BERT-style, GPT-style), data (pretraining, finetuning, test), NLP tasks, scaling abilities, efficiency, trustworthiness, and alignment efforts. It includes an evolutionary tree of modern LLMs and a detailed table of usage and licensing restrictions for various models and their datasets.

Quick Start & Requirements

This is a curated list of resources, not a runnable software project. No installation or execution commands are provided.

Highlighted Details

Comprehensive categorization of LLMs, including encoder-only, encoder-decoder, and decoder-only architectures.
Detailed breakdown of data sources, NLP tasks, and LLM abilities like emergent abilities and scaling laws.
Extensive coverage of LLM alignment, safety, truthfulness, and prompting techniques.
A thorough table detailing model and data licenses, commercial use permissions, and notable restrictions.

Maintenance & Community

The repository is actively updated, with recent additions including usage and restrictions sections, and new models like AlexaTM and UniLM. It cites a survey paper and welcomes pull requests for refinement.

Licensing & Compatibility

The content itself is not explicitly licensed, but the repository links to numerous papers and resources with varying licenses. The "Usage and Restrictions" table clearly outlines licenses (e.g., Apache 2.0, MIT, CC BY-SA 4.0, CC BY-NC 4.0, BigScience RAIL License, TII Falcon LLM License) and commercial use permissions for many LLMs and their datasets, with some models explicitly prohibiting commercial use or having specific restrictions.

Limitations & Caveats

This repository is a curated list of links and information, not a functional tool. Users must independently verify the licensing and usage terms for each model and dataset referenced.

Health Check

Last Commit

3 months ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

18 stars in the last 30 days