LLMsPracticalGuide  by Mooler0410

Curated list of LLM practical guide resources (tree, examples, papers)

created 2 years ago
9,998 stars

Top 5.1% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides a curated collection of practical resources for Large Language Models (LLMs), aimed at practitioners and researchers. It offers a structured overview of LLM development, applications, data, and ethical considerations, drawing from a comprehensive survey paper.

How It Works

The project organizes information into practical guides covering LLM architectures (BERT-style, GPT-style), data (pretraining, finetuning, test), NLP tasks, scaling abilities, efficiency, trustworthiness, and alignment efforts. It includes an evolutionary tree of modern LLMs and a detailed table of usage and licensing restrictions for various models and their datasets.

Quick Start & Requirements

This is a curated list of resources, not a runnable software project. No installation or execution commands are provided.

Highlighted Details

  • Comprehensive categorization of LLMs, including encoder-only, encoder-decoder, and decoder-only architectures.
  • Detailed breakdown of data sources, NLP tasks, and LLM abilities like emergent abilities and scaling laws.
  • Extensive coverage of LLM alignment, safety, truthfulness, and prompting techniques.
  • A thorough table detailing model and data licenses, commercial use permissions, and notable restrictions.

Maintenance & Community

The repository is actively updated, with recent additions including usage and restrictions sections, and new models like AlexaTM and UniLM. It cites a survey paper and welcomes pull requests for refinement.

Licensing & Compatibility

The content itself is not explicitly licensed, but the repository links to numerous papers and resources with varying licenses. The "Usage and Restrictions" table clearly outlines licenses (e.g., Apache 2.0, MIT, CC BY-SA 4.0, CC BY-NC 4.0, BigScience RAIL License, TII Falcon LLM License) and commercial use permissions for many LLMs and their datasets, with some models explicitly prohibiting commercial use or having specific restrictions.

Limitations & Caveats

This repository is a curated list of links and information, not a functional tool. Users must independently verify the licensing and usage terms for each model and dataset referenced.

Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
173 stars in the last 90 days

Explore Similar Projects

Starred by Dan Guido Dan Guido(Cofounder of Trail of Bits), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
6 more.

open-llms by eugeneyan

0.2%
12k
Curated list of commercially-usable open LLMs
created 2 years ago
updated 5 months ago
Feedback? Help us improve.