Large-Language-Model-Notebooks-Course  by peremartra

LLM course with hands-on projects using OpenAI, Hugging Face

Created 2 years ago
1,675 stars

Top 25.3% on SourcePulse

GitHubView on GitHub
Project Summary

This repository offers a practical, hands-on course for learning about Large Language Models (LLMs), targeting engineers and researchers. It provides a structured curriculum covering LLM techniques, libraries, project implementation, and enterprise solutions, enabling users to build and deploy LLM-powered applications.

How It Works

The course is structured into three main sections: Techniques and Libraries, Projects, and Enterprise Solutions. It emphasizes a learn-by-doing approach, with each lesson featuring accompanying notebooks (often hosted on Google Colab or Kaggle) and detailed articles. The content is grounded in published research papers and covers a wide range of LLM tools and methodologies, including OpenAI API, Hugging Face, LangChain, vector databases, and various fine-tuning techniques like LoRA and QLoRA.

Quick Start & Requirements

  • Notebooks are primarily designed to run in Google Colab or Kaggle, with some requiring Colab Pro for higher memory needs.
  • Users can also run notebooks in their own environments.
  • Key dependencies include Python, OpenAI API access, Hugging Face libraries, and potentially specific databases like ChromaDB.
  • Links to specific lessons, notebooks, and articles are provided within the README.

Highlighted Details

  • Covers advanced fine-tuning techniques such as PEFT, LoRA, and QLoRA.
  • Explores Retrieval Augmented Generation (RAG) with vector databases (ChromaDB, Faiss) and semantic caching.
  • Includes practical projects like building chatbots, NL2SQL translators, and data analyst assistants using LangChain agents.
  • Details LLM evaluation metrics (BLEU, ROUGE) and monitoring tools (LangSmith, Giskard).

Maintenance & Community

The repository is under permanent development, with new lessons and examples being added. Users are encouraged to contribute by opening issues for problems or starting discussions for suggestions. The author is active on LinkedIn and Twitter.

Licensing & Compatibility

The repository content is provided for educational purposes. Specific library licenses apply to the code within the notebooks.

Limitations & Caveats

Some notebooks require more memory than the free tier of Google Colab offers. The course is an unofficial repository for a book, and does not contain all information from the published book. Some sections are marked as "Work in Progress" (WIP).

Health Check
Last Commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
16 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Elvis Saravia Elvis Saravia(Founder of DAIR.AI), and
2 more.

awesome-llm-apps by Shubhamsaboo

2.6%
69k
LLM app collection with AI agents and RAG examples
Created 1 year ago
Updated 3 days ago
Feedback? Help us improve.