Courseware for LLM application through production
This repository provides the notebooks and slides for the "Large Language Models: Application through Production" course, a structured curriculum for people who want to take LLM applications from development to deployment.
How It Works
The course material is delivered as Databricks notebooks that can be imported directly into a Databricks workspace. The notebooks target Databricks Runtime 13.3 LTS for Machine Learning so that they can rely on its pre-installed ML packages. They cover a range of LLM topics, including fine-tuning, and different modules call for different cluster configurations.
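The per-module cluster recommendation can be sketched as a small lookup. This is an illustrative sketch only: the function `recommended_cluster` and the constant names are hypothetical and not part of the course repository. The instance types and notebook names are the ones listed under Quick Start & Requirements, and matching notebooks by name prefix is an assumption.

```python
# Hypothetical helper: names and structure are illustrative, not from the course repo.
GPU_NOTEBOOK_PREFIXES = ("LLM 04a", "LLM04L")  # fine-tuning notebooks named in the README
GPU_NODE_TYPE = "g5.2xlarge"                   # GPU instance listed in the README
CPU_NODE_TYPES = ("i3.xlarge", "i3.2xlarge")   # CPU instances listed in the README
RUNTIME_LABEL = "13.3 LTS for Machine Learning"


def recommended_cluster(notebook: str) -> dict:
    """Return a plain dict describing the cluster a notebook is expected to need."""
    # Assumption: a notebook is identified by the prefix of its name.
    needs_gpu = notebook.strip().startswith(GPU_NOTEBOOK_PREFIXES)
    return {
        "runtime": RUNTIME_LABEL,
        "node_type": GPU_NODE_TYPE if needs_gpu else CPU_NODE_TYPES[0],
        "gpu": needs_gpu,
    }


if __name__ == "__main__":
    for name in ("LLM 00a - Install Datasets", "LLM 04a"):
        print(name, "->", recommended_cluster(name))
```

The returned dict is only a description; creating the cluster itself still happens in the Databricks workspace.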
Quick Start & Requirements
Get the notebooks from the repository (https://github.com/databricks-academy/large-language-models.git) or download .dbc files from GitHub releases.
GPU instances (g5.2xlarge) are needed for the fine-tuning notebooks (LLM 04a, LLM04L). CPU instances (i3.xlarge, i3.2xlarge) are sufficient for the other notebooks.
Run the LLM 00a - Install Datasets notebook first; installation can take up to 25 minutes.
Highlighted Details
Maintenance & Community
This repository is part of the Databricks Academy curriculum. Further community or maintenance details are not specified in the README.
Licensing & Compatibility
The repository's license is not specified in the README. Compatibility is primarily with the Databricks platform and specific Databricks Runtime versions.
Limitations & Caveats
The courseware is explicitly tested on Databricks Runtime 13.3 LTS for Machine Learning. On other runtime versions the notebooks may need many additional libraries installed and are not guaranteed to run. GPU instances are mandatory for the fine-tuning notebooks.
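A notebook could guard against an untested runtime with a check along these lines. This is a minimal sketch, not something the course provides: the function `check_runtime` is hypothetical, and it assumes the cluster exposes its runtime version in the `DATABRICKS_RUNTIME_VERSION` environment variable. It compares only the major and minor version and does not tell whether the Machine Learning variant of the runtime is in use.

```python
import os

TESTED_RUNTIME = ("13", "3")  # runtime version the courseware is tested on, per the README


def check_runtime(env=None) -> str:
    """Classify the current runtime as 'tested', 'untested', or 'unknown'."""
    env = os.environ if env is None else env
    # Assumption: Databricks clusters expose the runtime version in this variable.
    version = env.get("DATABRICKS_RUNTIME_VERSION")
    if not version:
        return "unknown"  # e.g. running outside Databricks
    major_minor = tuple(version.split(".")[:2])
    return "tested" if major_minor == TESTED_RUNTIME else "untested"


if __name__ == "__main__":
    print("Runtime status:", check_runtime())
```

Returning a status string rather than raising leaves the decision to the caller, since an untested runtime may still work once the missing libraries are installed.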