dive-into-llms  by Lordog

LLM tutorial for practical programming

Created 1 year ago
8,067 stars

Top 6.4% on SourcePulse

GitHubView on GitHub
Project Summary

《动手学大模型》系列教程为大模型初学者提供免费的编程实践指导,旨在帮助学生和研究人员快速入门大模型技术,支持课程设计和学术研究。该项目源自上海交通大学的《人工智能安全技术》课程,并已扩展至包含国产化大模型开发全流程的公益教程。

How It Works

该教程通过一系列编程实践,涵盖了预训练模型微调与部署、提示学习与思维链、知识编辑、模型水印、越狱攻击、多模态模型以及大模型智能体与安全等核心大模型技术。其方法论侧重于通过实际操作,让学习者能够动手实践,理解并掌握大模型在不同场景下的应用和安全挑战。

Quick Start & Requirements

  • Installation: No explicit installation instructions are provided, suggesting a focus on understanding concepts and potentially running code snippets within existing Python environments.
  • Prerequisites: Python environment, familiarity with machine learning concepts. Specific hardware requirements (e.g., GPU) are not detailed but are implied for practical model training and deployment.
  • Resources: Links to slides and tutorials are provided for each topic. A separate "大模型开发全流程" series, developed with Huawei Ascend, offers PPT, lab manuals, and videos, targeting different skill levels.
  • Links:

Highlighted Details

  • Comprehensive coverage of LLM lifecycle: from fine-tuning and deployment to advanced topics like jailbreaking and AI agents.
  • Includes a new series on the full LLM development process, co-developed with Huawei Ascend, offering a localized perspective.
  • Content is derived from university coursework, emphasizing academic rigor and practical application.
  • Focus on AI safety, including adversarial attacks and model alignment.

Maintenance & Community

The project is actively maintained and welcomes contributions via Issues and Pull Requests. Key contributors are listed from Shanghai Jiao Tong University and the National University of Singapore, with significant contributions from Huawei Ascend for the new series.

Licensing & Compatibility

The project is offered as a free, public good. Specific licensing details are not explicitly stated in the README, but the "公益性质、完全免费" (public welfare nature, completely free) suggests a permissive stance for educational and research use. Commercial use compatibility is not detailed.

Limitations & Caveats

The content is based on contributors' personal experience and internet data, with a disclaimer that accuracy is not guaranteed. The project is described as ongoing, implying potential for omissions or changes.

Health Check
Last Commit

2 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
1
Star History
647 stars in the last 30 days

Explore Similar Projects

Starred by Edward Sun Edward Sun(Research Scientist at Meta Superintelligence Lab), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
2 more.

ama_prompting by HazyResearch

0%
547
Language model prompting strategy research paper
Created 3 years ago
Updated 2 years ago
Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research) and Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

PromptWizard by microsoft

0.4%
4k
Agent-driven framework for task-aware prompt optimization
Created 1 year ago
Updated 1 month ago
Starred by Rodrigo Nader Rodrigo Nader(Cofounder of Langflow), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
11 more.

Awesome-LLM by Hannibal046

0.3%
25k
Curated list of Large Language Model resources
Created 2 years ago
Updated 1 month ago
Feedback? Help us improve.