Open-source course for building an AI assistant with LLMs, agents, and RAG
Top 29.5% on sourcepulse
This open-source course teaches how to build a production-ready "Second Brain" AI assistant using LLMs, agents, and Retrieval-Augmented Generation (RAG). It's designed for ML/AI engineers, data engineers, and data scientists who learn by building, offering practical skills beyond typical notebook tutorials. The course provides a template for creating personalized GenAI applications.
How It Works
The course guides users through building an end-to-end agentic RAG system. It covers data ingestion from sources like Notion or web crawls, data normalization, quality scoring using LLMs, dataset generation via distillation, fine-tuning open-source LLMs (e.g., Llama 3.1 8B) with tools like Unsloth, and deploying them. Advanced RAG techniques like contextual or parent retrieval are implemented, alongside agent building with smolagents and LLMOps for monitoring and evaluation using ZenML and Opik.
Quick Start & Requirements
apps/second-brain-offline
and apps/second-brain-online
documentation.Highlighted Details
Maintenance & Community
The course is a collaboration between Decoding ML, MongoDB, Comet, Opik, Unsloth, and ZenML. Core contributors are listed. Users can get help via GitHub Issues.
Licensing & Compatibility
Licensed under the MIT License, permitting commercial use and integration with closed-source projects.
Limitations & Caveats
While cloud alternatives are provided, optimal performance may benefit from a GPU. The course focuses on specific tools and techniques, and adapting to significantly different workflows might require substantial modification.
2 months ago
1 week