A-Guide-to-Retrieval-Augmented-LLM by Wang-Shuo

Intro to retrieval augmented LLMs

Created 2 years ago

304 stars

Top 88.2% on SourcePulse

Project Summary

This repository provides a comprehensive guide to Retrieval Augmented Large Language Models (LLM), explaining their importance in addressing LLM limitations like hallucinations and data freshness. It targets engineers and researchers seeking to understand and implement RAG systems, offering a structured overview of concepts, modules, and applications.

How It Works

Retrieval Augmented LLMs enhance LLM responses by first retrieving relevant information from an external knowledge base using Information Retrieval (IR) techniques. This retrieved context is then provided to the LLM, typically within its context window, to inform its generation process. This approach mitigates LLM weaknesses by grounding responses in factual, up-to-date, or private data, improving accuracy and explainability.

Quick Start & Requirements

Install: pip install llama-index (for a comprehensive framework) or pip install langchain (for a broader LLM application framework).
Prerequisites: Python 3.x, potentially specific LLM API keys (e.g., OpenAI), and a vector database (e.g., Chroma, Milvus, Pinecone).
Resources: Setup involves installing libraries and potentially configuring API access and a vector database.
Links:
- LlamaIndex: https://github.com/run-llama/llama_index
- LangChain: https://github.com/langchain-ai/langchain

Highlighted Details

Detailed breakdown of key modules: Data and Indexing, Query and Retrieval, and Response Generation.
Explores various indexing strategies: Chain, Tree, Keyword Table, and Vector Indexes.
Discusses text chunking strategies and their impact on retrieval effectiveness.
Analyzes real-world applications like ChatGPT Retrieval Plugin, GitHub Copilot, and knowledge base Q&A systems.

Maintenance & Community

The guide references popular frameworks like LlamaIndex and LangChain, which have active communities and ongoing development.
Mentions various open-source and commercial tools (e.g., Danswer, PandaGPT, Quivr, ChatFiles, Mendable) in the RAG space.

Licensing & Compatibility

The repository itself is a guide and does not appear to have a specific license.
The referenced frameworks (LlamaIndex, LangChain) are typically Apache 2.0 licensed, allowing for commercial use.

Limitations & Caveats

The guide acknowledges that it is a personal learning summary and may not be exhaustive or entirely professional. The effectiveness of RAG systems heavily depends on the quality of data indexing, retrieval algorithms, and prompt engineering, which require careful tuning for specific use cases.

A-Guide-to-Retrieval-Augmented-LLM by Wang-Shuo

Explore Similar Projects

ToolQA by night-chen

embedding_rerank_retrieval by percent4

LLM4IR-Survey by RUC-NLPIR

stark by snap-stanford

deep-seek by dzhng

raglite by superlinear-ai

elasticsearch-labs by elastic

kernel-memory by microsoft

langchain-experiments by daveebbelaar

local-deep-researcher by langchain-ai

rag-from-scratch by langchain-ai

llama_index by run-llama