Retrieval-augmented language model research paper
Top 30.7% on sourcepulse
RAPTOR offers a novel retrieval-augmented generation (RAG) approach by building a recursive tree structure from documents, enabling more efficient and context-aware information retrieval. It is designed for researchers and developers working with large text corpora who need to improve the accuracy and relevance of language model responses.
How It Works
RAPTOR constructs a hierarchical tree of summaries from input documents. It recursively summarizes chunks of text, then summarizes those summaries, creating an abstractive tree. This structure allows for targeted retrieval of relevant information by traversing the tree, leading to more precise answers from language models. The framework is extensible, allowing users to integrate custom summarization, question-answering, and embedding models.
Quick Start & Requirements
pip install -r requirements.txt
OPENAI_API_KEY
).demo.ipynb
for examples with custom models.Highlighted Details
Maintenance & Community
The project is the official implementation of the RAPTOR paper, co-authored by Christopher D. Manning. Further examples and configuration guides are planned.
Licensing & Compatibility
Released under the MIT License, permitting commercial use and integration with closed-source applications.
Limitations & Caveats
The project is marked as "Work in Progress" (WIP) with forthcoming documentation and advanced features. Initial setup requires an OpenAI API key, and custom model integration details are still being developed.
11 months ago
1 day