dsRAG by D-Star-AI

RAG engine for unstructured data, excelling on dense text QA

Created 1 year ago

1,549 stars

Top 26.6% on SourcePulse

1 Expert Loves This Project

andreasjansson

Andreas Jansson

Cofounder of Replicate

Project Summary

dsRAG is a high-performance retrieval engine designed for complex question-answering over unstructured text, targeting users who need superior accuracy on challenging datasets like financial reports and legal documents. It significantly outperforms vanilla RAG baselines by employing advanced techniques to enhance context and relevance.

How It Works

dsRAG improves retrieval accuracy through three core methods: Semantic Sectioning, which uses an LLM to break documents into semantically cohesive sections with descriptive titles; AutoContext, which prepends these section titles to text chunks to provide richer context to embedding and reranking models; and Relevant Segment Extraction (RSE), a query-time process that intelligently combines relevant chunks into longer segments for improved LLM comprehension. This layered approach aims to reduce irrelevant results and increase the precision of retrieved information.

Quick Start & Requirements

Install via pip: pip install dsRAG
Install with vector database support: pip install dsRAG[faiss], pip install dsRAG[chroma], etc., or pip install dsRAG[all-vector-dbs] for all.
Requires API keys for default providers (OpenAI, Cohere) set as environment variables (OPENAI_API_KEY, CO_API_KEY).
Official quickstart and documentation available.

Highlighted Details

Achieves 96.6% accuracy on FinanceBench, compared to 32% for vanilla RAG.
Evaluated on custom KITE benchmark across AI Papers, financial reports, company handbooks, and legal documents, showing significant gains with CCH+RSE.
Highly customizable architecture with interchangeable components for VectorDB, ChunkDB, Embedding, Reranker, LLM, and FileSystem.
Supports VLM for PDF parsing and metadata filtering for targeted queries.

Maintenance & Community

Developed by a two-person applied AI consulting firm.
Community support via Discord.
Users in production are encouraged to fill out a form for feature prioritization and potential priority email support.

Licensing & Compatibility

No explicit license mentioned in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Semantic sectioning and VLM features are noted as still undergoing improvements.
Default configuration relies on proprietary LLM and embedding providers, requiring API keys and potentially incurring costs.
The absence of a specified license raises concerns about usage rights and commercial compatibility.

Health Check

Last Commit

2 months ago

Responsiveness

1 day

Pull Requests (30d)

0

Issues (30d)

0

Star History

10 stars in the last 30 days

Explore Similar Projects

Meta-Chunking by IAAR-Shanghai

LLM-powered text chunking for logical document segmentation

Created 1 year ago

Updated 3 months ago

advanced-chunker by rango-ramesh

Semantic chunker for retrieval-augmented generation (RAG) pipelines

Created 9 months ago

Updated 9 months ago

tiny-rag by wdndev

Tiny RAG system for retrieval-augmented LLM

Created 1 year ago

Updated 8 months ago

vectordb by kagisearch

Python package for local, embeddings-based text retrieval

Created 2 years ago

Updated 1 year ago

spacy-layout by explosion

spaCy plugin for structured PDF/document processing

Created 1 year ago

Updated 10 months ago

Starred by

Simon Willison

Simon Willison(Coauthor of Django) and

Anton Troynikov

Anton Troynikov(Cofounder of Chroma).

chunking_evaluation by brandonstarxel

SDK for text chunking and evaluation research

Created 1 year ago

Updated 4 weeks ago

Starred by

Xiaofan Luan

Xiaofan Luan(VP Engineering at Zilliz) and

Bryan Helmig

Bryan Helmig(Cofounder of Zapier).

open-parse by Filimoa

File parser for improved LLM document chunking

Created 1 year ago

Updated 1 year ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera),

Rodrigo Nader

Rodrigo Nader(Cofounder of Langflow), and

2 more.

llmsherpa by nlmatics

Developer APIs for LLM project acceleration

Created 2 years ago

Updated 1 year ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

WeKnora by Tencent

LLM framework for deep document understanding and RAG

Created 5 months ago

Updated 2 days ago

Starred by

Jeffrey Morgan

Jeffrey Morgan(Cofounder of Ollama),

Dan Guido

Dan Guido(Cofounder of Trail of Bits), and

2 more.

langextract by google

Extract structured data from text with LLMs

Created 6 months ago

Updated 1 week ago

Starred by

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind),

Yiran Wu

Yiran Wu(Coauthor of AutoGen), and

2 more.

RAG_Techniques by NirDiamant

RAG techniques showcase for enhanced generation systems

Created 1 year ago

Updated 1 month ago

Starred by

Tobi Lutke

Tobi Lutke(Cofounder of Shopify),

Rodrigo Nader

Rodrigo Nader(Cofounder of Langflow), and

9 more.

ragflow by infiniflow

Open-source RAG engine for deep document understanding

Created 2 years ago

Updated 1 day ago

Feedback? Help us improve.