lennyhub-rag by traversaal-ai

Production RAG system for podcast knowledge exploration

Created 5 months ago

520 stars

Top 59.7% on SourcePulse

Project Summary

Summary

LennyHub RAG offers a production-ready Retrieval-Augmented Generation system built on 297 podcast transcripts featuring industry leaders. It provides structured access to expert insights on product management, growth, and leadership. The system features a user-friendly setup, an interactive web interface, and advanced knowledge graph-based retrieval, benefiting researchers and professionals seeking curated expert knowledge.

How It Works

This RAG system uses the RAG-Anything framework with LightRAG for entity and relationship extraction (GPT-4o-mini) and OpenAI's text-embedding-3-small for embeddings. Data is stored locally in Qdrant. Queries leverage a hybrid search strategy, combining local entity-focused, global relationship-focused, and pure vector similarity searches for comprehensive results, synthesized by GPT-4o-mini.

Quick Start & Requirements

Installation involves cloning the repository and installing Python dependencies (pip install -r requirements.txt). A crucial prerequisite is an OpenAI API key. The automated setup script, setup_rag.py, handles Qdrant installation and data indexing. A quick test with 10 transcripts takes approximately 5 minutes; processing 50 transcripts in parallel takes 6-8 minutes; and indexing all 297 transcripts in parallel requires 25-35 minutes. Recommended RAM is 4GB+ for full indexing.

Highlighted Details

One-Command Setup: setup_rag.py automates installation, Qdrant setup, and data indexing.
Visual Web Interface: A Streamlit application provides an interactive querying experience with status monitoring and transcript browsing.
Interactive Knowledge Graph: Visualizes connections between 544 individuals mentioned across transcripts, featuring a clickable network visualization.
Local Qdrant: Utilizes Qdrant for local, production-grade vector storage without Docker.
Advanced Retrieval: Employs LightRAG for entity and relationship extraction, enabling sophisticated RAG capabilities.
Parallel Processing: Offers 5-10x faster indexing compared to sequential methods.

Maintenance & Community

Contributions are welcomed, with a clear project structure and comprehensive documentation provided. Specific details on active maintainers, community channels (like Discord/Slack), or sponsorship are not detailed in the README.

Licensing & Compatibility

The project's license is specified in a separate LICENSE file. Commercial use compatibility is not explicitly stated but depends on the terms of OpenAI's API and Qdrant's licensing.

Limitations & Caveats

Operation is dependent on an active OpenAI API key, incurring per-query costs (though caching significantly reduces this). System performance and storage requirements scale with the number of transcripts processed, necessitating adequate RAM and disk space.

lennyhub-rag by traversaal-ai

Explore Similar Projects

dpr-scale by facebookresearch

rag by neuml

Easy-RAG by yuntianhe2014

lex-gpt by rlancemartin

EpsteinFiles-RAG by AnkitNayak-dev

ArXivChatGuru by redis-developer

bilibili-rag by via007

RAG-Interview-Questions-and-Answers-Hub by KalyanKS-NLP

nomic by nomic-ai

pdfGPT by bhaskatripathi

RAG-Anything by HKUDS

LightRAG by HKUDS