Discover and explore top open-source AI tools and projects—updated daily.
shinprLocal RAG for private code and document search
Top 98.3% on SourcePulse
Summary
This project offers a local-first Retrieval Augmented Generation (RAG) server designed for developers. It enables semantic and keyword search across local code and technical documentation, providing a fully private, zero-setup solution that operates offline after an initial model download. The primary benefit is empowering AI assistants with accurate, context-aware access to sensitive local data without external dependencies or costs.
How It Works
The system chunks documents based on semantic similarity rather than fixed character counts, preserving the integrity of code blocks and natural topic boundaries. Text is converted into vector embeddings locally using Transformers.js. Search queries leverage a hybrid approach, combining semantic vector search with a keyword boost that prioritizes exact technical terms like function names or error codes. Results are then filtered by relevance gaps, yielding fewer but more reliable information chunks.
Quick Start & Requirements
npx -y mcp-local-rag to start the server or use CLI commands.npx). The initial embedding model download (~90MB) requires an internet connection and takes 1-2 minutes; subsequent operation is fully offline.Highlighted Details
npx, eliminating complex dependencies.useEffect, class names).Maintenance & Community
No specific details on maintainers, community channels (e.g., Discord, Slack), or active sponsorships were found in the provided README. Development is facilitated via the linked GitHub repository: https://github.com/shinpr/mcp-local-rag.git.
Licensing & Compatibility
Limitations & Caveats
Embedding generation currently relies on CPU via Transformers.js; GPU acceleration is experimental. The system is designed for single-user, local access and lacks multi-user support or built-in authentication. Supported document formats include PDF, DOCX, TXT, Markdown, and HTML; Excel, PowerPoint, and image formats are not yet supported. Switching embedding models necessitates deleting and re-ingesting the entire vector database due to incompatible vector dimensions.
9 hours ago
Inactive
ZachNagengast
freedmand
oramasearch
tobi