Deep search alternative for private data, using LLMs and vector DBs
Top 7.7% on sourcepulse
DeepSearcher is an open-source Python framework for building private data search and reasoning systems. It integrates Large Language Models (LLMs) with vector databases to provide accurate answers and comprehensive reports from enterprise knowledge bases, targeting enterprise knowledge management and intelligent Q&A.
How It Works
DeepSearcher orchestrates interactions between various LLMs and embedding models, leveraging vector databases like Milvus for efficient data retrieval. Users can load local files or crawl websites, embed the content, store it in a vector database, and then query it using LLMs. This modular approach allows flexibility in choosing components for optimal performance and cost.
Quick Start & Requirements
pip install deepsearcher
or pip install "deepsearcher[ollama]"
for optional dependencies. Development installation via uv sync
is also supported.FIRECRAWL_API_KEY
.OPENAI_API_KEY
for basic functionality.Highlighted Details
Maintenance & Community
The project is maintained by Zilliz. Community engagement is encouraged via GitHub stars and forks.
Licensing & Compatibility
Limitations & Caveats
Some features like web crawling and certain document loaders are noted as "under development." Offline mode for Hugging Face model downloads may require network proxy or token configuration. Jupyter notebook usage may require nest_asyncio
.
3 weeks ago
1 day