deep-searcher  by zilliztech

Deep search alternative for private data, using LLMs and vector DBs

created 5 months ago
6,712 stars

Top 7.7% on sourcepulse

GitHubView on GitHub
Project Summary

DeepSearcher is an open-source Python framework for building private data search and reasoning systems. It integrates Large Language Models (LLMs) with vector databases to provide accurate answers and comprehensive reports from enterprise knowledge bases, targeting enterprise knowledge management and intelligent Q&A.

How It Works

DeepSearcher orchestrates interactions between various LLMs and embedding models, leveraging vector databases like Milvus for efficient data retrieval. Users can load local files or crawl websites, embed the content, store it in a vector database, and then query it using LLMs. This modular approach allows flexibility in choosing components for optimal performance and cost.

Quick Start & Requirements

  • Installation: pip install deepsearcher or pip install "deepsearcher[ollama]" for optional dependencies. Development installation via uv sync is also supported.
  • Prerequisites: Python 3.10+ recommended. API keys for chosen LLMs and embedding models (e.g., OpenAI, DeepSeek, Anthropic, Google Gemini) are required. For local vector storage, Milvus Lite is used; for larger deployments, a Milvus server or Zilliz Cloud is recommended. Web crawling requires a FIRECRAWL_API_KEY.
  • Demo: A Python quick-start example is provided, requiring OPENAI_API_KEY for basic functionality.
  • Docs: Configuration Details, Quick Start Demo

Highlighted Details

  • Supports a wide array of LLMs including OpenAI, Qwen, DeepSeek, Grok, Claude, and Llama.
  • Integrates with multiple embedding models and vector databases (Milvus, Zilliz Cloud, Qdrant).
  • Offers flexible data loading from local files and web crawling capabilities.
  • Provides a Python CLI for loading and querying, and a FastAPI service for API access.

Maintenance & Community

The project is maintained by Zilliz. Community engagement is encouraged via GitHub stars and forks.

Licensing & Compatibility

  • License: Apache 2.0.
  • Compatibility: Permissive license suitable for commercial use and integration into closed-source applications.

Limitations & Caveats

Some features like web crawling and certain document loaders are noted as "under development." Offline mode for Hugging Face model downloads may require network proxy or token configuration. Jupyter notebook usage may require nest_asyncio.

Health Check
Last commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
0
Star History
972 stars in the last 90 days

Explore Similar Projects

Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Alex Cheema Alex Cheema(Cofounder of EXO Labs), and
3 more.

Perplexica by ItzCrazyKns

0.3%
23k
AI-powered search engine alternative
created 1 year ago
updated 1 day ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind).

LightRAG by HKUDS

1.0%
19k
RAG framework for fast, simple retrieval-augmented generation
created 10 months ago
updated 1 day ago
Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Zhiqiang Xie Zhiqiang Xie(Author of SGLang), and
7 more.

milvus by milvus-io

0.4%
36k
Cloud-native vector database for scalable ANN search
created 5 years ago
updated 1 day ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Anton Troynikov Anton Troynikov(Cofounder of Chroma), and
20 more.

llama_index by run-llama

0.3%
43k
Data framework for building LLM-powered agents
created 2 years ago
updated 1 day ago
Feedback? Help us improve.