milvus  by milvus-io

Cloud-native vector database for scalable ANN search

created 5 years ago
36,262 stars

Top 0.9% on sourcepulse

GitHubView on GitHub
Project Summary

Milvus is a high-performance, cloud-native vector database designed for AI applications that need to efficiently organize and search vast amounts of unstructured data like text and images. It targets developers building applications such as RAG, recommendation systems, and semantic search, offering scalable vector search with metadata filtering and hybrid search capabilities.

How It Works

Milvus employs a distributed architecture separating compute and storage, allowing independent scaling of query and data nodes for optimal performance under varying workloads. It supports numerous vector index types (HNSW, IVF, SCANN, DiskANN) and hardware acceleration (GPU indexing via CAGRA), enabling efficient vector search with advanced features like metadata filtering and range search. The system also natively supports sparse vectors for full-text search (BM25, SPLADE) and allows combining dense and sparse vector searches for hybrid retrieval.

Quick Start & Requirements

  • Install Python SDK: pip install -U pymilvus
  • For local development, Milvus Lite is available via client = MilvusClient("milvus_demo.db").
  • Connect to deployed instances using uri and token.
  • See Milvus Docs for installation, usage, and deployment.

Highlighted Details

  • Supports multiple vector index types (HNSW, IVF, SCANN, DiskANN) and hardware acceleration (GPU).
  • Natively supports sparse vectors for full-text search and hybrid search (dense + sparse).
  • Offers multi-tenancy, hot/cold storage, and fine-grained access control (RBAC).
  • Integrates with AI tools like LangChain, LlamaIndex, OpenAI, and HuggingFace.

Maintenance & Community

Milvus is an LF AI & Data Foundation project with Zilliz as a major contributor. Community support is available on Discord. Follow on X, LinkedIn, Youtube, Medium.

Licensing & Compatibility

Licensed under the Apache 2.0 License. Compatible with commercial use and closed-source linking.

Limitations & Caveats

Building from source requires specific Go, CMake, GCC/LLVM, and Python versions depending on the OS. The README mentions support for Python versions up to 3.11 for building from source, while the quickstart examples use pymilvus which may support newer Python versions.

Health Check
Last commit

18 hours ago

Responsiveness

1 day

Pull Requests (30d)
384
Issues (30d)
333
Star History
1,999 stars in the last 90 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Mike McNeil Mike McNeil(Author of Sails.js; Cofounder of Fleet), and
10 more.

meilisearch by meilisearch

0.2%
53k
Search engine API for integrating AI-powered hybrid search
created 7 years ago
updated 1 day ago
Feedback? Help us improve.