infinity  by infiniflow

AI-native database for LLM applications

created 3 years ago
3,943 stars

Top 12.6% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

Infinity is an AI-native database designed for LLM applications, offering hybrid search capabilities across dense vectors, sparse vectors, tensors, and full-text data. It targets developers building RAG applications, conversational AI, and recommendation systems, providing fast and flexible data retrieval.

How It Works

Infinity employs a hybrid search approach, combining multiple indexing and retrieval strategies. It supports dense vector search with low latency (0.1ms) and high throughput (15K+ QPS), alongside efficient full-text search (1ms, 12K+ QPS on 33M documents). The system handles rich data types and offers various reranking methods like RRF and weighted sum, enabling sophisticated query combinations.

Quick Start & Requirements

Highlighted Details

  • Achieves 0.1ms query latency and 15K+ QPS on million-scale vector datasets.
  • Supports hybrid search (dense, sparse, tensor, full-text) with filtering.
  • Offers a single-binary, dependency-free architecture for easy deployment.
  • Intuitive Python API for seamless integration into AI workflows.

Maintenance & Community

  • Active development with a public roadmap.
  • Community channels available via Discord and Twitter.

Licensing & Compatibility

  • License details are not explicitly stated in the provided README snippet.

Limitations & Caveats

  • The SDK version 0.6.0.dev3 indicates a pre-release or development version, potentially subject to instability or breaking changes.
  • Full licensing information is not immediately available in the README.
Health Check
Last commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
102
Issues (30d)
16
Star History
410 stars in the last 90 days

Explore Similar Projects

Starred by Jason Liu Jason Liu(Author of Instructor) and Ross Taylor Ross Taylor(Cofounder of General Reasoning; Creator of Papers with Code).

Search-R1 by PeterGriffinJin

1.3%
3k
RL framework for training LLMs to use search engines
created 5 months ago
updated 3 weeks ago
Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Zhiqiang Xie Zhiqiang Xie(Author of SGLang), and
7 more.

milvus by milvus-io

0.4%
36k
Cloud-native vector database for scalable ANN search
created 5 years ago
updated 22 hours ago
Feedback? Help us improve.