infinity by infiniflow

AI-native database for LLM applications

Created 3 years ago
4,164 stars

Top 11.8% on SourcePulse

Project Summary

Infinity is an AI-native database designed for LLM applications, offering hybrid search capabilities across dense vectors, sparse vectors, tensors, and full-text data. It targets developers building RAG applications, conversational AI, and recommendation systems, providing fast and flexible data retrieval.

How It Works

Infinity takes a hybrid search approach: a single query can combine several indexing and retrieval strategies. Dense vector search is reported at 0.1 ms latency and 15K+ QPS on million-scale datasets, and full-text search at 1 ms latency and 12K+ QPS on 33M documents. The system handles rich data types and can fuse the results of the individual retrievers with reranking methods such as Reciprocal Rank Fusion (RRF) and weighted sum, enabling sophisticated query combinations.
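To make the two fusion methods named above concrete, here is a minimal pure-Python sketch of RRF and weighted-sum fusion. It illustrates the general techniques only and is not Infinity's implementation or API. The function names `rrf_fuse` and `weighted_sum_fuse`, the constant `k=60`, the min-max normalization step, and the sample document ids and scores are all assumptions chosen for the example.

```python
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Reciprocal Rank Fusion over several ranked lists of document ids.

    Each list is ordered best-first. A document's fused score is the sum of
    1 / (k + rank) over every list it appears in (rank starts at 1).
    k=60 is a commonly used default, assumed here for illustration.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document id for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


def weighted_sum_fuse(score_maps, weights):
    """Weighted-sum fusion over several {doc_id: raw_score} mappings.

    Raw scores from different retrievers are not comparable, so each mapping
    is min-max normalized to [0, 1] before the weights are applied
    (the normalization choice is an assumption of this sketch).
    """
    fused = defaultdict(float)
    for scores, weight in zip(score_maps, weights):
        if not scores:
            continue
        lo, hi = min(scores.values()), max(scores.values())
        span = hi - lo
        for doc_id, raw in scores.items():
            normalized = (raw - lo) / span if span > 0 else 1.0
            fused[doc_id] += weight * normalized
    return sorted(fused.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical results from a dense-vector retriever and a full-text retriever.
dense_scores = {"a": 0.9, "b": 0.8, "c": 0.1}
fulltext_scores = {"b": 12.0, "d": 7.0, "a": 2.0}

# RRF needs only the rank order from each retriever.
dense_ranking = sorted(dense_scores, key=dense_scores.get, reverse=True)
fulltext_ranking = sorted(fulltext_scores, key=fulltext_scores.get, reverse=True)

rrf_result = rrf_fuse([dense_ranking, fulltext_ranking])
weighted_result = weighted_sum_fuse([dense_scores, fulltext_scores], [0.7, 0.3])

print([doc_id for doc_id, _ in rrf_result])
print([doc_id for doc_id, _ in weighted_result])
```

In this sample, document `b` comes out on top under both methods because it sits near the top of both result lists. RRF uses only rank positions, so it needs no score calibration, while weighted sum lets the caller favor one retriever at the cost of having to normalize scores first.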

Quick Start & Requirements

Highlighted Details

  • Achieves 0.1ms query latency and 15K+ QPS on million-scale vector datasets.
  • Supports hybrid search (dense, sparse, tensor, full-text) with filtering.
  • Offers a single-binary, dependency-free architecture for easy deployment.
  • Provides an intuitive Python API for integration into AI workflows.

Maintenance & Community

  • Active development with a public roadmap.
  • Community channels available via Discord and Twitter.

Licensing & Compatibility

  • License details are not explicitly stated in the provided README snippet.

Limitations & Caveats

  • The SDK version, 0.6.0.dev3, is a development pre-release and may be unstable or introduce breaking changes.
  • Full licensing information is not immediately available in the README.

Health Check

  • Last commit: 11 hours ago
  • Responsiveness: 1 day
  • Pull requests (30d): 48
  • Issues (30d): 11
  • Star history: 62 stars in the last 30 days
