infinity  by infiniflow

AI-native database for LLM applications

Created 3 years ago
4,398 stars

Top 11.0% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

Infinity is an AI-native database designed for LLM applications, offering hybrid search capabilities across dense vectors, sparse vectors, tensors, and full-text data. It targets developers building RAG applications, conversational AI, and recommendation systems, providing fast and flexible data retrieval.

How It Works

Infinity employs a hybrid search approach, combining multiple indexing and retrieval strategies. It supports dense vector search with low latency (0.1ms) and high throughput (15K+ QPS), alongside efficient full-text search (1ms, 12K+ QPS on 33M documents). The system handles rich data types and offers various reranking methods like RRF and weighted sum, enabling sophisticated query combinations.

Quick Start & Requirements

Highlighted Details

  • Achieves 0.1ms query latency and 15K+ QPS on million-scale vector datasets.
  • Supports hybrid search (dense, sparse, tensor, full-text) with filtering.
  • Offers a single-binary, dependency-free architecture for easy deployment.
  • Intuitive Python API for seamless integration into AI workflows.

Maintenance & Community

  • Active development with a public roadmap.
  • Community channels available via Discord and Twitter.

Licensing & Compatibility

  • License details are not explicitly stated in the provided README snippet.

Limitations & Caveats

  • The SDK version 0.6.0.dev3 indicates a pre-release or development version, potentially subject to instability or breaking changes.
  • Full licensing information is not immediately available in the README.
Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
39
Issues (30d)
26
Star History
52 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Pawel Garbacki Pawel Garbacki(Cofounder of Fireworks AI), and
1 more.

llm-app by pathwaycom

0.2%
56k
LLM app templates for RAG, AI pipelines, and enterprise search
Created 2 years ago
Updated 1 month ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Anton Troynikov Anton Troynikov(Cofounder of Chroma), and
47 more.

llama_index by run-llama

0.3%
47k
Data framework for building LLM-powered agents
Created 3 years ago
Updated 1 day ago
Feedback? Help us improve.