Discover and explore top open-source AI tools and projects—updated daily.
RyanCodraiAccelerate vector search with data-oblivious quantization
New!
Top 95.6% on SourcePulse
Fast vector quantization in Rust with Python bindings, implementing Google Research's TurboQuant (ICLR 2026). It offers a data-oblivious approach to vector compression, eliminating the need for training codebooks or retraining when data changes, unlike methods such as FAISS PQ. This results in faster index creation, simplified infrastructure, and competitive recall for similarity search applications.
How It Works
TurboQuant compresses vectors by treating them as directions on a hypersphere. The process involves normalizing vectors to unit length, applying a random orthogonal rotation to ensure predictable coordinate distributions (Beta, converging to Gaussian), and then using Lloyd-Max scalar quantization to optimally bucket each coordinate based on this known distribution. Finally, quantized coordinates are bit-packed for significant compression (e.g., 16x for 2-bit). Search is accelerated by rotating the query into the same domain and scoring directly against precomputed codebook values using SIMD intrinsics. This data-oblivious method avoids costly training steps and allows dynamic addition of new vectors.
Quick Start & Requirements
pip install maturin
cd turbovec-python
RUSTFLAGS="-C target-cpu=native" maturin build --release
pip install target/wheels/*.whl
cargo build --release
maturin.python3 benchmarks/download_data.py all). Individual benchmark scripts are located in benchmarks/suite/.Highlighted Details
Maintenance & Community
No specific details regarding maintainers, community channels (e.g., Discord, Slack), or roadmap were found in the provided README.
Licensing & Compatibility
The README does not explicitly state the project's license. This absence is a critical factor for assessing commercial use or integration into closed-source projects.
Limitations & Caveats
Performance on x86 architectures is currently significantly slower than FAISS, though optimization is in progress. Observed recall differences compared to FAISS vary and warrant further investigation. The lack of explicit licensing information presents a potential adoption blocker.
1 week ago
Inactive
tensorchord
timescale
unum-cloud
pgvector