ruvector  by ruvnet

AI vector database that learns and scales

Created 3 months ago
493 stars

Top 62.7% on SourcePulse

GitHubView on GitHub
Project Summary

RuVector is a distributed vector database designed to be "smarter" over time, addressing limitations of static vector stores. It targets developers and researchers needing advanced capabilities like self-learning indices, graph querying, and local LLM integration. The primary benefit is a continuously improving, highly scalable, and versatile data management system that can operate offline and integrate seamlessly across various platforms.

How It Works

RuVector employs a multi-faceted approach, integrating a core vector database engine (HNSW indexing, SIMD acceleration) with graph database capabilities (Cypher query language) and advanced AI features. Its key differentiator is the use of Graph Neural Networks (GNNs) to learn from query patterns, enhancing search result relevance over time. Distributed operation is managed via Raft consensus, ensuring fault tolerance and horizontal scalability. It also includes ruvllm, a runtime for executing Large Language Models locally with hardware acceleration, and supports WASM for browser/edge deployment.

Quick Start & Requirements

Installation is straightforward via npm (npm install ruvector) or its CLI (npx ruvector install). While Rust is the core language, typical usage requires Node.js. Hardware acceleration for ruvllm can leverage Metal, CUDA, or ANE, but is not strictly required for basic operation. Extensive documentation is available via links to specific crates and guides within the repository.

Highlighted Details

  • Self-Learning Index: GNN layers adapt search results based on usage patterns.
  • Hybrid Search: Combines semantic (vector) search with keyword (BM25/TF-IDF) and graph queries (Cypher).
  • Distributed Systems: Raft consensus, multi-master replication, and auto-sharding for horizontal scaling.
  • Local LLM Runtime: ruvllm supports GGUF models with hardware acceleration.
  • Platform Agnostic: Runs in Node.js, browsers (WASM), edge devices, and as a PostgreSQL extension.
  • Adaptive Compression: Reduces memory footprint by 2-32x automatically.
  • Extensive AI Features: Includes 39 attention mechanisms, SONA learning, and a Coherence Gate for AI safety.

Maintenance & Community

The project is structured across numerous Rust crates and npm packages, indicating a significant development effort. While specific community links like Discord/Slack are not immediately apparent, the project is hosted on GitHub (ruvnet/ruvector) and features models on Hugging Face (ruv/ruvltra), suggesting an active development base.

Licensing & Compatibility

RuVector is released under the MIT license, which is highly permissive and allows for commercial use, modification, and distribution, including integration into closed-source applications.

Limitations & Caveats

The sheer breadth of features and components, spanning vector databases, graph databases, distributed systems, multiple LLM runtimes, and numerous specialized AI algorithms, implies a potentially steep learning curve. While the MIT license is permissive, the complexity of integrating and managing such a feature-rich system could be a significant adoption hurdle for simpler use cases. Some advanced features, like FPGA acceleration or quantum coherence, may represent bleeding-edge research rather than production-ready components.

Health Check
Last Commit

18 hours ago

Responsiveness

Inactive

Pull Requests (30d)
47
Issues (30d)
23
Star History
473 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.