trieve  by devflowinc

All-in-one platform for search, recommendations, RAG, and analytics offered via API

Created 2 years ago
2,545 stars

Top 18.4% on SourcePulse

GitHubView on GitHub
Project Summary

Trieve is an all-in-one platform designed for developers to integrate search, recommendations, Retrieval-Augmented Generation (RAG), and analytics into their applications. It offers a comprehensive API and SDKs, with a focus on self-hosting and flexibility in model integration.

How It Works

Trieve employs a hybrid search approach, combining semantic vector search using OpenAI or Jina embeddings with Qdrant, and typo-tolerant neural sparse-vector search powered by naver/efficient-splade-VI-BT-large-query. It also supports cross-encoder re-ranking with models like BAAI/bge-reranker-large for enhanced relevance. Features like sub-sentence highlighting, recency biasing, and tunable merchandising are included to optimize search UX.

Quick Start & Requirements

  • Installation: Self-hosting guides are available for AWS, GCP, Kubernetes, and Docker Compose. Local development requires Rust, Node.js (via NVM), and Yarn.
  • Prerequisites: OpenAI API key (or other LLM provider via OpenRouter), curl, gcc, g++, make, pkg-config, python3, python3-pip, libpq-dev, libssl-dev, openssl, postgresql-libs.
  • Setup: Local development involves cloning the repo, installing dependencies, setting environment variables (including LLM API keys), building Rust components, and running Docker services.
  • Documentation: API Reference + Docs

Highlighted Details

  • Supports semantic, full-text, and hybrid search with cross-encoder re-ranking.
  • Offers RAG API routes with topic-based memory management or custom context selection.
  • Enables bringing your own embedding, SPLADE, cross-encoder, and LLM models.
  • Includes features like sub-sentence highlighting, grouping by file, and filtering.

Maintenance & Community

  • Active development with a small, hands-on team.
  • Community channels available via Discord and Matrix.
  • Professional services are offered.

Licensing & Compatibility

  • The project appears to be open-source, but a specific license is not explicitly stated in the README. Compatibility for commercial use or closed-source linking would require clarification on licensing terms.

Limitations & Caveats

  • The README does not explicitly state the open-source license, which is critical for commercial adoption. Local setup instructions are extensive and may require significant time and system configuration.
Health Check
Last Commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
41 stars in the last 30 days

Explore Similar Projects

Starred by Chang She Chang She(Cofounder of LanceDB), Carol Willing Carol Willing(Core Contributor to CPython, Jupyter), and
11 more.

lancedb by lancedb

0.7%
8k
Embedded retrieval engine for multimodal AI
Created 2 years ago
Updated 4 days ago
Feedback? Help us improve.