super-rag  by superagent-ai

RAG pipeline for AI apps

Created 1 year ago
384 stars

Top 74.4% on SourcePulse

Project Summary

Super-Rag provides a high-performance, production-ready REST API for building Retrieval-Augmented Generation (RAG) pipelines. It targets AI application developers seeking a flexible and efficient solution for document summarization, retrieval, reranking, and computational question answering, offering a unified API for complex RAG workflows.

How It Works

Super-Rag employs a modular architecture, allowing customization of document ingestion, chunking, and encoding. It supports various document formats and integrates with multiple vector databases (e.g., Qdrant, Pinecone) and embedding models (e.g., Cohere, HuggingFace, OpenAI). A key feature is its built-in code interpreter powered by E2B.dev, enabling computational Q&A scenarios by executing code within sandboxed environments. Session management via unique IDs facilitates caching for improved performance.
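The ingestion flow described above (split, encode, store, retrieve) can be illustrated with a toy sketch. This is not Super-Rag's code or API: `chunk_text`, `encode`, `cosine`, and `InMemoryIndex` are hypothetical names, a normalised word-count vector stands in for a real embedding model, and a Python list stands in for a vector database such as Qdrant or Pinecone.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass, field


def chunk_text(text: str, max_words: int = 8, overlap: int = 2) -> list[str]:
    """Split text into overlapping word windows (one simple chunking strategy)."""
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be non-negative and smaller than max_words")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def encode(text: str) -> dict[str, float]:
    """Hypothetical stand-in encoder: L2-normalised word counts."""
    counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {token: c / norm for token, c in counts.items()}


def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity of two already-normalised sparse vectors."""
    return sum(weight * b.get(token, 0.0) for token, weight in a.items())


@dataclass
class InMemoryIndex:
    """Hypothetical stand-in for a vector database."""
    items: list[tuple[str, dict[str, float]]] = field(default_factory=list)

    def upsert(self, chunks: list[str]) -> None:
        self.items.extend((chunk, encode(chunk)) for chunk in chunks)

    def query(self, question: str, top_k: int = 1) -> list[str]:
        q = encode(question)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:top_k]]


index = InMemoryIndex()
for document in (
    "Qdrant stores vectors for similarity search",
    "Poetry installs Python dependencies",
):
    index.upsert(chunk_text(document))

print(index.query("Which tool installs dependencies?"))
# ['Poetry installs Python dependencies']
```

Keeping the splitter, encoder, and index behind small separate interfaces is what makes each stage swappable, which is the point of the modular design the summary describes.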

Quick Start & Requirements

  • Installation: Clone the repository, set up a virtual environment, install dependencies with poetry install, rename .env.example to .env, and run the server with uvicorn main:app --reload.
  • Prerequisites: Python 3.x, Poetry, and potentially an API key for the E2B.dev code interpreter. Configuration of environment variables in .env is required.
  • Resources: Local setup involves standard Python dependencies. Cloud API usage is free within limits.
  • Links: GitHub Repository
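As a small aid for the configuration step above, the following sketch parses `.env`-style text and reports which required keys are missing or empty. The parser is a simplified stand-in (it ignores `export` prefixes and multi-line values), and the key names in the example are hypothetical; the actual variable names are whatever the repository's .env.example lists.

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks, comments, and malformed lines."""
    env: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip().strip('"').strip("'")
    return env


def missing_keys(env: dict[str, str], required: list[str]) -> list[str]:
    """Return the required keys that are absent or set to an empty value."""
    return [key for key in required if not env.get(key)]


# Hypothetical file contents and key names, for illustration only.
sample = """
# example settings
EXAMPLE_API_KEY=abc123
EXAMPLE_DB_URL=
"""
settings = parse_env(sample)
print(missing_keys(settings, ["EXAMPLE_API_KEY", "EXAMPLE_DB_URL", "EXAMPLE_OTHER"]))
# ['EXAMPLE_DB_URL', 'EXAMPLE_OTHER']
```

To check a real file, the text could be read with `pathlib.Path(".env").read_text()` and passed to `parse_env`.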

Highlighted Details

  • Supports multiple document formats and vector databases.
  • Production-ready REST API powered by FastAPI.
  • Customizable document splitting and encoding with various providers.
  • Built-in code interpreter mode for computational Q&A via E2B.dev.
  • Session management for caching.
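One way session-scoped caching can work is sketched below: results are keyed by a session ID plus the query, so a repeated question within the same session skips the expensive step. This illustrates the general pattern only and is an assumption, not Super-Rag's implementation; `SessionCache` and `answer` are hypothetical names.

```python
from typing import Callable


class SessionCache:
    """Cache results per (session_id, query) pair and count hits and misses."""

    def __init__(self) -> None:
        self._store: dict[tuple[str, str], str] = {}
        self.hits = 0
        self.misses = 0

    def get_or_compute(
        self, session_id: str, query: str, compute: Callable[[str], str]
    ) -> str:
        key = (session_id, query)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        self.misses += 1
        result = compute(query)
        self._store[key] = result
        return result


def answer(query: str) -> str:
    """Placeholder for an expensive retrieval or code-execution step."""
    return f"answer to: {query}"


cache = SessionCache()
cache.get_or_compute("session-1", "total revenue?", answer)  # computed
cache.get_or_compute("session-1", "total revenue?", answer)  # served from cache
cache.get_or_compute("session-2", "total revenue?", answer)  # new session, recomputed
print(cache.hits, cache.misses)  # 1 2
```

Including the session ID in the key keeps one session's cached results from leaking into another's, at the cost of recomputing identical queries across sessions.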

Maintenance & Community

The project is maintained by superagent-ai. The README does not list community channels (Discord, Slack) or a roadmap.

Licensing & Compatibility

The README does not specify a license. This requires clarification for commercial use or integration into closed-source projects.

Limitations & Caveats

The license is not specified, which is a significant caveat for adoption. Several planned features, including Mistral and Anthropic encoder support and Chroma vector database integration, are marked as "coming soon."

Health Check

  • Last Commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 2 stars in the last 30 days

Explore Similar Projects

Starred by Jared Palmer (Ex-VP AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), Vincent Weisser (Cofounder of Prime Intellect), and 8 more.

llm-vscode by huggingface

0.1%
1k
VSCode extension for LLM-powered code development
Created 2 years ago
Updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Nir Gazit (Cofounder of Traceloop), and 4 more.

llmware by llmware-ai

0.6%
14k
Framework for enterprise RAG pipelines using small, specialized models
Created 2 years ago
Updated 1 month ago