llm-app by pathwaycom

LLM app templates for RAG, AI pipelines, and enterprise search

Created 2 years ago

56,296 stars

Top 0.4% on SourcePulse

3 Experts Love This Project

chiphuyen

Author of "AI Engineering", "Designing Machine Learning Systems"

pgarbacki

Cofounder of Fireworks AI

ishaan-jaff

Cofounder of LiteLLM

Project Summary

This repository provides ready-to-run, Docker-friendly application templates for building RAG (Retrieval-Augmented Generation) and AI enterprise search solutions. It targets developers and researchers needing to quickly deploy AI applications that stay synchronized with live data sources like Google Drive, SharePoint, S3, and Kafka, offering significant advantages in data freshness and simplified infrastructure.

How It Works

The core of these applications is the Pathway Live Data framework, a Python library with an embedded Rust engine. This framework handles data synchronization, indexing (vector, hybrid, and full-text search), and API serving in a unified manner. It eliminates the need for separate vector databases, caches, and API frameworks, leveraging in-memory indexing with usearch for vector search and Tantivy for hybrid search. This integrated approach aims for high accuracy and scalability, with templates optimized for simplicity or performance.

Quick Start & Requirements

Each application template includes a README.md with specific instructions.
Templates are runnable as Docker containers, exposing an HTTP API.
Optional Streamlit UIs are available for some templates.
Supports Linux and macOS.

Highlighted Details

Live synchronization with various data sources (files, Google Drive, SharePoint, S3, Kafka, PostgreSQL, real-time APIs).
Built-in in-memory indexing for vector, hybrid, and full-text search.
Templates include RAG, multimodal RAG (GPT-4o), Unstructured-to-SQL, and private RAG with Ollama.
Adaptive RAG technique claims up to 4x token cost reduction while maintaining accuracy.

Maintenance & Community

Active community support via Discord.
Encourages contributions for documentation, features, and bug fixes.
Follow on X: @pathway_com

Licensing & Compatibility

The README does not explicitly state the license.

Limitations & Caveats

The specific license is not clearly stated in the README, which may impact commercial use or closed-source integration.

Health Check

Last Commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)

0

Issues (30d)

0

Star History

1,691 stars in the last 30 days

Explore Similar Projects

Starred by

David Cournapeau

David Cournapeau(Author of scikit-learn),

Abhishek Thakur

Abhishek Thakur(World's First 4x Kaggle GrandMaster), and

1 more.

NyRAG by vespaai-playground

No-code RAG framework for scalable knowledge retrieval

Created 2 months ago

Updated 3 weeks ago

Starred by

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind) and

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

sycamore by aryn-ai

LLM-powered platform for unstructured data search and analytics

Created 2 years ago

Updated 1 day ago

Awesome-RAG by Danielskry

Awesome list of RAG resources

Created 1 year ago

Updated 3 weeks ago

RAGMeUp by SensAI-PT

RAG framework for applying LLMs to custom datasets

Created 1 year ago

Updated 1 day ago

FlashRAG by RUC-NLPIR

Python toolkit for efficient RAG research

Created 1 year ago

Updated 3 months ago

Starred by

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind),

Travis Fischer

Travis Fischer(Founder of Agentic), and

1 more.

cognita by truefoundry

RAG framework for production RAG apps

Created 2 years ago

Updated 2 days ago

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind),

Luca Soldaini

Luca Soldaini(Research Scientist at Ai2), and

5 more.

orama by oramasearch

Browser-based search engine and RAG pipeline

Created 3 years ago

Updated 1 week ago

Starred by

Simon Willison

Simon Willison(Coauthor of Django) and

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind).

nano-graphrag by gusye1234

GraphRAG implementation for simpler, faster knowledge graphs

Created 1 year ago

Updated 4 weeks ago

Starred by

Alex Cheema

Alex Cheema(Cofounder of EXO Labs).

infinity by infiniflow

AI-native database for LLM applications

Created 3 years ago

Updated 1 day ago

Starred by

Chang She

Chang She(Cofounder of LanceDB),

Carol Willing

Carol Willing(Core Contributor to CPython, Jupyter), and

11 more.

lancedb by lancedb

Embedded retrieval engine for multimodal AI

Created 3 years ago

Updated 18 hours ago

Starred by

Didier Lopes

Didier Lopes(Founder of OpenBB),

Gabriel Almeida

Gabriel Almeida(Cofounder of Langflow), and

3 more.

SurfSense by MODSetter

Open-source tool for personal knowledge base research

Created 1 year ago

Updated 17 hours ago

Starred by

Luis Capelo

Luis Capelo(Cofounder of Lightning AI),

Xiaofan Luan

Xiaofan Luan(VP Engineering at Zilliz), and

16 more.

milvus by milvus-io

Cloud-native vector database for scalable ANN search

Created 6 years ago

Updated 17 hours ago

Feedback? Help us improve.