orca  by santiagomed

Rust framework for LLM orchestration

created 1 year ago
281 stars

Top 93.7% on sourcepulse

Project Summary

Orca is a Rust-based framework for orchestrating Large Language Models (LLMs), targeting developers building portable, efficient, and potentially edge-deployed LLM applications. It aims to simplify LLM pipeline creation and extend LLM capabilities through features like WebAssembly deployment and memory-safe distributed systems.

How It Works

Orca builds on Rust's performance and memory safety. Prompts are written as templates in a Handlebars-like syntax, so they can be generated dynamically from data. The framework loads documents from HTML (URLs or local files) and PDFs, and integrates with vector stores such as Qdrant. It currently supports OpenAI Chat as its LLM backend and provides pipelines that run LLM calls in sequence.
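To make the idea of Handlebars-like prompt templating concrete, here is a minimal sketch that uses only the Rust standard library. It is not Orca's API: the `render` function and the `country` variable are hypothetical names chosen for illustration, and the sketch handles only plain `{{name}}` placeholders, not the rest of the Handlebars syntax.

```rust
use std::collections::HashMap;

/// Replace each `{{name}}` placeholder in `template` with its value from `vars`.
/// Unknown placeholders are left untouched so missing data is easy to spot.
/// Hypothetical helper for illustration; not part of Orca.
fn render(template: &str, vars: &HashMap<&str, &str>) -> String {
    let mut out = String::with_capacity(template.len());
    let mut rest = template;
    while let Some(start) = rest.find("{{") {
        // Copy the literal text that precedes the placeholder.
        out.push_str(&rest[..start]);
        let after_open = &rest[start + 2..];
        match after_open.find("}}") {
            Some(end) => {
                let key = after_open[..end].trim();
                match vars.get(key) {
                    Some(value) => out.push_str(value),
                    None => {
                        // No value supplied: keep the placeholder as written.
                        out.push_str("{{");
                        out.push_str(&after_open[..end]);
                        out.push_str("}}");
                    }
                }
                rest = &after_open[end + 2..];
            }
            None => {
                // Unterminated placeholder: emit the remainder verbatim.
                out.push_str(&rest[start..]);
                rest = "";
            }
        }
    }
    out.push_str(rest);
    out
}

fn main() {
    let mut vars = HashMap::new();
    vars.insert("country", "France");
    let prompt = render("What is the capital of {{country}}?", &vars);
    println!("{prompt}"); // prints "What is the capital of France?"
}
```

Leaving unknown placeholders in the output is a design choice for this sketch; a real templating engine might instead return an error or substitute an empty string.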

Quick Start & Requirements

  • Install Rust: https://www.rust-lang.org/tools/install
  • Add to Cargo.toml: orca = { git = "https://github.com/scrippt-tech/orca", package = "orca-core" }
  • Local development requires cargo-make: cargo install cargo-make
  • Build: $ makers build
  • Test: $ makers test

Highlighted Details

  • Supports prompt templating with Handlebars-like syntax.
  • Can load documents from HTML (URLs/files) and PDF.
  • Integrates with Qdrant for vector store support.
  • Currently supports OpenAI Chat LLM.
  • Enables building simple and sequential LLM pipelines.
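The sequential pipelines mentioned above can be pictured as a chain of steps in which each step's output becomes the next step's input. The following is a rough, standard-library-only sketch of that pattern, not Orca's own pipeline types: `Step`, `fake_llm`, `outline`, `expand`, and `run_pipeline` are all hypothetical names, and `fake_llm` merely echoes its prompt where a real application would call a model.

```rust
/// A pipeline step: takes the previous output and produces the next one.
/// Hypothetical type for illustration; not part of Orca.
type Step = fn(&str) -> String;

/// Stand-in for an LLM call (assumption: a real step would call a model here).
fn fake_llm(prompt: &str) -> String {
    format!("[response to: {prompt}]")
}

/// First step: ask for an outline on a topic.
fn outline(topic: &str) -> String {
    fake_llm(&format!("Write an outline about {topic}"))
}

/// Second step: ask the model to expand the outline from the previous step.
fn expand(previous: &str) -> String {
    fake_llm(&format!("Expand this outline: {previous}"))
}

/// Run the steps in order, feeding each output into the next step.
/// With no steps, the input is returned unchanged.
fn run_pipeline(steps: &[Step], input: &str) -> String {
    steps
        .iter()
        .fold(input.to_string(), |acc, step| step(&acc))
}

fn main() {
    let steps: [Step; 2] = [outline, expand];
    println!("{}", run_pipeline(&steps, "Rust"));
}
```

Plain function pointers keep the sketch small; a fuller design would likely use a trait so that steps can carry state such as a client or a template.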

Maintenance & Community

The project is still in development. Contributions are welcome through issues or pull requests, and the maintainers ask that new features be discussed in an issue first.

Licensing & Compatibility

The repository does not explicitly state a license in the README.

Limitations & Caveats

The framework is under active development, so it may contain bugs and its functionality is limited. Its future direction is still being explored, and suggestions are welcome.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1+ week
  • Pull requests (30d): 1
  • Issues (30d): 0
  • Star history: 4 stars in the last 90 days
