LLMCompiler: An LLM Compiler for Parallel Function Calling
LLMCompiler is a framework that optimizes Large Language Model (LLM) interactions by enabling parallel function calling. Rather than executing function calls one at a time, it automatically identifies tasks that can run concurrently and orchestrates them, addressing the latency, cost, and accuracy issues of sequential execution. It is aimed at researchers and developers building complex LLM-driven applications.
How It Works
LLMCompiler decomposes complex problems into a Directed Acyclic Graph (DAG) of tasks, allowing for parallel execution of LLM function calls. This approach leverages the LLM's reasoning capabilities to determine task dependencies and optimize the execution order, leading to significant speedups and cost reductions compared to traditional sequential methods.
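To make the idea concrete, below is a minimal sketch of DAG-style parallel execution using plain Python asyncio. The task names, dependencies, and functions are hypothetical placeholders, not LLMCompiler's actual API; in the real framework, a planner LLM produces the DAG and each task is a tool or function call.

import asyncio

# Hypothetical DAG: each task lists the tasks whose outputs it depends on.
# Tasks with no unmet dependencies run concurrently, mirroring
# LLMCompiler's parallel function calling.
TASKS = {
    "search_a": {"deps": [], "fn": lambda: "result A"},
    "search_b": {"deps": [], "fn": lambda: "result B"},
    "combine":  {"deps": ["search_a", "search_b"], "fn": lambda: "merged results"},
}

async def run_task(name, done_events, results):
    # Wait for every dependency to finish before executing this task.
    for dep in TASKS[name]["deps"]:
        await done_events[dep].wait()
    results[name] = TASKS[name]["fn"]()  # stand-in for a tool/LLM call
    done_events[name].set()

async def main():
    done_events = {name: asyncio.Event() for name in TASKS}
    results = {}
    # Schedule all tasks at once; the dependency events enforce the DAG order.
    await asyncio.gather(*(run_task(n, done_events, results) for n in TASKS))
    print(results)

asyncio.run(main())

Here search_a and search_b run concurrently, and combine starts only once both have finished, which is the scheduling behavior that yields LLMCompiler's latency savings over sequential calls.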
Quick Start & Requirements
Install dependencies within a Python 3.10 conda environment:
pip install -r requirements.txt
Then run a benchmark with:
python run_llm_compiler.py --benchmark {benchmark-name} --store {store-path}
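As an illustration, assuming hotpotqa is an accepted value for --benchmark (benchmarks discussed in the LLMCompiler paper include HotpotQA and Movie Recommendation) and that results should be written under a local results/ directory:
python run_llm_compiler.py --benchmark hotpotqa --store results/hotpotqa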
Highlighted Details
Maintenance & Community
The project is associated with SqueezeAILab and has been integrated into popular LLM orchestration frameworks like LangChain and LlamaIndex. Updates include support for Friendli endpoints and vLLM.
Licensing & Compatibility
The repository does not explicitly state a license in the README. Users should verify licensing for commercial use or integration into closed-source projects.
Limitations & Caveats
Logging is not yet supported for vLLM. Default prompts are tailored for LLaMA-2 70B and may require adjustments for other models. The roadmap indicates planned Tree-of-Thoughts evaluation.