blast by stanford-mast

High-performance serving engine for web browsing AI

Created 5 months ago
548 stars

Top 58.3% on SourcePulse

View on GitHub
1 Expert Loves This Project
Project Summary

BLAST is a high-performance serving engine designed for web browsing AI applications, targeting developers who need to integrate AI-powered web interaction into their products. It offers an OpenAI-compatible API, automatic caching, parallelism, and streaming capabilities to reduce costs and improve latency for automated workflows and local usage.

How It Works

BLAST functions as a serving engine that handles AI-driven web browsing tasks. It employs automatic parallelism and prefix caching to optimize performance and reduce operational costs. The system supports streaming of LLM output, enabling real-time user experiences, and is built for efficient concurrency to manage multiple users without excessive resource consumption.
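The OpenAI-compatible streaming interface described above can be sketched with only the Python standard library. This is a minimal illustration, not BLAST's documented client flow: the host, port, endpoint path, and model name below are assumptions, and the server is expected to emit OpenAI-style server-sent events.

```python
import json
import urllib.request

# Assumed local BLAST endpoint; the port and path are illustrative placeholders.
BASE_URL = "http://127.0.0.1:8000"


def build_chat_request(task: str, stream: bool = True) -> dict:
    """Build an OpenAI-style chat-completions payload for a browsing task."""
    return {
        "model": "not-needed",  # placeholder: the serving engine picks the model
        "messages": [{"role": "user", "content": task}],
        "stream": stream,  # request token-by-token streaming of LLM output
    }


def run(task: str) -> None:
    """POST a task and print streamed deltas as they arrive (SSE framing)."""
    req = urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(build_chat_request(task)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        for raw in resp:
            line = raw.decode().strip()
            # Each SSE line looks like `data: {...}`; `data: [DONE]` ends the stream.
            if line.startswith("data: ") and line != "data: [DONE]":
                chunk = json.loads(line[len("data: "):])
                print(chunk["choices"][0]["delta"].get("content", ""), end="")


if __name__ == "__main__":
    run("Find the top story on Hacker News")
```

Because the payload follows the OpenAI chat-completions shape, any OpenAI SDK pointed at the local base URL should work the same way.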

Quick Start & Requirements

Highlighted Details

  • OpenAI-Compatible API for easy integration.
  • High performance through automatic parallelism and prefix caching.
  • Real-time streaming of LLM output.
  • Built-in support for concurrent users with efficient resource management.

Maintenance & Community

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive MIT license allows for commercial use and integration into closed-source applications.

Limitations & Caveats

BLAST is a serving engine for web browsing AI: it orchestrates and serves AI-driven browsing tasks rather than supplying the LLM or browsing capabilities itself, so it must be paired with an existing model provider. Specific performance benchmarks and resource requirements are not documented in the README.

Health Check

  • Last Commit: 3 weeks ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 1
  • Issues (30d): 0
  • Star History: 3 stars in the last 30 days

Explore Similar Projects

Starred by Jason Knight (Director AI Compilers at NVIDIA; Cofounder of OctoML), Omar Sanseviero (DevRel at Google DeepMind), and 11 more.

mistral.rs by EricLBuehler

0.3%
6k
LLM inference engine for blazing fast performance
Created 1 year ago
Updated 1 day ago
Starred by Clement Delangue (Cofounder of Hugging Face), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 26 more.

datasets by huggingface

0.1%
21k
Access and process large AI datasets efficiently
Created 5 years ago
Updated 1 day ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Clement Delangue (Cofounder of Hugging Face), and 58 more.

vllm by vllm-project

1.1%
58k
LLM serving engine for high-throughput, memory-efficient inference
Created 2 years ago
Updated 14 hours ago