ray-llm by ray-project

LLM deployment framework on Ray (now upstreamed to Ray)

Created 3 years ago

1,263 stars

Top 30.7% on SourcePulse

View on GitHub

7 Experts Love This Project

Cofounder of Langfuse

Travis Fischer

Founder of Agentic

and 3 more!

Project Summary

This repository, ray-project/ray-llm, is an archived project that previously provided APIs for deploying Large Language Models (LLMs) on the Ray distributed computing framework. The functionality has been migrated to the main Ray repository, specifically within ray.serve.llm and ray.data.llm, offering a more integrated and maintained solution for LLM deployment.

How It Works

The project leveraged the Ray ecosystem to facilitate distributed LLM inference and data processing. By integrating LLM deployment into Ray's core libraries, it aimed to simplify the complexities of scaling LLMs across multiple nodes and GPUs, enabling efficient serving and data handling for LLM-related workloads.

Highlighted Details

APIs for LLM deployment and data handling are now integrated into the main Ray repository.
Functionality is available via ray.serve.llm and ray.data.llm.

Maintenance & Community

This repository is archived and no longer maintained. The Ray team now manages the LLM-related APIs within the main Ray project.

Licensing & Compatibility

The original licensing information is not directly available as the repository is archived. However, the main Ray project is typically licensed under Apache 2.0, which is permissive for commercial use.

Limitations & Caveats

This repository is archived and will not receive further updates or maintenance. Users should migrate to the integrated APIs within the main Ray project for current LLM deployment needs.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

0 stars in the last 30 days