ray-llm  by ray-project

LLM deployment framework on Ray (now upstreamed to Ray)

created 2 years ago
1,260 stars

Top 32.0% on sourcepulse

GitHubView on GitHub
Project Summary

This repository, ray-project/ray-llm, is an archived project that previously provided APIs for deploying Large Language Models (LLMs) on the Ray distributed computing framework. The functionality has been migrated to the main Ray repository, specifically within ray.serve.llm and ray.data.llm, offering a more integrated and maintained solution for LLM deployment.

How It Works

The project leveraged the Ray ecosystem to facilitate distributed LLM inference and data processing. By integrating LLM deployment into Ray's core libraries, it aimed to simplify the complexities of scaling LLMs across multiple nodes and GPUs, enabling efficient serving and data handling for LLM-related workloads.

Highlighted Details

  • APIs for LLM deployment and data handling are now integrated into the main Ray repository.
  • Functionality is available via ray.serve.llm and ray.data.llm.

Maintenance & Community

This repository is archived and no longer maintained. The Ray team now manages the LLM-related APIs within the main Ray project.

Licensing & Compatibility

The original licensing information is not directly available as the repository is archived. However, the main Ray project is typically licensed under Apache 2.0, which is permissive for commercial use.

Limitations & Caveats

This repository is archived and will not receive further updates or maintenance. Users should migrate to the integrated APIs within the main Ray project for current LLM deployment needs.

Health Check
Last commit

4 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
4 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jerry Liu Jerry Liu(Cofounder of LlamaIndex), and
4 more.

llama-hub by run-llama

0.0%
3k
Data loaders for LLMs (deprecated, now in LlamaIndex core)
created 2 years ago
updated 1 year ago
Feedback? Help us improve.