Kubernetes operator for Ray application deployment
Top 23.1% on sourcepulse
KubeRay is an open-source Kubernetes operator designed to simplify the deployment and management of Ray applications on Kubernetes. It targets ML engineers and data scientists who need to scale distributed AI workloads, offering robust lifecycle management, autoscaling, and fault tolerance for Ray clusters.
How It Works
KubeRay leverages Kubernetes Custom Resource Definitions (CRDs) including RayCluster
, RayJob
, and RayService
. RayCluster
manages the full lifecycle of Ray clusters, enabling autoscaling and fault tolerance. RayJob
automates the creation of a Ray cluster and job submission, with optional cluster cleanup. RayService
combines a RayCluster
with a Ray Serve deployment graph, facilitating zero-downtime upgrades and high availability for model serving.
Quick Start & Requirements
kubectl apply -f <manifest-file>
.Highlighted Details
kubectl ray
plugin (Beta) for simplified workflows.Maintenance & Community
#kuberay-questions
channel. Bi-weekly community meetings are held.Licensing & Compatibility
Limitations & Caveats
1 day ago
Inactive