Discover and explore top open-source AI tools and projects—updated daily.
ray-projectKubernetes operator for Ray application deployment
Top 21.1% on SourcePulse
KubeRay is an open-source Kubernetes operator designed to simplify the deployment and management of Ray applications on Kubernetes. It targets ML engineers and data scientists who need to scale distributed AI workloads, offering robust lifecycle management, autoscaling, and fault tolerance for Ray clusters.
How It Works
KubeRay leverages Kubernetes Custom Resource Definitions (CRDs) including RayCluster, RayJob, and RayService. RayCluster manages the full lifecycle of Ray clusters, enabling autoscaling and fault tolerance. RayJob automates the creation of a Ray cluster and job submission, with optional cluster cleanup. RayService combines a RayCluster with a Ray Serve deployment graph, facilitating zero-downtime upgrades and high availability for model serving.
Quick Start & Requirements
kubectl apply -f <manifest-file>.Highlighted Details
kubectl ray plugin (Beta) for simplified workflows.Maintenance & Community
#kuberay-questions channel. Bi-weekly community meetings are held.Licensing & Compatibility
Limitations & Caveats
21 hours ago
1 week
vllm-project