ray-educational-materials  by ray-project

Educational materials for scaling Python and ML workloads with Ray

created 2 years ago
422 stars

Top 70.8% on sourcepulse

Project Summary

This repository provides hands-on educational materials for learning and applying the Ray distributed computing framework to scale Python and machine learning workloads. It targets developers and researchers looking to efficiently handle tasks like computer vision, NLP, and time-series forecasting on distributed systems.

How It Works

The materials are organized into modules that begin with core Ray concepts: remote functions (tasks), remote objects, and stateful actors. They then progress to practical applications such as scaling batch inference and model training, with specific examples for computer vision and LLMs. Throughout, the emphasis is on hands-on implementation and on understanding how Ray's distributed primitives combine to build scalable ML applications.

Quick Start & Requirements

  • Installation: Follow instructions within individual module notebooks.
  • Prerequisites: Python, Ray library. Specific modules may require additional ML libraries (e.g., Hugging Face Transformers, PyTorch).
  • Resources: Requires a local machine or cluster environment where Ray can be installed and executed.
  • Links: Ray Documentation, Official Ray Site

Highlighted Details

  • Comprehensive coverage from Ray Core fundamentals to advanced use cases.
  • Includes specific examples for scaling CV, NLP (LLMs), and time-series forecasting.
  • Modules on LLM fine-tuning, distributed hyperparameter tuning, and serving with Ray Serve.
  • Covers Ray observability features like the Ray State API and Dashboard.

Maintenance & Community

  • Developed by Anyscale Inc.
  • Active community engagement through Slack, discussion boards, and meetups.
  • Contributions welcomed via GitHub issues for feature requests and bug reports.
  • Links: Ray Community Slack, GitHub Issues

Licensing & Compatibility

  • License: Apache License 2.0.
  • Compatibility: Permissive license allows for commercial use and integration with closed-source applications.

Limitations & Caveats

The materials are focused on demonstrating Ray's capabilities and assume a foundational understanding of Python and machine learning concepts. Some advanced modules might require significant computational resources for practical execution.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 week
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 32 stars in the last 90 days
