examples by CerebriumAI

ML/AI examples for Cerebrium serverless GPUs

Created 2 years ago
510 stars

Top 61.3% on SourcePulse

Project Summary

This repository collects self-contained examples for building and deploying machine learning and AI applications on Cerebrium's serverless GPU platform. It targets developers and researchers who want to deploy LLMs, process voice, or handle image and video tasks, and it offers ready-to-use code for each of these cases.

How It Works

Examples are organized by category and range from basic deployments to advanced techniques such as vLLM for faster inference and multi-GPU setups. After cloning the repository, each example can be deployed to Cerebrium by running the cerebrium deploy command from its folder, which makes it straightforward to test and integrate different ML models and workflows.

Quick Start & Requirements

  • Install/Run: Clone the repository and run cerebrium deploy within each example folder.
  • Prerequisites: Cerebrium account and CLI. Specific examples may require additional libraries or models detailed within their respective directories.
  • Resources: Requires Cerebrium platform access.
  • Links: Documentation, Discord Community, Slack Community
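
The clone-and-deploy workflow above can be scripted. The following is a minimal sketch, not part of the repository: it assumes each deployable example folder contains a cerebrium.toml file (an assumption about the layout), and the helper names find_example_dirs and deploy_all are hypothetical. It defaults to a dry run that only prints the commands it would execute.

```python
import subprocess
from pathlib import Path

# Assumption: a folder is a deployable example if it contains this config file.
CONFIG_NAME = "cerebrium.toml"


def find_example_dirs(repo_root):
    """Return the sorted folders under repo_root that contain CONFIG_NAME."""
    root = Path(repo_root)
    return sorted(p.parent for p in root.rglob(CONFIG_NAME))


def deploy_all(repo_root, dry_run=True):
    """Run `cerebrium deploy` in each example folder.

    With dry_run=True (the default) nothing is executed; the planned
    commands are printed and returned so they can be reviewed first.
    """
    plan = [(folder, ["cerebrium", "deploy"]) for folder in find_example_dirs(repo_root)]
    for folder, cmd in plan:
        if dry_run:
            print(f"[dry run] cd {folder} && {' '.join(cmd)}")
        else:
            # Requires the Cerebrium CLI to be installed and logged in.
            subprocess.run(cmd, cwd=folder, check=True)
    return plan
```

Calling deploy_all("path/to/clone") lists the planned deployments; passing dry_run=False runs them one after another and stops at the first failure.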

Highlighted Details

  • Covers a wide range of ML tasks: LLMs, voice processing (Whisper, XTTS), image/video generation (ComfyUI, SDXL), and integrations with LangChain/LangSmith.
  • Demonstrates advanced deployment strategies including vLLM for faster inference, multi-GPU setups, and WebSockets for streaming.
  • Includes examples for FastAPI and Gradio for building ML web interfaces.
  • Offers guidance on batching requests with LitServe, vLLM, and transformers.
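
To illustrate the idea behind request batching mentioned above, here is a generic, standard-library-only sketch. It does not use the LitServe, vLLM, or transformers APIs and is not taken from the repository; collect_batch and run_model_on_batch are hypothetical names, and the model call is a stand-in. The sketch gathers queued requests until either a maximum batch size is reached or a time budget runs out, then processes them in one call.

```python
import queue
import time


def collect_batch(requests, max_batch_size=4, timeout_s=0.05):
    """Take up to max_batch_size items from the queue.

    The time budget starts when this function is called; once timeout_s
    has elapsed, whatever has been collected so far is returned.
    """
    batch = []
    deadline = time.monotonic() + timeout_s
    while len(batch) < max_batch_size:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            break
        try:
            batch.append(requests.get(timeout=remaining))
        except queue.Empty:
            break
    return batch


def run_model_on_batch(prompts):
    """Hypothetical stand-in for a single batched model call."""
    return [prompt.upper() for prompt in prompts]
```

A larger max_batch_size tends to improve GPU utilization, while a shorter timeout_s bounds the extra latency a request can incur while waiting for the batch to fill.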

Maintenance & Community

The repository encourages community contributions, offering swag for merged pull requests. Active community channels are available via Discord and Slack for support and discussion.

Licensing & Compatibility

The repository itself is not explicitly licensed in the README. Examples may contain code with varying licenses depending on the underlying libraries and models used. Users should verify compatibility for commercial or closed-source use on a per-example basis.

Limitations & Caveats

The repository focuses solely on demonstrating use cases within the Cerebrium ecosystem; examples may not be directly transferable to other platforms without modification. Specific hardware or software dependencies for individual examples are not centrally listed and must be checked within each example's documentation.

Health Check

  • Last Commit: 3 weeks ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 1
  • Star History: 2 stars in the last 30 days
