Container-based computing cluster platform
Backend.AI is a container-based computing cluster platform designed for multi-tenant, on-demand computation sessions. It supports a wide range of programming languages and ML frameworks, with pluggable heterogeneous accelerator support including GPUs (CUDA, ROCm), TPUs, and NPUs. The platform is ideal for research institutions, data science teams, and organizations needing scalable, isolated compute environments.
How It Works
Backend.AI uses a distributed architecture: a central Manager handles routing and scaling, and Agents running on the compute nodes manage containers. Sokovan serves as its orchestrator. Sessions are exposed through REST and GraphQL APIs, with direct WebSocket tunneling to in-container applications such as Jupyter, VSCode, and SSH. A storage abstraction layer (vfolders) provides unified access to network storage with customizable access controls.
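The Manager/Agent split described above can be illustrated with a toy model. This is a minimal sketch under stated assumptions, not Backend.AI's actual scheduler or API: the `Agent` and `Manager` classes, the `create_session` method, the slot names (`cpu`, `cuda`), and the first-fit placement rule are all hypothetical and chosen only to show how a central component might route a session request to a node with enough free resources.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Agent:
    """Hypothetical stand-in for an Agent on one compute node."""

    name: str
    capacity: dict[str, int]  # total resource slots, e.g. {"cpu": 8, "cuda": 2}
    sessions: dict[str, dict[str, int]] = field(default_factory=dict)

    def free(self) -> dict[str, int]:
        """Return the slots not yet reserved by running sessions."""
        used: dict[str, int] = {}
        for request in self.sessions.values():
            for slot, amount in request.items():
                used[slot] = used.get(slot, 0) + amount
        return {slot: total - used.get(slot, 0) for slot, total in self.capacity.items()}

    def can_fit(self, request: dict[str, int]) -> bool:
        """Check whether every requested slot is available on this node."""
        free = self.free()
        return all(free.get(slot, 0) >= amount for slot, amount in request.items())


@dataclass
class Manager:
    """Hypothetical stand-in for the central Manager (first-fit placement)."""

    agents: list[Agent]

    def create_session(self, session_id: str, request: dict[str, int]) -> Optional[str]:
        """Reserve slots on the first agent that fits; return its name or None."""
        for agent in self.agents:
            if agent.can_fit(request):
                agent.sessions[session_id] = dict(request)
                return agent.name
        return None


if __name__ == "__main__":
    manager = Manager([
        Agent("node-a", {"cpu": 4, "cuda": 0}),
        Agent("node-b", {"cpu": 8, "cuda": 2}),
    ])
    # A GPU request skips the CPU-only node and lands on node-b.
    print(manager.create_session("s1", {"cpu": 2, "cuda": 1}))
```

First-fit is used here only because it is the simplest placement rule to read; a real orchestrator would weigh load, accelerator topology, and tenant quotas.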
Quick Start & Requirements
For a single-node development setup, run scripts/install-dev.sh after cloning the repository.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README mentions an "enterprise edition" with additional features, implying some functionality may be proprietary. While a single-node development script is provided, multi-node production setup details are deferred to external documentation.