NVIDIA Data Science Stack: tool for GPU-accelerated data science setup
Top 74.9% on sourcepulse
This repository provides the NVIDIA Data Science Stack, a tool designed to simplify the setup and management of GPU-accelerated data science environments on workstations and cloud VMs. It targets data scientists and researchers seeking a streamlined way to deploy and manage their development stacks, offering both containerized and local Conda environment options.
How It Works
The stack utilizes a shell script to automate system configuration, including NVIDIA driver installation and SELinux policy setup for containerized GPU access. Users can then choose to build and run Jupyter environments within Docker containers or local Conda environments. The script manages dependencies and provides commands for building, running, purging, and upgrading these environments, abstracting away much of the complexity of manual setup.
Quick Start & Requirements
./data-science-stack setup-system
.Highlighted Details
jupyter-repo2docker
, NVIDIA GPU Cloud CLI, Kaggle CLI, and AWS CLI.Maintenance & Community
The project is maintained by NVIDIA. Issue tracking and release planning are available via GitHub Projects and Issues. Users can subscribe to release notifications by watching the repository.
Licensing & Compatibility
The repository itself is licensed under the Apache 2.0 license. However, it installs and configures NVIDIA drivers and software, which have their own licensing terms. Compatibility with commercial or closed-source applications depends on the underlying NVIDIA software licenses.
Limitations & Caveats
1 year ago
1+ week