dgx-spark-playbooks  by NVIDIA

AI/ML workload playbooks for NVIDIA DGX Spark

Created 3 months ago
316 stars

Top 85.6% on SourcePulse

GitHubView on GitHub
Project Summary

This repository offers a curated collection of step-by-step playbooks for configuring and deploying AI/ML workloads on NVIDIA DGX Spark systems, specifically those equipped with Blackwell architecture. It targets engineers and researchers aiming to accelerate AI development and deployment on high-performance NVIDIA hardware.

How It Works

The project provides detailed, actionable playbooks covering installation, configuration, inference, and development environment setup for various AI frameworks and models. Each playbook includes prerequisites, step-by-step instructions, troubleshooting, and example code, designed to leverage the specialized capabilities of DGX Spark devices.

Quick Start & Requirements

Highlighted Details

  • Supports numerous AI frameworks and tools: NVIDIA Comfy UI, Dreambooth LoRA, LLaMA Factory, NeMo, Ollama, vLLM, TRT LLM, SGLang, Unsloth.
  • Encompasses advanced AI tasks: multi-agent chatbots, multi-modal inference, RAG applications, speculative decoding, video search/summarization agents.
  • Includes playbooks for optimizing inference, fine-tuning models, and setting up development environments (e.g., VS Code, AI Workbench).
  • Covers system-level configurations: local network access, multi-Spark NCCL communication, Tailscale setup.

Maintenance & Community

Community support and discussion are available via the NVIDIA Developer Forum. Specific details on project maintainers, sponsorships, or roadmap are not provided in the README snippet.

Licensing & Compatibility

Licensing information is detailed in the repository's LICENSE and LICENSE-3rd-party files. Specific terms, restrictions, or compatibility notes for commercial use are not elaborated upon in the provided text.

Limitations & Caveats

The playbooks are strictly limited to NVIDIA DGX Spark hardware featuring the Blackwell architecture, posing a significant hardware dependency. Setup may require advanced knowledge of AI infrastructure and NVIDIA-specific technologies.

Health Check
Last Commit

5 days ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
6
Star History
77 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), Elvis Saravia Elvis Saravia(Founder of DAIR.AI), and
15 more.

semantic-kernel by microsoft

0.2%
27k
SDK for building intelligent AI agents and multi-agent systems
Created 2 years ago
Updated 2 days ago
Feedback? Help us improve.