Discover and explore top open-source AI tools and projects—updated daily.
NVIDIAC++ and Python interface for NVIDIA cuDNN and high-performance kernels
Top 49.6% on SourcePulse
NVIDIA/cudnn-frontend offers a modern, open-source C++ header-only library and Python interface to the NVIDIA cuDNN library. It simplifies access to cuDNN's Graph API and high-performance kernels, targeting developers seeking to optimize deep learning workloads on NVIDIA hardware. The project enables inspection and contribution to core logic through open-sourced kernels, enhancing transparency and customizability.
How It Works
The library provides a Unified Graph API for defining complex computational subgraphs as reusable cudnn_frontend::graph::Graph objects. It abstracts the boilerplate of the backend cuDNN API through simplified C++ and Python bindings (via pybind11). Key advantages include built-in autotuning, support for the latest NVIDIA GPU architectures, and the ability to leverage and contribute to open-sourced, high-performance kernels like optimized GEMM and Native Sparse Attention.
Quick Start & Requirements
pip install nvidia_cudnn_frontendinclude/ directory.python-dev and dependencies listed in requirements.txt. Environment variables CUDAToolkit_ROOT and CUDNN_PATH can override default paths.Highlighted Details
Maintenance & Community
Contributions are actively welcomed. The README does not specify community channels (e.g., Discord, Slack) or list notable contributors or sponsorships.
Licensing & Compatibility
Licensed under the MIT License. This permissive license generally allows for commercial use and integration into closed-source projects.
Limitations & Caveats
The provided README does not explicitly detail limitations, alpha status, or known bugs. Building from source is required for C++ samples and Python bindings, suggesting a focus on integration via the header-only C++ API or the pip-installed Python package.
1 week ago
Inactive
microsoft
ELS-RD
HazyResearch
NVIDIA
NVIDIA
NVIDIA
karpathy
NVIDIA