resource-stream by gpu-mode

CUDA resource collection for GPU programming

created 1 year ago
1,642 stars

Top 26.2% on sourcepulse

Project Summary

This repository is a curated collection of GPU programming resources, with a primary focus on CUDA. It targets developers, researchers, and students interested in high-performance computing, AI acceleration, and kernel development, and gives them a central place to find learning materials and practical implementations.

How It Works

The project aggregates links to books, papers, blog posts, videos, and code repositories related to CUDA, Triton, PyTorch performance, and GPU architecture. It aims to provide a structured learning path from introductory concepts to advanced optimization techniques and research frontiers in GPU computing.

Quick Start & Requirements

  • Contribution: Submit links via pull request or Discord.
  • Learning: Primarily relies on external resources (links provided).
  • Prerequisites: Access to Discord and YouTube for live sessions and recordings.

Highlighted Details

  • Comprehensive coverage from "1st Contact with CUDA" to advanced topics like CUDA Graphs and Tensor Core programming.
  • Links to key libraries and tools: CUTLASS, Triton, PyTorch C++ API, Numba, TVM, JAX, CuPy, Codon.
  • Includes resources on foundational AI systems, data-centric AI, and specific optimizations like FlashAttention-2.
  • Features "CUDA Grandmasters" such as Tri Dao and Tim Dettmers, with links to their influential work.

Maintenance & Community

  • Active community via Discord server (link provided).
  • Regular lectures and reading group sessions with recordings on YouTube.
  • Contributions are encouraged via pull requests to the repository.

Licensing & Compatibility

  • Resource links point to various external sources with their own licenses.
  • Code examples and libraries mentioned have their respective licenses (e.g., MIT, Apache 2.0).

Limitations & Caveats

This is a curated list of links, not a runnable software project. The quality and availability of external resources are subject to their original sources. Some older links or specific implementations might be outdated.

Health Check

  • Last commit: 6 months ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

168 stars in the last 90 days

Explore Similar Projects

Starred by David Cournapeau (Author of scikit-learn), Stas Bekman (Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and 4 more.

lectures by gpu-mode

0.4%
5k
Lecture series for GPU-accelerated computing
created 1 year ago
updated 1 month ago