ROCm library for boosting AMD GPU performance on Windows
Top 54.0% on SourcePulse
This repository provides optimized ROCm libraries for various AMD GPU architectures, primarily targeting Windows users and enabling AI workloads (like Llama, Stable Diffusion) via the ZLUDA CUDA wrapper. It aims to bridge the gap in official ROCm support for certain AMD GPUs, offering significant performance improvements over DirectML.
How It Works
The project compiles and distributes custom-built ROCm libraries, derived from official Linux versions with added optimizations. These libraries are designed to be drop-in replacements for existing ROCm components, enhancing performance for specific GPU architectures. The approach leverages community-driven builds and environment variable overrides (like HSA_OVERRIDE_GFX_VERSION) to enable compatibility on unsupported hardware.
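To make the environment-variable override concrete, here is a minimal Python sketch that launches a child process with HSA_OVERRIDE_GFX_VERSION set. The helper names (build_env, run_with_override) and the value "10.3.0" are illustrative assumptions, not taken from the project; the correct value depends on your GPU, and whether the variable is honored depends on your ROCm/HIP setup.

```python
import os
import subprocess
import sys


def build_env(gfx_version, base_env=None):
    """Return a copy of the environment with HSA_OVERRIDE_GFX_VERSION set.

    The base environment is copied, so the caller's mapping is not modified.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["HSA_OVERRIDE_GFX_VERSION"] = gfx_version
    return env


def run_with_override(command, gfx_version):
    """Run `command` (a list of arguments) with the override applied.

    Returns the child's exit code.
    """
    completed = subprocess.run(command, env=build_env(gfx_version), check=False)
    return completed.returncode


if __name__ == "__main__":
    # "10.3.0" is an example value only (an assumption, not a recommendation).
    # The child process here simply echoes the variable it received.
    child = [
        sys.executable,
        "-c",
        "import os; print(os.environ['HSA_OVERRIDE_GFX_VERSION'])",
    ]
    run_with_override(child, "10.3.0")
```

Setting the variable only for the child process keeps the override scoped to the one workload that needs it, rather than changing it system-wide.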
Quick Start & Requirements
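The manual install described in this section comes down to a backup followed by a copy. A minimal Python sketch of that procedure follows, assuming the archive has already been extracted. The function name install_rocblas and all of its parameters are hypothetical, and the layout of the extracted archive is not assumed: you pass the paths to the new rocblas.dll and to the folder whose contents belong in bin\rocblas.

```python
import shutil
from pathlib import Path


def install_rocblas(new_dll, new_rocblas_dir, hip_bin_dir, backup_dir):
    """Back up the current rocBLAS files, then copy the replacements in.

    new_dll         -- path to the replacement rocblas.dll
    new_rocblas_dir -- folder whose contents go into <bin>/rocblas
    hip_bin_dir     -- the existing bin folder of the HIP SDK installation
    backup_dir      -- where the original files are saved (must not exist yet)
    """
    new_dll, new_rocblas_dir = Path(new_dll), Path(new_rocblas_dir)
    hip_bin, backup = Path(hip_bin_dir), Path(backup_dir)

    # Refuse to overwrite an earlier backup.
    if backup.exists():
        raise FileExistsError(f"backup directory already exists: {backup}")
    backup.mkdir(parents=True)

    old_dll = hip_bin / "rocblas.dll"
    old_dir = hip_bin / "rocblas"

    # Step 1: back up whatever is currently installed.
    if old_dll.exists():
        shutil.copy2(old_dll, backup / "rocblas.dll")
    if old_dir.exists():
        shutil.copytree(old_dir, backup / "rocblas")

    # Step 2: copy the replacement DLL and library files into place.
    shutil.copy2(new_dll, old_dll)
    shutil.copytree(new_rocblas_dir, old_dir, dirs_exist_ok=True)
    return backup
```

The sketch requires Python 3.8 or later for dirs_exist_ok. It merges the new library files into the existing bin\rocblas folder rather than deleting that folder first, so restoring means copying the backup contents back over the same locations.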
Download the .zip or .7z archive from the releases page matching your HIP SDK version.
Back up the existing rocblas.dll and the rocblas directory in your HIP SDK's bin folder.
Extract the downloaded archive, placing the library files into the corresponding bin\rocblas directory and rocblas.dll into the bin directory of your HIP SDK installation.
.7z files are provided for HIP SDK versions 5.7.1, 6.1.2, and 6.2.4.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats