NVIDIA GPU memory copy library using GPUDirect RDMA
GDRCopy is a low-latency library enabling direct CPU access to GPU memory via NVIDIA GPUDirect RDMA. It's designed for researchers and developers requiring high-performance data transfers between CPU and GPU, offering a CPU-driven copy mechanism with minimal overhead.
How It Works
GDRCopy leverages GPUDirect RDMA APIs to create user-space mappings of GPU memory. This allows GPU memory to be treated like host memory, facilitating efficient CPU-driven data transfers. The approach minimizes overhead by avoiding intermediate copies, though an initial memory pinning phase is required.
Quick Start & Requirements
Build options: from source (make), as RPM packages (build-rpm-packages.sh), or as DEB packages (build-deb-packages.sh).
Highlighted Details
Bundled test tools measure copy bandwidth (gdrcopy_copybw), latency (gdrcopy_copylat), API performance (gdrcopy_apiperf), and ping-pong latency (gdrcopy_pplat).
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
gdr_map() requires addresses aligned to GPU pages; users must ensure alignment.
… cudaMalloc allocations is not well-supported.
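The alignment caveat for gdr_map() can be illustrated with a small C sketch of the address arithmetic involved. This is not GDRCopy's own code. It assumes a 64 KiB GPU page size, and the EX_ macros and ex_ helpers are hypothetical names introduced here for illustration; check the library's header for the constants it actually exposes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumption: GPU pages are 64 KiB. The EX_ names are hypothetical and
   chosen so they cannot clash with macros from a real library header. */
#define EX_GPU_PAGE_SHIFT  16
#define EX_GPU_PAGE_SIZE   ((uintptr_t)1 << EX_GPU_PAGE_SHIFT)
#define EX_GPU_PAGE_OFFSET (EX_GPU_PAGE_SIZE - 1)
#define EX_GPU_PAGE_MASK   (~EX_GPU_PAGE_OFFSET)

/* Returns nonzero when addr sits exactly on a GPU page boundary. */
static int ex_is_gpu_page_aligned(uintptr_t addr) {
    return (addr & EX_GPU_PAGE_OFFSET) == 0;
}

/* First GPU-page-aligned address at or after addr. */
static uintptr_t ex_align_up(uintptr_t addr) {
    return (addr + EX_GPU_PAGE_OFFSET) & EX_GPU_PAGE_MASK;
}

/* Rounds a byte count up to a whole number of GPU pages. */
static size_t ex_round_up_to_gpu_page(size_t size) {
    return (size_t)(((uintptr_t)size + EX_GPU_PAGE_OFFSET) & EX_GPU_PAGE_MASK);
}

/* Bytes to request from the allocator so that an aligned window of
   `size` bytes (rounded up to whole pages) always fits, wherever the
   allocation happens to start: the rounded size plus one extra page. */
static size_t ex_padded_alloc_size(size_t size) {
    assert(size > 0);
    return ex_round_up_to_gpu_page(size) + (size_t)EX_GPU_PAGE_SIZE;
}
```

One way to use these helpers, under the same assumptions: allocate ex_padded_alloc_size(n) bytes of device memory, pass the device address through ex_align_up(), and hand only the aligned address to the pin and map calls. The cost of this over-allocation is at most one extra GPU page per buffer.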