kyuz0
LLM serving container for AMD Strix Halo hardware
Top 95.4% on SourcePulse
This repository provides optimized Docker/Podman containers and Toolbx integration for serving Large Language Models (LLMs) using vLLM on AMD Ryzen AI Max “Strix Halo” (gfx1151) hardware. It targets engineers and researchers seeking high-performance LLM inference, offering advanced features like RDMA-based distributed clustering for unified memory expansion across multiple nodes. The project enables efficient LLM deployment on specialized AMD hardware, leveraging ROCm nightly builds.
How It Works
The core approach utilizes Fedora 43-based Toolbx-compatible containers built upon TheRock nightly ROCm builds. A key innovation is a custom ROCm/RCCL patch enabling native RDMA/RoCE v2 support, facilitating low-latency, high-bandwidth communication between nodes. This allows for Tensor Parallelism across multiple Strix Halo devices, effectively pooling their memory for larger model deployments. vLLM serves as the inference engine, optimized within this containerized environment.
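As a rough illustration of the Tensor Parallelism setup described above, the following sketch assembles a `vllm serve` command line with vLLM's `--tensor-parallel-size` option. The model identifier `example-org/example-model` and the helper `build_vllm_serve_command` are hypothetical, and the source does not show how the project itself launches vLLM or wires up multi-node clustering, so treat this as a minimal sketch rather than the project's method.

```python
import shlex


def build_vllm_serve_command(model: str, tensor_parallel_size: int, port: int = 8000) -> list[str]:
    """Build an argv list for `vllm serve` (hypothetical helper, not part of the project)."""
    if tensor_parallel_size < 1:
        raise ValueError("tensor_parallel_size must be at least 1")
    return [
        "vllm", "serve", model,
        # Shard the model's weights across this many devices.
        "--tensor-parallel-size", str(tensor_parallel_size),
        "--port", str(port),
    ]


# Placeholder model name; two devices pooled via tensor parallelism.
cmd = build_vllm_serve_command("example-org/example-model", tensor_parallel_size=2)
print(shlex.join(cmd))
```

Building the command as a list keeps arguments unambiguous when handing it to `subprocess.run`; `shlex.join` is used only to display it.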
Quick Start & Requirements
refresh_toolbox.sh for Fedora Toolbx, or distrobox create for Ubuntu.
Kernel parameters (iommu=pt, amdgpu.gttsize=126976, ttm.pages_limit=32505856) may be required for optimal unified memory configuration.
Highlighted Details
start-vllm TUI wizard for simplified model serving setup.
Maintenance & Community
This project is maintained as a hobby in the author's spare time. Support is available via voluntary contributions ("buy me a coffee"). No specific community channels (like Discord or Slack) are listed.
Licensing & Compatibility
The license type is not explicitly stated in the provided README content. Compatibility is specific to AMD Strix Halo (gfx1151) hardware.
Limitations & Caveats
Vision model support is currently unavailable due to a patch disabling vision encoder profiling to prevent indefinite hangs during MIOpen kernel searches. The project primarily targets Fedora 43, with Ubuntu support relying on Distrobox.
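The kernel parameters listed under Quick Start & Requirements can be sanity-checked with a little arithmetic. The sketch below assumes that amdgpu.gttsize is expressed in MiB, that ttm.pages_limit counts memory pages, and that pages are 4 KiB; under those assumptions both values describe the same 124 GiB limit. The helper `pages_to_mib` is hypothetical and only illustrates the conversion.

```python
PAGE_SIZE_BYTES = 4096      # assumption: 4 KiB pages
BYTES_PER_MIB = 1024 * 1024


def pages_to_mib(pages: int, page_size: int = PAGE_SIZE_BYTES) -> int:
    """Convert a page count to whole MiB (hypothetical helper)."""
    return pages * page_size // BYTES_PER_MIB


gttsize_mib = 126976         # amdgpu.gttsize, assumed to be in MiB
ttm_pages_limit = 32505856   # ttm.pages_limit, assumed to be in pages

limit_mib = pages_to_mib(ttm_pages_limit)
print(limit_mib)             # 126976
print(gttsize_mib / 1024)    # 124.0 (GiB)
print(limit_mib == gttsize_mib)  # True
```

If one of the two values is changed for a different memory budget, the same conversion can be used to keep the other consistent.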