hipfire by Kaden-Schutt

RDNA-native LLM inference engine for AMD GPUs

Created 3 months ago

473 stars

Top 63.7% on SourcePulse

Project Summary

Summary

hipfire provides an RDNA-native LLM inference engine in Rust, targeting AMD GPUs, especially consumer hardware often overlooked by official ROCm support. It offers a unified, high-performance solution for developers and researchers seeking accelerated inference across the RDNA family without Python or complex ROCm stacks.

How It Works

This project leverages Rust and HIP for RDNA-specific LLM inference, aiming for a single binary that ships pre-compiled kernels and JIT-compiles others. By avoiding Python, PyTorch, and the ROCm userspace stack at runtime, hipfire simplifies dependencies and optimizes performance for the entire RDNA GPU spectrum (RDNA1-RDNA4, consumer, pro, APU).

Quick Start & Requirements

For Linux with ROCm 6+, install via: curl -L https://raw.githubusercontent.com/Kaden-Schutt/hipfire/master/scripts/install.sh | bash. Windows/source builds and verification details are in docs/GETTING_STARTED.md.

Highlighted Details

Performance: Benchmarks on a 7900 XTX show significant decode/prefill speedups over ollama (e.g., 1.71x decode for Qwen 3.5 9B).
DFlash: Implements DFlash speculative decode for further gains (up to 4.45x speedup on HumanEval/53 for 27B model), with genre-conditional performance detailed per-architecture.
API: Offers an OpenAI-compatible API via hipfire serve.
RDNA Support: Explicitly targets RDNA1-RDNA4 GPUs (consumer, pro, APU).

Maintenance & Community

The project is at v0.1.8-alpha.2, indicating early development. A CHANGELOG.md is available. Correctness is emphasized via scripts like ./scripts/coherence-gate-dflash.sh and detailed benchmarking methodology. No community channels or sponsorships are listed.

hipfire by Kaden-Schutt

Summary

Explore Similar Projects

dash-infer by modelscope

xinfer by guoqingbao

dotLLM by kkokosa

rvllm by m0at

ssd by tanishqkumar

atlas by Avarok-Cybersecurity

candle-vllm by EricLBuehler

ZhiLight by zhihu

colibri by JustVugg

picolm by RightNow-AI

mistral.rs by EricLBuehler

text-generation-inference by huggingface