Compiler solution for PyTorch operator optimization on diverse accelerators
Antares (AutoRT) is a compiler solution that lets PyTorch users create, benchmark, and optimize custom operators for a range of hardware accelerators. It targets researchers and developers who need to push performance limits or integrate PyTorch with custom hardware backends. It offers two things: acceleration of standard PyTorch applications, and generation of custom or fused operators.
How It Works
Antares uses an intermediate representation (IR) to define operations, which it then compiles and optimizes for a specific backend. Because the operator definition is abstract, the same definition can be compiled for different backends, such as DirectX 12, CUDA, ROCm, and SYCL. Operators can be generated either programmatically (API style) or from the command line, and the system includes an integrated tuning mechanism.
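The idea of defining an operator once and lowering it to several backends can be illustrated with a toy sketch. This is not the Antares IR or the autort API: the `ElementwiseOp` class, the `lower` and `run_reference` helpers, the kernel templates, and the "cpu" backend name are all hypothetical, chosen only to show the pattern of one abstract definition feeding multiple code generators.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ElementwiseOp:
    """Toy backend-agnostic description of y[i] = f(x[i]).

    Hypothetical stand-in for an IR; NOT the Antares IR syntax.
    """
    name: str
    expr: str  # scalar expression over `v`, valid in both C and Python


# Hypothetical per-backend kernel templates (illustration only).
KERNEL_TEMPLATES = {
    "cuda": (
        'extern "C" __global__ void {name}(const float* x, float* y, int n) {{\n'
        "  int i = blockIdx.x * blockDim.x + threadIdx.x;\n"
        "  if (i < n) {{ float v = x[i]; y[i] = {expr}; }}\n"
        "}}\n"
    ),
    "cpu": (
        "void {name}(const float* x, float* y, int n) {{\n"
        "  for (int i = 0; i < n; ++i) {{ float v = x[i]; y[i] = {expr}; }}\n"
        "}}\n"
    ),
}


def lower(op: ElementwiseOp, backend: str) -> str:
    """Turn the abstract op into kernel source text for one backend."""
    if backend not in KERNEL_TEMPLATES:
        raise ValueError(f"unknown backend: {backend}")
    return KERNEL_TEMPLATES[backend].format(name=op.name, expr=op.expr)


def run_reference(op: ElementwiseOp, xs: list) -> list:
    """Evaluate the same expression in Python as a correctness reference."""
    allowed = {"exp": math.exp}
    return [eval(op.expr, {"__builtins__": {}}, {**allowed, "v": x}) for x in xs]


# One definition, two targets.
sigmoid = ElementwiseOp(name="sigmoid_f32", expr="1.0 / (1.0 + exp(-v))")
print(lower(sigmoid, "cuda"))
print(lower(sigmoid, "cpu"))
print(run_reference(sigmoid, [0.0]))  # [0.5]
```

The operator definition never mentions a backend; only `lower` does. A real compiler would also search over schedule parameters (for example, block sizes) for each backend, which is the role the tuning mechanism plays.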
Quick Start & Requirements
pip install autort
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats