Discover and explore top open-source AI tools and projects—updated daily.
meta-pytorchAutonomous GPU kernel generation and optimization via AI agents
Top 94.3% on SourcePulse
KernelAgent addresses the challenge of autonomously generating and optimizing GPU kernels for PyTorch programs, transforming them into verified Triton kernels. It targets engineers, researchers, and power users seeking to enhance GPU performance by automating complex kernel synthesis and optimization tasks. The primary benefit is the potential for significant performance improvements and reduced manual development effort through an LLM-driven, multi-stage pipeline.
How It Works
KernelAgent employs a multi-stage approach: static analysis to determine the optimal path (lightweight or full LLM pipeline), LLM-assisted refactoring to isolate fusable subgraphs, parallel generation of Triton kernels with strict runtime verification, end-to-end composition of synthesized kernels to rebuild the original forward pass, and a hardware-guided optimization pipeline for iterative performance enhancement. This LLM-driven synthesis and verification process aims for correctness and efficiency.
Quick Start & Requirements
pip install -e .pip install triton or nightly buildgit clone https://github.com/ScalingIntelligence/KernelBench.gitHighlighted Details
Maintenance & Community
The project is hosted by pytorch-labs. The primary community interaction point mentioned is the GitHub Issues page: https://github.com/pytorch-labs/KernelAgent/issues. No specific details on active contributors, sponsorships, or community channels like Discord/Slack are provided in the README.
Licensing & Compatibility
KernelAgent is released under the Apache License 2.0. This license is permissive and generally compatible with commercial use and linking within closed-source projects.
Limitations & Caveats
The system relies heavily on external LLM providers, requiring API keys and potentially incurring costs. Setup involves managing multiple dependencies including PyTorch, Triton, and LLM configurations. Intel XPU support necessitates compatible hardware and specific drivers. The strict verification process may halt if generated kernels fail correctness checks, potentially requiring manual intervention.
2 days ago
Inactive
ByteDance-Seed
cfregly
mirage-project