Hierarchical reasoning for complex tasks
New!
Top 20.3% on sourcepulse
The Hierarchical Reasoning Model (HRM) addresses limitations in current LLM reasoning, such as brittle task decomposition and high latency, by introducing a novel recurrent neural network architecture inspired by human brain processing. It targets AI researchers and practitioners seeking efficient and stable solutions for complex, goal-oriented reasoning tasks, offering significant performance gains with a compact model size.
How It Works
HRM employs two interdependent recurrent modules: a high-level planner for abstract, slow processing and a low-level module for rapid, detailed computations. This hierarchical, multi-timescale approach allows for deep sequential reasoning within a single forward pass, avoiding explicit supervision of intermediate steps and enhancing training stability and efficiency compared to traditional methods.
Quick Start & Requirements
packaging
, ninja
, wheel
, setuptools
, setuptools-scm
, requirements.txt
. Weights & Biases (wandb
) integration is used for experiment tracking.Highlighted Details
Maintenance & Community
The project is associated with SapientAI. The README includes a citation for a 2025 arXiv paper, indicating recent development. No community links (Discord, Slack) are provided.
Licensing & Compatibility
The README does not explicitly state a license. The presence of setuptools-scm
might imply a standard open-source license, but this requires verification.
Limitations & Caveats
The project requires specific CUDA versions (12.6) and manual installation of CUDA extensions and FlashAttention, which can be complex. Small-sample learning may exhibit accuracy variance, and late-stage overfitting on tasks like Sudoku-Extreme can cause numerical instability, suggesting the need for early stopping.
4 days ago
Inactive