Discover and explore top open-source AI tools and projects—updated daily.
sapientincEfficient text generation model pretraining framework
New!
Top 44.5% on SourcePulse
Summary
HRM-Text offers an accessible framework for pretraining foundation models from scratch, drastically reducing compute and data needs. Targeting researchers and engineers, it enables LLM development with costs as low as ~$1000, democratizing access to large-scale model creation.
How It Works
This project employs a Hierarchical Recurrent Architecture (HRM) combined with PrefixLM sequence packing and FlashAttention 3 kernels for efficient processing. Training utilizes PyTorch's FSDP2 for optimized distributed computation. This approach achieves pretraining with orders of magnitude less compute and data than traditional scaling methods.
Quick Start & Requirements
docker run --gpus all --ipc=host --network=host -it -v "$PWD":/workspace sapientai/hrm-text:latest). Alternatively, pip install -r requirements.txt after setting up PyTorch, CUDA, and FlashAttention 3.sapientai/hrm-text:latest. Paper: https://arxiv.org/abs/2605.20613.Highlighted Details
Maintenance & Community
Active development is evident, with ongoing work on native Transformers and vLLM support. No specific community channels or roadmap links are detailed in the provided text.
Licensing & Compatibility
Released under the Apache License 2.0, permitting commercial use and integration into closed-source projects.
Limitations & Caveats
Reliance on FlashAttention 3 necessitates Hopper-class GPUs. Native integration with Transformers and vLLM is pending, requiring conversion steps. Data preparation depends on a companion data_io pipeline.
9 hours ago
Inactive
foundation-model-stack
stanford-crfm
goombalab
allenai
EleutherAI