HazyResearch
Genomic foundation model for long-range DNA sequence modeling
Top 46.5% on SourcePulse
This repository is the official implementation of HyenaDNA, a long-range genomic foundation model that processes up to 1 million tokens at single-nucleotide resolution. It is intended for researchers who want to pretrain HyenaDNA models or fine-tune them for downstream genomic tasks.
How It Works
HyenaDNA replaces the self-attention layers of a standard transformer with the Hyena operator. The operator interleaves element-wise multiplications (gating) with long convolutions whose filters are learned and applied via FFT. Because an FFT convolution costs O(L log L) in the sequence length L, rather than the O(L^2) of self-attention, the model can handle much longer contexts than a standard transformer, which makes it suitable for very long genomic sequences.
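To make the gating-plus-FFT-convolution idea concrete, here is a minimal sketch in pure Python. It is not the repository's code: the function names (fft, causal_fft_conv, gated_long_conv) are hypothetical, the filter is a fixed decaying sequence standing in for a learned one, and only a single gating step is shown.

```python
import cmath


def fft(x, invert=False):
    """Recursive radix-2 FFT. len(x) must be a power of two.

    With invert=True this computes the unnormalized inverse transform;
    the caller divides by n.
    """
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], invert)
    odd = fft(x[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out


def causal_fft_conv(u, h):
    """Causal convolution y[t] = sum_{s<=t} h[s] * u[t-s], computed via FFT."""
    L = len(u)
    h = h[:L]  # filter taps beyond the sequence length cannot contribute
    n = 1
    while n < 2 * L:  # zero-pad so the circular convolution does not wrap
        n *= 2
    U = fft([complex(a) for a in u] + [0j] * (n - L))
    H = fft([complex(a) for a in h] + [0j] * (n - len(h)))
    Y = fft([a * b for a, b in zip(U, H)], invert=True)
    return [(y / n).real for y in Y[:L]]


def gated_long_conv(v, gate, h):
    """One gating step: out[t] = gate[t] * (h * v)[t] (element-wise gate)."""
    return [g * c for g, c in zip(gate, causal_fft_conv(v, h))]


def direct_causal_conv(u, h):
    """O(L^2) reference implementation used to check the FFT path."""
    return [
        sum(h[s] * u[t - s] for s in range(min(t + 1, len(h))))
        for t in range(len(u))
    ]


# Illustrative inputs (made up for this sketch).
v = [1.0, 2.0, 3.0, 4.0]             # value sequence
h = [0.5, 0.25, 0.125, 0.0625]       # stand-in for a learned long filter
gate = [1.0, 0.0, 1.0, 0.5]          # stand-in for a data-dependent gate
out = gated_long_conv(v, gate, h)
```

The FFT path and the direct O(L^2) sum agree up to floating-point error; the difference is that the FFT path scales as O(L log L), which is what makes very long sequences tractable.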
Quick Start & Requirements
Clone the repository with its submodules (git clone --recurse-submodules), create a conda environment (conda create -n hyena-dna python=3.8), install PyTorch 1.13 with CUDA 11.7, and then run pip install -r requirements.txt. Flash Attention must also be installed.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
8 months ago
Inactive