Discover and explore top open-source AI tools and projects—updated daily.
HenryNdubuakuJax library for building transformer models, including GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT
Top 89.4% on SourcePulse
NanoDL is a Jax-based library for building and training transformer models from scratch, targeting AI/ML experts who need to develop smaller-scale, efficient models. It provides a pedagogical approach with modular code, enabling customization and accelerated development of neural networks with distributed training capabilities.
How It Works
NanoDL leverages Jax and Flax for efficient computation and distributed training. Its core design emphasizes modularity, with each model and its components contained in single files to minimize dependencies and facilitate learning. This approach allows users to easily select, combine, and modify layers and blocks, including specialized ones like RoPE, GQA, and MQA, for flexible model development.
Quick Start & Requirements
pip install nanodlHighlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 year ago
Inactive
erfanzar
epfLLM
huggingface
n2cholas
young-geng
AI-Hypercomputer
EleutherAI
huggingface
Lightning-AI