memory_reduced_optimizer by adonis-dym

Research paper for memory-reduced deep network training

Created 11 months ago
529 stars

Top 59.9% on SourcePulse

Project Summary

This repository provides memory-reduced variants of popular deep learning optimizers (AdamW, Adan, Lion) by reusing gradient space. It targets researchers and practitioners training large models who face memory constraints, offering significant memory savings without compromising training dynamics.

How It Works

The core innovation is gradient space reutilization. When a gradient's historical information is no longer required by the optimizer's update rule, its allocated memory is repurposed to store intermediate variables. This technique is applied to AdamW, Adan, and Lion, creating AdamW-R, Adan-R, and Lion-R, respectively. This approach aims to reduce the optimizer's memory footprint, enabling larger models or batch sizes on limited hardware.
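To make the idea concrete, here is a minimal pure-Python sketch of an AdamW-style step that reuses the gradient buffer: once the gradient has been folded into the moment estimates, its storage is overwritten in place with the intermediate update direction instead of allocating a fresh buffer. This is an illustration of the technique only, not the repository's PyTorch implementation; the function name and signature are invented for the example.

```python
import math

def adamw_r_step(param, grad, exp_avg, exp_avg_sq, step,
                 lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8, wd=1e-2):
    """One AdamW step that reuses the gradient buffer in place.

    Illustrative sketch of gradient space reutilization, NOT the
    repository's code: after exp_avg and exp_avg_sq have absorbed the
    new gradient, the update rule no longer needs the raw gradient, so
    grad[i] is repurposed to hold the update direction.
    """
    bc1 = 1 - beta1 ** step   # bias correction for the first moment
    bc2 = 1 - beta2 ** step   # bias correction for the second moment
    for i in range(len(param)):
        g = grad[i]
        exp_avg[i] = beta1 * exp_avg[i] + (1 - beta1) * g
        exp_avg_sq[i] = beta2 * exp_avg_sq[i] + (1 - beta2) * g * g
        # The raw gradient value has been consumed; reuse its slot for
        # the bias-corrected update direction (no extra buffer needed).
        grad[i] = (exp_avg[i] / bc1) / (math.sqrt(exp_avg_sq[i] / bc2) + eps)
        # Decoupled weight decay, as in AdamW.
        param[i] -= lr * (grad[i] + wd * param[i])
    return param
```

In a real PyTorch optimizer the same trick amounts to in-place tensor operations on `p.grad` once its value is no longer needed, which is what lets the -R variants keep the update mathematically unchanged while shrinking peak memory.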

Quick Start & Requirements

  • Install by placing the provided optimizer files directly into your project directory.
  • Requires PyTorch.
  • See the paper for detailed experimental results.

Highlighted Details

  • Achieves 6-25% memory savings across various models (ViT, ConvNeXt, BLOOM, LLaMA-2, etc.) compared to standard optimizers.
  • Memory reduction is demonstrated with and without ZeRO optimization.
  • AdamW-R and Adan-R produce training dynamics identical to their original optimizers.
  • Lion-R is theoretically equivalent, with minimal impact on training outcomes.

Maintenance & Community

  • Developed by adonis-dym; the accompanying paper is authored by Yiming Dong and Zhouchen Lin.
  • The paper won the PRCV Best Paper Award.

Licensing & Compatibility

  • The repository does not explicitly state a license.

Limitations & Caveats

  • The specific license is not declared, which may impact commercial use or integration into closed-source projects.
  • The README does not detail installation beyond placing files in the project directory, suggesting potential manual integration effort.
Health Check

  • Last Commit: 7 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 0 stars in the last 30 days

Explore Similar Projects

Starred by Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), Elvis Saravia (Founder of DAIR.AI), and 2 more.

YaFSDP by yandex

Top 0.1% · 975 stars
Sharded data parallelism framework for transformer-like neural networks
Created 1 year ago · Updated 3 months ago
Starred by Ying Sheng (Coauthor of SGLang) and Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake).

llm-analysis by cli99

Top 0.4% · 455 stars
CLI tool for LLM latency/memory analysis during training/inference
Created 2 years ago · Updated 5 months ago
Starred by Andrej Karpathy (Founder of Eureka Labs; formerly at Tesla, OpenAI; author of CS 231n), Pawel Garbacki (Cofounder of Fireworks AI), and 11 more.

Liger-Kernel by linkedin

Top 0.6% · 6k stars
Triton kernels for efficient LLM training
Created 1 year ago · Updated 1 day ago