EasyDeL  by erfanzar

Framework for streamlined JAX model training/serving on TPU/GPU

created 2 years ago
294 stars

Top 90.9% on sourcepulse

GitHubView on GitHub
Project Summary

EasyDeL is a JAX/Flax framework designed to accelerate and optimize the training and serving of machine learning models, particularly for large-scale deployments on TPUs and GPUs. It targets researchers and engineers seeking efficient, flexible, and production-ready solutions for modern deep learning architectures.

How It Works

Built on Flax NNX, EasyDeL offers a modular and customizable architecture. It provides specialized trainers for tasks like Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO), along with support for various attention mechanisms (FlashAttention2, Ring Attention, etc.) and quantization techniques (NF4, A8BIT). Its vInference engine and OpenAI-compatible API server enable efficient serving.

Quick Start & Requirements

  • Install via pip: pip install easydel
  • Requires JAX, Transformers, and Datasets. GPU/TPU recommended for performance.
  • Official Documentation: https://erfanzar.github.io/EasyDeL/

Highlighted Details

  • Supports diverse model architectures including Transformers, Mamba, and RWKV.
  • Integrates platform-specific optimizations like TRITON, XLA, and Pallas.
  • Offers advanced quantization methods (NF4, A8BIT, A8Q, A4Q) for memory and speed improvements.
  • Includes a vInference engine for efficient LLM serving and an OpenAI-compatible API server.

Maintenance & Community

  • Primarily maintained by Erfan Zare Chavoshi.
  • Contributing guidelines are available in the repository.

Licensing & Compatibility

  • Released under the Apache v2 license.
  • Permissive license suitable for commercial use and integration into closed-source projects.

Limitations & Caveats

The framework is built on modern Flax NNX, which is still evolving. While it supports many architectures, specific model compatibility or performance tuning might require deeper understanding of JAX and Flax internals.

Health Check
Last commit

4 days ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
2
Star History
27 stars in the last 90 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera) and Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

InternEvo by InternLM

1.0%
402
Lightweight training framework for model pre-training
created 1 year ago
updated 1 week ago
Starred by Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
6 more.

EasyLM by young-geng

0.2%
2k
LLM training/finetuning framework in JAX/Flax
created 2 years ago
updated 11 months ago
Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), Michael Han Michael Han(Cofounder of Unsloth), and
1 more.

ktransformers by kvcache-ai

0.4%
15k
Framework for LLM inference optimization experimentation
created 1 year ago
updated 2 days ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Nat Friedman Nat Friedman(Former CEO of GitHub), and
32 more.

llama.cpp by ggml-org

0.4%
84k
C/C++ library for local LLM inference
created 2 years ago
updated 14 hours ago
Feedback? Help us improve.