mlx  by ml-explore

Array framework for machine learning on Apple silicon

Created 1 year ago
22,226 stars

Top 1.8% on SourcePulse

GitHubView on GitHub
Project Summary

MLX is an array framework designed for machine learning on Apple silicon, targeting ML researchers and developers. It offers a user-friendly yet efficient platform for training and deploying models, simplifying the exploration of new ideas with a conceptually simple and extensible design.

How It Works

MLX leverages a unified memory model, allowing arrays to reside in shared memory accessible by both CPU and GPU without explicit data transfers. Computations are lazy and use a dynamically constructed graph, enabling efficient execution and straightforward debugging, even with changing input shapes. Its design is inspired by frameworks like NumPy, PyTorch, Jax, and ArrayFire, incorporating composable function transformations for automatic differentiation, vectorization, and graph optimization.

Quick Start & Requirements

Highlighted Details

  • Familiar Python API mirroring NumPy, with C++, C, and Swift APIs.
  • Higher-level packages (mlx.nn, mlx.optimizers) mimic PyTorch APIs.
  • Supports multi-device operations (CPU and GPU) via unified memory.
  • Examples include Transformer training, LLaMA fine-tuning, Stable Diffusion, and Whisper.

Maintenance & Community

MLX is developed by Apple machine learning research. Contributions are welcomed.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README text.

Limitations & Caveats

The framework is primarily designed for Apple silicon, limiting its use on other hardware architectures. The BibTex entry lists the version as 0.0, suggesting it may be in early development stages.

Health Check
Last Commit

2 days ago

Responsiveness

1 day

Pull Requests (30d)
72
Issues (30d)
25
Star History
283 stars in the last 30 days

Explore Similar Projects

Starred by Yaowei Zheng Yaowei Zheng(Author of LLaMA-Factory), Yineng Zhang Yineng Zhang(Inference Lead at SGLang; Research Scientist at Together AI), and
1 more.

VeOmni by ByteDance-Seed

3.4%
1k
Framework for scaling multimodal model training across accelerators
Created 5 months ago
Updated 3 weeks ago
Starred by Bojan Tunguz Bojan Tunguz(AI Scientist; Formerly at NVIDIA), Alex Chen Alex Chen(Cofounder of Nexa AI), and
19 more.

ggml by ggml-org

0.3%
13k
Tensor library for machine learning
Created 3 years ago
Updated 2 days ago
Feedback? Help us improve.