Discover and explore top open-source AI tools and projects—updated daily.
maderixDirect neural network training on Apple Neural Engine
New!
Top 8.3% on SourcePulse
Summary This research project demonstrates training neural networks directly on Apple's Neural Engine (ANE) by reverse-engineering private APIs. It targets researchers and engineers exploring on-device AI compute beyond standard inference, offering a proof-of-concept and performance benchmarks for direct ANE access.
How It Works
The core approach uses reverse-engineered _ANEClient and _ANECompiler private APIs to compile custom Model Intermediate Language (MIL) compute graphs directly on the ANE. Objective-C generates MIL at runtime, compiled in-memory. Data and weights are handled via IOSurface in a channel-first layout. Forward and backward passes run on ANE kernels, with weight gradients (dW) computed on CPU via cblas. This bypasses Apple's inference-only restriction.
Quick Start & Requirements
Build with xcrun clang -O2 -framework Foundation -framework IOSurface -framework CoreML -framework Accelerate -ldl -lobjc -o train_large training/train_large.m and run ./train_large. Requires macOS 15+ on Apple Silicon (M4 tested). Uses only system frameworks and private ANE APIs resolved at runtime.
Highlighted Details
3 days ago
Inactive
MarioSieg
Kaixhin
bigscience-workshop
huggingface
mindspore-ai
NVIDIA