Framework for porting LLMs to Apple Neural Engine (ANE)
Top 35.1% on sourcepulse
ANEMLL is an open-source library designed to accelerate the porting and on-device inference of Large Language Models (LLMs) on Apple's Neural Engine (ANE). It targets developers building low-power, privacy-focused applications for edge devices, enabling seamless integration of LLMs into iOS and macOS applications.
How It Works
ANEMLL provides a pipeline for converting Hugging Face models to Apple's CoreML format, optimized for ANE execution. It leverages CoreML Tools for conversion and offers Swift and Python implementations for inference. This approach allows for direct on-device processing, enhancing privacy and reducing reliance on cloud infrastructure.
Quick Start & Requirements
git clone https://github.com/Anemll/Anemll.git
cd Anemll
python -m venv anemll-env
source anemll-env/bin/activate
pip install -r requirements.txt
coremlcompiler
)../anemll/utils/convert_model.sh --model <path_to_model> --output <output_directory>
python ./tests/chat.py --meta <output_directory>/meta.yaml
Highlighted Details
chat.py
) and advanced (chat_full.py
) Python chat interfaces with conversation history management.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 weeks ago
1 day