Swama by Trans-N-ai

High-performance LLM inference engine for macOS

created 1 month ago
359 stars

Top 79.1% on sourcepulse

View on GitHub
Project Summary

Swama is a high-performance LLM and VLM inference engine for macOS, built with pure Swift on Apple's MLX framework. It targets macOS users and developers seeking efficient, local AI model execution, offering an OpenAI-compatible API, a native menu bar app, and CLI tools for seamless integration and model management.

How It Works

Swama leverages Apple's MLX framework, optimized for Apple Silicon, to deliver fast local inference. Its architecture includes SwamaKit (core logic), Swama CLI (model management and inference), and Swama.app (menu bar UI). It supports multimodal inputs, local audio transcription via Whisper, and text embeddings, all accessible through an OpenAI-compatible API.
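Because the server speaks the OpenAI wire format, any generic OpenAI-style client can talk to it. A minimal sketch in Python of building and sending a chat-completions request; the host, port, and model alias here are assumptions for illustration, not values taken from Swama's documentation:

```python
import json
import urllib.request

# Base URL of the local Swama server. The port is an assumption --
# use whatever address the menu bar app reports the API is listening on.
BASE_URL = "http://localhost:28100/v1"

def chat_request(model: str, prompt: str) -> bytes:
    """Build an OpenAI-style chat-completions request body."""
    payload = {
        "model": model,  # a model alias managed by Swama (hypothetical name below)
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }
    return json.dumps(payload).encode("utf-8")

def send(body: bytes) -> dict:
    """POST the body to the chat-completions endpoint (requires a running server)."""
    req = urllib.request.Request(
        BASE_URL + "/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

body = chat_request("llama3.2", "Why is the sky blue?")
print(json.loads(body)["model"])  # → llama3.2
```

The same `send` helper works for the embeddings and transcription endpoints by swapping the path, since all three follow the standard OpenAI routes.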

Quick Start & Requirements

  • Install: Download Swama.dmg from the releases page, install the app, then add the CLI tools from the menu bar app.
  • Requirements: macOS 14.0+, Apple Silicon (M1/M2/M3/M4), Xcode 15.0+, Swift 6.1+.
  • Links: Releases

Highlighted Details

  • OpenAI-compatible API for chat completions, embeddings, and audio transcription.
  • Native macOS menu bar application for background inference and management.
  • Smart model management with aliases, automatic downloading, and caching from HuggingFace.
  • Supports multimodal (text/image) inputs and local audio transcription with Whisper.
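For the multimodal input support noted above, the standard OpenAI message shape mixes text and image parts inside one user message. A sketch of constructing such a message, assuming Swama accepts the usual `image_url` content-part format implied by its OpenAI compatibility (the placeholder bytes are not a real image):

```python
import base64

def image_message(prompt: str, image_bytes: bytes) -> dict:
    """Build an OpenAI-style multimodal user message: text plus an inline base64 image."""
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {
                "type": "image_url",
                "image_url": {"url": f"data:image/png;base64,{b64}"},
            },
        ],
    }

# Placeholder bytes stand in for real PNG data read from disk.
msg = image_message("Describe this image.", b"\x89PNG placeholder")
print(msg["content"][0]["text"])  # → Describe this image.
```

This dict drops straight into the `messages` list of a chat-completions request when targeting a vision-capable model.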

Maintenance & Community

  • Active development with a clear roadmap.
  • Community support via GitHub Discussions.

Licensing & Compatibility

  • MIT License. Permissive for commercial use and closed-source linking.

Limitations & Caveats

  • Requires Apple Silicon hardware and recent macOS versions.
  • Building from source requires Xcode and Swift toolchain setup.
Health Check

  • Last commit: 2 days ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 15
  • Issues (30d): 13
Star History

  • 365 stars in the last 90 days

