FluidInference
Native Swift audio processing for Apple devices
Top 23.4% on SourcePulse
FluidAudio is a Swift framework for on-device, low-latency audio processing on Apple platforms, targeting developers building real-time applications. It offers speaker diarization, voice activity detection (VAD), and automatic speech recognition (ASR) using open-source models converted to Apple's Core ML format, optimized for efficient background processing on Apple Silicon.
How It Works
FluidAudio runs all audio processing through native Swift and Core ML, so audio stays on the device and latency stays low. It uses custom-converted, optimized versions of state-of-the-art models such as Parakeet TDT for ASR and Pyannote for speaker diarization. The framework avoids GPU, MPS, and shader-based execution for consistent performance and battery efficiency on Apple devices, relying instead on the CPU and the Apple Neural Engine.
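To make the "no GPU" idea concrete, here is a minimal sketch of how a Core ML model can be restricted to the CPU and the Apple Neural Engine using Apple's standard `MLModelConfiguration` API. It is not taken from FluidAudio's source: the function names `makeNoGPUConfiguration` and `loadModel(at:)` are hypothetical, and whether FluidAudio configures its models this way is an assumption.

```swift
import CoreML
import Foundation

/// Hypothetical helper (not part of FluidAudio): builds a Core ML
/// configuration that keeps inference off the GPU.
func makeNoGPUConfiguration() -> MLModelConfiguration {
    let configuration = MLModelConfiguration()
    if #available(macOS 13.0, iOS 16.0, tvOS 16.0, watchOS 9.0, *) {
        // Allow the CPU and the Apple Neural Engine, but never the GPU.
        configuration.computeUnits = .cpuAndNeuralEngine
    } else {
        // Older systems lack the CPU + ANE option; fall back to CPU only.
        configuration.computeUnits = .cpuOnly
    }
    return configuration
}

/// Hypothetical helper: loads a compiled Core ML model (.mlmodelc)
/// from disk with the GPU-free configuration above.
func loadModel(at compiledModelURL: URL) throws -> MLModel {
    try MLModel(contentsOf: compiledModelURL,
                configuration: makeNoGPUConfiguration())
}
```

Core ML treats `computeUnits` as the set of hardware it is allowed to use, so the system still decides at run time which permitted unit handles each layer.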
Quick Start & Requirements
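As an illustration of consuming the repository URL listed in this section, the following is a sketch of a Swift Package Manager manifest. It assumes the project is distributed as a Swift package that exposes a library product named `FluidAudio`. The package name `MyApp`, the target names, the platform versions, and the `main` branch are placeholders chosen for the example, not values confirmed by the project.

```swift
// swift-tools-version:5.9
import PackageDescription

let package = Package(
    name: "MyApp", // hypothetical package name
    // Assumed minimum platforms; check the project's actual requirements.
    platforms: [.macOS(.v13), .iOS(.v16)],
    dependencies: [
        // Repository URL from this section; the branch is an assumption.
        .package(url: "https://github.com/FluidInference/FluidAudio.git", branch: "main")
    ],
    targets: [
        // The dependency is attached to a library target...
        .target(
            name: "MyAppCore", // hypothetical target name
            dependencies: [
                // Product name "FluidAudio" is assumed, not confirmed.
                .product(name: "FluidAudio", package: "FluidAudio")
            ]
        ),
        // ...and the executable depends on that target rather than on
        // the package directly.
        .executableTarget(
            name: "MyAppCLI", // hypothetical target name
            dependencies: ["MyAppCore"]
        )
    ]
)
```

This layout is one reading of the note about adding the library to a target rather than the executable; in an Xcode project the equivalent step would be choosing the app target when adding the package.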
https://github.com/FluidInference/FluidAudio.git. Ensure the library is added to your target, not the executable.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats