parakeet-rs  by altunenes

High-performance speech processing in Rust

Created 6 months ago
267 stars

Top 95.9% on SourcePulse

GitHubView on GitHub
Project Summary

parakeet-rs provides a high-performance Rust library for speech-to-text (ASR) and speaker diarization, leveraging NVIDIA's Parakeet models via ONNX Runtime. It targets developers seeking fast, efficient, and streamable audio processing, offering significant speed advantages even on CPU, making it suitable for real-time applications and resource-constrained environments.

How It Works

The library integrates NVIDIA's Parakeet models, executing them through ONNX Runtime with support for various hardware acceleration providers (CUDA, TensorRT, WebGPU, DirectML, MIGraphx) and CPU fallback. It offers distinct models for different tasks: CTC for English ASR, TDT for multilingual ASR with auto-detection, EOU for streaming ASR with end-of-utterance detection, Nemotron for cache-aware streaming ASR, Multitalker for streaming multi-speaker ASR, and Sortformer for streaming speaker diarization. This modular approach allows for flexible deployment and optimized performance across diverse hardware.

Quick Start & Requirements

Installation typically involves adding parakeet-rs as a dependency in a Rust project (Cargo.toml) and enabling features like cuda if GPU acceleration is desired. Prerequisites: Rust toolchain, ONNX Runtime (with specific EPs for GPU acceleration), and separately downloaded NVIDIA Parakeet ONNX models and associated files (e.g., model.onnx, tokenizer.json) from HuggingFace. Audio input must be 16kHz mono WAV (16-

Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
7
Issues (30d)
1
Star History
29 stars in the last 30 days

Explore Similar Projects

Starred by Dan Guido Dan Guido(Cofounder of Trail of Bits), Patrick von Platen Patrick von Platen(Author of Hugging Face Diffusers; Research Engineer at Mistral), and
3 more.

voxtral.c by antirez

0.6%
2k
Pure C speech-to-text inference engine for Mistral Voxtral Realtime 4B
Created 2 months ago
Updated 2 months ago
Feedback? Help us improve.