Fast streaming TTS via Orpheus, accessible through a Python package
Top 89.3% on sourcepulse
This project provides a fast, streaming Text-to-Speech (TTS) engine using C++ bindings for the Orpheus TTS model, integrated with WebRTC via FastRTC. It targets developers and researchers seeking low-latency, high-quality speech synthesis for real-time applications, offering a significant performance improvement over Python-only implementations.
How It Works
The core of Orpheus-CPP leverages C++ for performance-critical TTS inference, wrapping the Orpheus model. It utilizes WebRTC for efficient, low-latency audio streaming, enabling real-time communication and playback. The integration with FastRTC further optimizes the WebRTC stack for speed and resource efficiency.
Quick Start & Requirements
pip install orpheus-cpp
pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu
(for CPU) or --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/metal
(for Apple Silicon).python -m orpheus_cpp
and visit http://localhost:7860
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README notes that reducing pre_buffer_size
too much may impact audio smoothness. Installation of llama-cpp-python
requires specific wheel URLs, indicating potential build complexities or dependency management needs.
3 months ago
1 day