speaches by speaches-ai

OpenAI API-compatible server for transcription, translation, and speech generation

Created 2 years ago

3,484 stars

Top 13.5% on SourcePulse

View on GitHub

1 Expert Loves This Project

Amin Ahmad

Cofounder of Vectara

Project Summary

Speaches provides an OpenAI API-compatible server for ASR, translation, and TTS, targeting developers and researchers who want to integrate speech capabilities into their applications. It offers a unified interface for various speech models, simplifying complex workflows and enabling real-time, streaming interactions.

How It Works

Speaches leverages faster-whisper for speech-to-text and translation, and piper or kokoro for text-to-speech. Its core design mimics the OpenAI API, allowing seamless integration with existing tools and SDKs. The server supports dynamic model loading and offloading, automatically managing resources based on request activity, which is advantageous for efficient GPU/CPU utilization.

Quick Start & Requirements

Install: Docker Compose is the primary deployment method.
Prerequisites: GPU support is recommended for optimal performance.
Documentation: speaches.ai

Highlighted Details

OpenAI API compatibility for broad tool integration.
Streaming transcription and speech generation for real-time applications.
Dynamic model loading/offloading for efficient resource management.
Supports high-quality TTS via kokoro (ranked #1 in TTS Arena) and piper.

Maintenance & Community

The project is actively maintained, with a call for issues and feature suggestions. Links to community channels or roadmaps are not explicitly provided in the README.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial use or closed-source linking is therefore undetermined.

Limitations & Caveats

The project is described as having a "TODO" for speech generation demos, indicating this feature may still be under development or refinement. The lack of a specified license poses a significant caveat for adoption.

Health Check

Last Commit

2 days ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

115 stars in the last 30 days