speaches  by speaches-ai

OpenAI API-compatible server for transcription, translation, and speech generation

Created 1 year ago
2,388 stars

Top 19.3% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

Speaches provides an OpenAI API-compatible server for ASR, translation, and TTS, targeting developers and researchers who want to integrate speech capabilities into their applications. It offers a unified interface for various speech models, simplifying complex workflows and enabling real-time, streaming interactions.

How It Works

Speaches leverages faster-whisper for speech-to-text and translation, and piper or kokoro for text-to-speech. Its core design mimics the OpenAI API, allowing seamless integration with existing tools and SDKs. The server supports dynamic model loading and offloading, automatically managing resources based on request activity, which is advantageous for efficient GPU/CPU utilization.

Quick Start & Requirements

  • Install: Docker Compose is the primary deployment method.
  • Prerequisites: GPU support is recommended for optimal performance.
  • Documentation: speaches.ai

Highlighted Details

  • OpenAI API compatibility for broad tool integration.
  • Streaming transcription and speech generation for real-time applications.
  • Dynamic model loading/offloading for efficient resource management.
  • Supports high-quality TTS via kokoro (ranked #1 in TTS Arena) and piper.

Maintenance & Community

The project is actively maintained, with a call for issues and feature suggestions. Links to community channels or roadmaps are not explicitly provided in the README.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial use or closed-source linking is therefore undetermined.

Limitations & Caveats

The project is described as having a "TODO" for speech generation demos, indicating this feature may still be under development or refinement. The lack of a specified license poses a significant caveat for adoption.

Health Check
Last Commit

20 hours ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
11
Star History
117 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.