OpenAI-compatible API endpoint for local, free, high-quality TTS
Top 37.1% on sourcepulse
This project provides a free, OpenAI-compatible Text-to-Speech (TTS) API endpoint that leverages Microsoft Edge's TTS service. It's designed for developers and users looking for a high-quality, cost-effective alternative to services like OpenAI or ElevenLabs, offering flexibility in voice selection, audio formats, and playback speed.
How It Works
The service emulates the OpenAI TTS API's /v1/audio/speech
endpoint, accepting similar request structures. It maps OpenAI's standard voices (alloy, echo, fable, onyx, nova, shimmer) to their Microsoft Edge TTS equivalents, but also allows direct specification of any available Edge TTS voice. The underlying Microsoft Edge TTS service provides the core speech generation, with the project handling API compatibility and format conversions.
Quick Start & Requirements
docker compose up --build
(or docker run -d -p 5050:5050 --env-file .env travisvn/openai-edge-tts
).env
file for configuration.Highlighted Details
/v1/audio/speech
.Maintenance & Community
The project is maintained by travisvn. Contributions are welcome via pull requests.
Licensing & Compatibility
Licensed under GNU General Public License v3.0 (GPL-3.0). The acceptable use-case is intended for personal use. Enterprise or non-personal use requires contacting the maintainer.
Limitations & Caveats
The GPL-3.0 license may impose copyleft restrictions on derivative works or linked applications, potentially requiring them to also be open-sourced. The project is primarily intended for personal use.
1 month ago
1 day