TTS service for voice synthesis using Microsoft Azure
This project provides a text-to-speech (TTS) service that leverages Microsoft Azure Speech Services for high-quality voice synthesis. It is designed for developers and users needing to integrate TTS capabilities into applications or use a standalone service, offering flexibility through multiple languages, voices, adjustable speech parameters, and various audio formats.
How It Works
The service acts as a wrapper around Microsoft Azure's TTS API, providing a user-friendly interface and compatibility with the OpenAI TTS API. It handles long text inputs by automatically segmenting and merging them, ensuring smooth audio generation. The architecture supports customization via environment variables or YAML configuration files for parameters like port, Azure region, default voice, speech rate, pitch, and audio format.
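The segment-and-merge handling of long inputs can be illustrated with a minimal Python sketch. It is not the project's actual implementation: the function names `split_text` and `synthesize_long`, the 200-character limit, and the sentence-boundary rule are all assumptions made for illustration, and the `synthesize` callback stands in for whatever call actually reaches Azure.

```python
import re
from typing import Callable, List


def split_text(text: str, max_chars: int = 200) -> List[str]:
    """Split text into chunks of at most max_chars, preferring sentence boundaries.

    Hypothetical helper: the limit and the boundary rule are illustrative only.
    """
    # Split after Latin sentence punctuation followed by whitespace,
    # or directly after CJK sentence punctuation (which has no trailing space).
    pieces = re.split(r"(?<=[.!?])\s+|(?<=[。！？])", text)
    sentences = [p.strip() for p in pieces if p and p.strip()]

    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        # Hard-wrap a single sentence that exceeds the limit on its own.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def synthesize_long(
    text: str,
    synthesize: Callable[[str], bytes],
    max_chars: int = 200,
) -> bytes:
    """Synthesize each chunk separately and join the audio in order.

    Plain byte concatenation is only reasonable for stream-like formats
    (for example MP3 frames or raw PCM); container formats with headers,
    such as WAV, would need proper merging instead.
    """
    return b"".join(synthesize(chunk) for chunk in split_text(text, max_chars))
```

Passing the synthesis step in as a callback keeps the splitting logic testable without network access; a fake such as `lambda chunk: chunk.encode()` is enough to check chunk order and size.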
Quick Start & Requirements
Run with Docker: docker run -d -p 8080:8080 --name=tts zuoban/zb-tts
Once the container is running, open http://localhost:8080, or view the API docs at http://localhost:8080/api-doc.
An API key is supplied through the API_KEY environment variable.
Highlighted Details
Maintenance & Community
The project is maintained by zuoban. No specific community channels or roadmap details are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The service depends on an external Microsoft Azure Speech Service API key, and usage of that service may incur costs. The Azure region and API key must be configured before the service can operate.
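Because the region and key are mandatory, a deployment script might check for them before starting the service. The following Python sketch shows one way to fail fast; `AZURE_REGION` is a hypothetical variable name chosen for illustration, and the exact role of `API_KEY` in the project is assumed rather than confirmed, so check the README for the real settings.

```python
import os
from typing import Dict, Mapping

# Placeholder names: AZURE_REGION is hypothetical, and API_KEY's exact
# meaning in the project is assumed, not confirmed by the summary.
REQUIRED_SETTINGS = ("API_KEY", "AZURE_REGION")


def load_settings(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Return the required settings, or raise if any is missing or empty."""
    missing = [name for name in REQUIRED_SETTINGS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing required settings: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_SETTINGS}
```

Accepting the environment as a parameter makes the check easy to exercise with a plain dictionary instead of real environment variables.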