API for text-to-speech using CosyVoice
Top 90.0% on sourcepulse
This project provides a Python API wrapper for the CosyVoice2 text-to-speech model, enabling developers to integrate advanced voice synthesis capabilities into their applications. It targets developers and researchers working with AI voice generation, offering flexible options for both built-in voice synthesis and voice cloning.
How It Works
The API exposes several endpoints for different synthesis tasks. The /tts
endpoint handles basic text-to-speech with predefined roles (languages/genders). The /clone_eq
endpoint performs voice cloning using a reference audio and matching text, while /cone
allows cross-lingual voice cloning. It also offers an OpenAI-compatible /v1/audio/speech
endpoint for seamless integration with existing OpenAI TTS workflows.
Quick Start & Requirements
python -m pip install flask
python api.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project relies on a pre-existing CosyVoice2 deployment, and its own licensing is not specified, which may impact commercial adoption. The README does not detail error handling or advanced configuration options.
6 months ago
1 day