TTS tool for high-quality Chinese speech synthesis and voice cloning
Top 63.3% on sourcepulse
FlashTTS provides high-quality Chinese text-to-speech (TTS) and zero-shot voice cloning, leveraging advanced models like SparkTTS, OrpheusTTS, and MegaTTS 3. It targets developers and users needing natural-sounding speech for applications such as dubbing, reading, accessibility, and virtual characters, offering a user-friendly web interface for quick generation.
How It Works
FlashTTS utilizes a modular architecture, supporting multiple high-performance inference backends including vllm, sglang, llama-cpp, mlx-lm, and tensorrt-llm. This flexibility allows users to choose the most efficient engine for their hardware and performance needs. It features dynamic batching and asynchronous queues for high concurrency, enabling it to handle significant request loads. The system offers fine-grained control over speech parameters like pitch, speed, and emotion, and supports streaming TTS for improved interactivity.
Quick Start & Requirements
pip install flashtts
flashtts infer -i "text" -o output.wav -m ./models/your_model -b vllm
flashtts serve --model_path Spark-TTS-0.5B --backend vllm --llm_device cuda
Highlighted Details
Maintenance & Community
The project is associated with HuiResearch. Further community engagement details such as Discord/Slack links or a roadmap are not explicitly provided in the README.
Licensing & Compatibility
The project inherits the license from Spark-TTS. The specific license details are available in the LICENSE
file. It is intended for academic research, education, and legitimate uses like accessibility, but explicitly prohibits fraudulent or illegal applications such as deepfakes.
Limitations & Caveats
MegaTTS 3's WaveVAE encoder is not publicly released due to security considerations, requiring users to follow official instructions for download. SparkTTS weights require bfloat16
or float32
precision; float16
will cause errors. For extended silence issues, increasing repetition_penalty
is suggested.
2 months ago
Inactive