High-fidelity voice cloning and dialogue generation
Top 90.4% on SourcePulse
This ComfyUI custom node integrates IndexTTS, a high-quality, fast voice cloning and synthesis system. It targets users within the ComfyUI ecosystem seeking advanced text-to-speech capabilities, including realistic voice cloning, multi-language support (Chinese/English), and nuanced emotional expression, with a notable feature for generating two-person dialogues.
How It Works
The node leverages the IndexTTS model architecture, which combines advanced techniques for voice cloning and speech synthesis. Key advantages include its ability to capture and replicate custom voice timbres with high fidelity and speed. It supports fine-grained control over emotional expression through audio prompts, text prompts, and vector manipulation, enabling dynamic and natural-sounding speech generation, including complex multi-speaker interactions.
Quick Start & Requirements
cd ComfyUI/custom_nodes
git clone https://github.com/billwuhao/ComfyUI_IndexTTS.git
cd ComfyUI_IndexTTS
pip install -r requirements.txt
Windows users need specific pynini installation steps using the provided wheel files and WeTextProcessing.
Python dependencies are listed in requirements.txt. A GPU is highly recommended for performance.
Model files (bigvgan_generator.pth, bpe.model, gpt.pth for v1.5; various components from Hugging Face for v2) must be downloaded and placed into the ComfyUI/models/TTS/Index-TTS directory.
Highlighted Details
Maintenance & Community
No specific community links (Discord, Slack) or detailed maintenance information (contributors, roadmap) are provided in the README.
Licensing & Compatibility
The repository's license is not explicitly stated in the provided text. The node is designed to run within the ComfyUI framework.
Limitations & Caveats
DeepSpeed acceleration is noted as providing minimal performance gains. The first run automatically builds custom CUDA kernels, which may add initial setup time. Model file management requires manual steps.
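Because the model files are placed by hand, a quick check before launching ComfyUI may save a failed first run. The following is a minimal sketch, not part of the repository: the function name missing_model_files is hypothetical, the file list covers only the three v1.5 files named in the Quick Start section (the README may list more, and the v2 file set is not enumerated here), and the ComfyUI root path is an assumption about the local checkout.

```python
from pathlib import Path

# Only the v1.5 file names mentioned in this summary; the actual README may
# require additional files, and the v2 components are not listed here.
V15_FILES = ("bigvgan_generator.pth", "bpe.model", "gpt.pth")


def missing_model_files(comfyui_root, required=V15_FILES):
    """Return the required file names that are absent from the model directory."""
    # Directory taken from the Quick Start section: ComfyUI/models/TTS/Index-TTS
    model_dir = Path(comfyui_root) / "models" / "TTS" / "Index-TTS"
    return [name for name in required if not (model_dir / name).is_file()]


if __name__ == "__main__":
    # "ComfyUI" is an assumed relative path to the local ComfyUI checkout.
    missing = missing_model_files("ComfyUI")
    if missing:
        print("Missing model files:", ", ".join(missing))
    else:
        print("All listed v1.5 model files found.")
```

The check only confirms that the files exist at the expected location; it does not verify that they are complete or match the versions the node expects.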