Discover and explore top open-source AI tools and projects—updated daily.
yolainAdvanced Text-to-Speech generation for ComfyUI
Top 99.8% on SourcePulse
This repository provides a ComfyUI custom node that enhances the IndexTTS2 text-to-speech model. It offers a streamlined workflow, advanced voice cloning capabilities, and integrated model management, targeting ComfyUI users seeking greater control and flexibility in TTS generation. The primary benefit is an improved user experience and expanded functionality over the base IndexTTS2 model within the ComfyUI ecosystem.
How It Works
This project is a modified version of ComfyUI_Index_TTS, built upon the IndexTTS2 architecture. While retaining the core logic of the original IndexTTS model, it introduces a significantly adjusted usage flow and new nodes. Key architectural choices include adapting to the ComfyUI v3 node paradigm for better integration and developing specific nodes for managing models (downloading from HuggingFace/ModelScope, unloading) and advanced voice/emotion referencing (using descriptions, audio samples, or vectors).
Quick Start & Requirements
cd ComfyUI/custom_nodes
git clone https://github.com/yolain/ComfyUI-Easy-IndexTTS2
cd ComfyUI-Easy-IndexTTS2
../../python_embeded/python.exe -m pip install -r requirements.txt
ComfyUI/models/IndexTTS-2/ or will be auto-downloaded to ./ComfyUI/models/IndexTTS-2/hf_cache/.semantic_codec/model.safetensors from https://huggingface.co/amphion/MaskGCT/tree/main/semantic_codeccampplus_cn_common.bin from https://huggingface.co/funasr/campplusw2v-bert-2.0/ folder from https://huggingface.co/facebook/w2v-bert-2.0nvidia/bigvgan_v2_22khz_80band_256x) into bigvgan/.gpt.pth, s2mel.pth, bpe.model, wav2vec2bert_stats.pt, qwen0.6bemo4-merge/) and the base IndexTTS-2 model.Highlighted Details
-0.5s-).(start, end) for subtitle alignment.Maintenance & Community
No specific details regarding maintainers, community channels (like Discord/Slack), sponsorships, or roadmap are provided in the README.
Licensing & Compatibility
The project is provided "as is" with no explicit warranties. A disclaimer states the author and copyright holders are not liable for any claims, damages, or responsibilities arising from its use. It strictly prohibits illegal use and copyright infringement, placing responsibility on the user to comply with all applicable laws and regulations. No specific open-source license (e.g., MIT, Apache) is mentioned, and compatibility for commercial use or closed-source linking is not addressed.
Limitations & Caveats
The project includes a broad disclaimer of liability, making users solely responsible for legal compliance and any issues arising from usage. It requires a recent ComfyUI version to function correctly due to its reliance on v3 node paradigms. The extensive list of required model files and their specific placement can be complex to set up.
4 months ago
Inactive
canopyai
RVC-Boss