Speech synthesis model/GUI for galgame characters
MoeTTS provides a GUI and pre-trained models for synthesizing speech in the voices of galgame characters. It is aimed at fans and hobbyists interested in AI voice generation, and gives them a graphical front end to text-to-speech (TTS) and voice conversion models.
How It Works
The project integrates several speech synthesis models, including Tacotron2, HiFi-GAN, and VITS, along with Diff-SVC for voice conversion. Together these cover two tasks: generating speech from text, and transforming existing audio into the voice of a chosen character. The GUI handles model selection, text input, and voice conversion parameters.
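As a rough illustration of how these model families usually fit together, the sketch below separates a two-stage path (an acoustic model such as Tacotron2 producing a mel spectrogram, then a vocoder such as HiFi-GAN producing a waveform) from an end-to-end path (a single model such as VITS), with voice conversion as a separate step applied to existing audio. Every class, function, and stand-in model here is hypothetical and chosen for illustration; this is not MoeTTS's actual code or API, and the assumption that the project wires its models this way is not confirmed by the description above.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical types and names for illustration; not MoeTTS's actual API.
Audio = List[float]        # waveform samples
Mel = List[List[float]]    # mel-spectrogram frames


@dataclass
class TwoStageTTS:
    """Acoustic model (e.g. Tacotron2) followed by a vocoder (e.g. HiFi-GAN)."""
    acoustic: Callable[[str], Mel]
    vocoder: Callable[[Mel], Audio]

    def synthesize(self, text: str) -> Audio:
        return self.vocoder(self.acoustic(text))


@dataclass
class EndToEndTTS:
    """Single model (e.g. VITS) that maps text straight to a waveform."""
    model: Callable[[str], Audio]

    def synthesize(self, text: str) -> Audio:
        return self.model(text)


def convert_voice(audio: Audio, converter: Callable[[Audio], Audio]) -> Audio:
    """Pass existing audio through a voice conversion model (e.g. Diff-SVC)."""
    return converter(audio)


# Stand-in models so the sketch runs without any checkpoints.
def fake_acoustic(text: str) -> Mel:
    return [[float(ord(ch))] for ch in text]     # one fake frame per character


def fake_vocoder(mel: Mel) -> Audio:
    return [frame[0] / 1000.0 for frame in mel]  # one fake sample per frame


def fake_vits(text: str) -> Audio:
    return [0.0] * len(text)                     # silent placeholder waveform


def fake_converter(audio: Audio) -> Audio:
    return [-s for s in audio]                   # placeholder transform


two_stage = TwoStageTTS(fake_acoustic, fake_vocoder)
end_to_end = EndToEndTTS(fake_vits)

wave_a = two_stage.synthesize("hi")               # text -> mel -> waveform
wave_b = end_to_end.synthesize("hi")              # text -> waveform
converted = convert_voice(wave_a, fake_converter) # existing audio -> new voice
```

Both TTS classes expose the same `synthesize` method, so a front end could swap one for the other when the user picks a different model, while voice conversion stays independent of how the input audio was produced.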
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project states that work on the GUI is complete and that it will no longer be actively developed. Model sharing is also no longer actively supported. Key contributors include ShiroDoMain and menproject.
Licensing & Compatibility
The project is released under an open-source license but includes additional user agreements. Commercial use of the software, pre-trained models, or derivatives is strictly prohibited. Use for original game production is also forbidden.
Limitations & Caveats
The project is no longer actively maintained, meaning no new features or models will be added. Compatibility is limited to TTS models with unmodified architectures; modified versions like so-vits or emo-vits are not supported. Users are responsible for any consequences arising from the use of provided models, which may originate from community contributions.