Bert-VITS2  by fishaudio

VITS2 backbone for multilingual text-to-speech

created 2 years ago
8,522 stars

Top 6.2% on sourcepulse

GitHubView on GitHub
Project Summary

Bert-VITS2 is a text-to-speech (TTS) system that integrates a VITS2 backbone with multilingual BERT for enhanced voice synthesis. It is designed for advanced users and researchers interested in TTS technology, offering a foundation for custom voice model training and experimentation.

How It Works

This project builds upon the VITS2 architecture, incorporating multilingual BERT embeddings to improve prosody and naturalness in speech generation. The core idea is to leverage BERT's contextual understanding of text to inform the VITS2 model, leading to more expressive and human-like synthesized speech.

Quick Start & Requirements

Highlighted Details

  • Core ideas are inspired by MassTTS and VITS.
  • Aims for state-of-the-art open-source TTS quality.
  • Includes references to related projects like fish-speech and so-vits-svc.

Maintenance & Community

The project states it will no longer be actively maintained, recommending FishAudio's Fish-Speech as a successor.

Licensing & Compatibility

The README does not specify a license. Given the project's nature and references, it's likely intended for research and non-commercial use. Users should exercise caution regarding commercial applications.

Limitations & Caveats

The project is no longer actively maintained. The README explicitly warns against using the project for any illegal purposes, particularly those violating Chinese laws, and prohibits political use.

Health Check
Last commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
4
Issues (30d)
0
Star History
178 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Lianmin Zheng Lianmin Zheng(Author of SGLang).

fish-speech by fishaudio

0.3%
23k
Open-source TTS for multilingual speech synthesis
created 1 year ago
updated 1 week ago
Feedback? Help us improve.