TTS by mozilla

Deep learning library for text-to-speech generation

Created 8 years ago

10,097 stars

Top 5.0% on SourcePulse

3 Experts Love This Project

transitive-bullshit

Founder of Agentic

ankane

Author of pgvector

soumith

Soumith Chintala

Coauthor of PyTorch

Project Summary

Mozilla TTS is an open-source library for advanced Text-to-Speech (TTS) generation, targeting researchers and developers seeking high-quality, efficient, and customizable speech synthesis. It offers a robust framework built on state-of-the-art deep learning models, enabling users to train custom voices or leverage pre-trained models across multiple languages.

How It Works

The library implements a modular architecture featuring distinct Text-to-Spectrogram models (Tacotron, Tacotron2, Glow-TTS, Speedy-Speech) and various Vocoder models (MelGAN, ParallelWaveGAN, WaveRNN, etc.). It also includes a Speaker Encoder for efficient speaker embedding extraction. This separation allows for flexible experimentation and optimization of individual components, facilitating the achievement of a balance between training ease, inference speed, and audio quality.

Quick Start & Requirements

Install via pip: pip install TTS
For development: git clone https://github.com/mozilla/TTS and pip install -e .
Requires Python >= 3.6 and < 3.9.
Official Docker image available.
Tutorials and examples: TTS/Wiki

Highlighted Details

Supports multi-speaker TTS and efficient multi-GPU training.
Enables conversion of PyTorch models to TensorFlow 2.0 and TFLite for inference.
Provides tools for dataset quality analysis and a demo server for model testing.
Includes notebooks for extensive model benchmarking and parameter selection.

Maintenance & Community

Active discussion forum: discourse.mozilla.org/c/tts
Matrix channel for general discussion.
Governed by Mozilla's code of conduct.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The README does not specify a license, which may impact commercial adoption.
Python version compatibility is limited to < 3.9.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

1

Star History

28 stars in the last 30 days

Explore Similar Projects

Meta-voicebox by SpeechifyInc

PyTorch implementation of Meta's Voicebox speech model

Created 2 years ago

Updated 2 years ago

deepspeech-german by AASHISHAG

ASR module using Mozilla DeepSpeech for German speech

Created 6 years ago

Updated 2 years ago

radtts by NVIDIA

Flow-based TTS recipes for training, inference, and voice conversion

Created 3 years ago

Updated 2 years ago

Starred by

Patrick von Platen

Patrick von Platen(Author of Hugging Face Diffusers; Research Engineer at Mistral).

FastDiff by Rongjiehuang

PyTorch implementation for fast, high-fidelity speech synthesis via conditional diffusion

Created 4 years ago

Updated 1 year ago

Starred by

Casper Hansen

Casper Hansen(Author of AutoAWQ).

melgan by seungwonpark

PyTorch implementation of MelGAN vocoder

Created 6 years ago

Updated 5 years ago

openWakeWord by dscripka

Open-source wakeword detection library for voice-enabled apps

Created 3 years ago

Updated 1 week ago

TransformerTTS by spring-media

TensorFlow 2 implementation for non-autoregressive text-to-speech

Created 5 years ago

Updated 1 year ago

dl-colab-notebooks by tugstugi

Colab notebooks for deep learning model demos

Created 6 years ago

Updated 3 years ago

Starred by

Soumith Chintala

Soumith Chintala(Coauthor of PyTorch),

Travis Fischer

Travis Fischer(Founder of Agentic), and

8 more.

speechbrain by speechbrain

PyTorch toolkit for speech and text processing research

Created 5 years ago

Updated 6 days ago

Starred by

Lilian Weng

Lilian Weng(Cofounder of Thinking Machines Lab),

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity), and

26 more.

tensor2tensor by tensorflow

Deprecated library for deep learning models/datasets, successor to Trax

Created 8 years ago

Updated 2 years ago

Starred by

Joe Walnes

Joe Walnes(Head of Experimental Projects at Stripe),

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI), and

13 more.

DeepSpeech by mozilla

Open-source speech-to-text engine for on-device inference

Created 9 years ago

Updated 6 months ago

Starred by

Jason Huggins

Jason Huggins(Creator of Selenium),

Michael Han

Michael Han(Cofounder of Unsloth), and

11 more.

TTS by coqui-ai

Deep learning toolkit for Text-to-Speech, research-tested

Created 5 years ago

Updated 1 year ago

Feedback? Help us improve.