Docker app for recording voice samples to train a TTS voice with Mimic2
Top 62.0% on sourcepulse
Mimic Recording Studio is a Docker-based application designed for collecting voice samples to train custom Text-to-Speech (TTS) voices using Mycroft's Mimic 2 engine. It targets individuals or teams looking to create unique, high-quality synthetic voices, simplifying the data collection process.
How It Works
The application utilizes a Dockerized architecture, separating the frontend (React) and backend (Python/Flask) services. The backend handles audio processing, including automatic silence trimming via FFmpeg, and stores recordings and metadata in a SQLite database. The frontend provides a web interface for users to record, play back audio, and view basic metrics. This containerized approach ensures cross-platform compatibility and simplifies setup.
Quick Start & Requirements
git clone https://github.com/MycroftAI/mimic-recording-studio.git && cd mimic-recording-studio && docker-compose up
docker-compose up
may take time to build containers.Highlighted Details
Maintenance & Community
Support is available via the Mycroft Forum and Mycroft Chat. Contributions via Pull Requests are welcomed.
Licensing & Compatibility
The project is licensed under the Apache License 2.0. Voice recordings donated to Mycroft must be licensed under the Creative Commons CC0 Public Domain license for use in TTS applications.
Limitations & Caveats
Creating a high-quality voice requires a significant effort, estimated at 15,000-20,000 phrases. The project notes that using a new corpus requires resetting the SQLite database.
2 years ago
Inactive