Discover and explore top open-source AI tools and projects—updated daily.
SWividOpen-source foundation for unified Arabic speech synthesis
Top 88.3% on SourcePulse
Summary
Habibi-TTS provides an open-source foundation for unified-dialectal Arabic speech synthesis, addressing the need for high-quality TTS across various Arabic dialects. It targets researchers and developers, offering a flexible system for generating natural-sounding Arabic speech with support for multiple dialects and advanced evaluation metrics.
How It Works
The project leverages a unified model and specialized models for different Arabic dialects. It supports zero-shot TTS inference using reference audio and text, allowing users to specify dialects or infer them from prompts. The system is designed for both ease of use via a Gradio GUI and flexibility through a command-line interface, with advanced configuration options available via TOML files.
Quick Start & Requirements
pip install habibi-ttshabibi-tts_infer-gradioaccelerate launch), Python. Specific CUDA version not stated but recommended for performance.Highlighted Details
Maintenance & Community
No specific details on contributors, sponsorships, or community channels (like Discord/Slack) are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The CC-BY-NC-SA-4.0 license on key models imposes non-commercial and share-alike restrictions, potentially limiting adoption in commercial products. Specific hardware requirements (e.g., GPU, CUDA version) are not explicitly detailed in the README, though implied for performance. The README points to external documentation (F5-TTS) for detailed installation, suggesting the provided README might be a high-level overview.
1 week ago
Inactive
RVC-Boss