Discover and explore top open-source AI tools and projects—updated daily.
OpenMOSSOpen-source speech and sound generation model family
New!
Top 50.4% on SourcePulse
Summary
MOSS-TTS Family provides an open-source suite for high-fidelity, high-expressiveness audio generation across complex scenarios, addressing single-model limitations. It targets engineers and researchers needing production-ready components for diverse needs like long-form speech, dialogue, voice design, and real-time streaming, enhancing audio content creation.
How It Works
The MOSS-TTS Family comprises five specialized models (MOSS-TTS, MOSS-TTSD, MOSS-VoiceGenerator, MOSS-TTS-Realtime, MOSS-SoundEffect) for modularity or pipeline composition. A core MOSS-Audio-Tokenizer, built on a "CNN-free" Causal Transformer, unifies audio representation, compressing 24kHz audio to 12.5Hz with high fidelity and native streaming support. This enables novel capabilities like reference-free voice design and specialized solutions for long-speech, expressive dialogue, and low-latency agents.
Quick Start & Requirements
cd MOSS-TTS, pip install --extra-index-url https://download.pytorch.org/whl/cu128 -e ..Highlighted Details
Maintenance & Community
Recently released (Feb 2026), the README lacks contributor/community channel details. Information may be found via linked Huggingface spaces or GitHub.
Licensing & Compatibility
Licensed under Apache License 2.0, permitting commercial use and integration into closed-source projects.
Limitations & Caveats
Optional FlashAttention 2 installation may fail on some hardware. As a new project, long-term maintenance and community adoption are TBD. Different model architectures present distinct trade-offs requiring careful selection.
21 hours ago
Inactive
canopyai