Discover and explore top open-source AI tools and projects—updated daily.
RoyChao19477Mamba-based speech enhancement models
Top 98.0% on SourcePulse
SEMamba provides an official implementation for speech enhancement (SE) models based on the Mamba architecture, designed for universal, robust, and generalizable performance. It addresses diverse audio distortions and sampling frequencies with a single model, targeting researchers and engineers in audio signal processing. The project achieved 4th place in the URGENT challenge at IEEE SLT 2024.
How It Works
This project integrates the Mamba architecture into speech enhancement pipelines, aiming to create models capable of handling a wide spectrum of audio degradations, including additive noise, reverberation, clipping, and bandwidth limitations. The core advantage lies in Mamba's sequential modeling capabilities, enabling a unified approach across various sampling rates and distortion types, leading to enhanced robustness and generalization.
Quick Start & Requirements
python=3.9), installing PyTorch 2.2.2, then pip install -r requirements.txt, followed by installing Mamba from source (cd mamba_install && pip install .). Docker environments for x86 and ARM are available.Highlighted Details
Maintenance & Community
No explicit community channels (e.g., Discord, Slack), roadmap, or detailed contributor information are provided in the README.
Licensing & Compatibility
The repository's license is not specified in the README, which is a critical omission for assessing commercial use or derivative works.
Limitations & Caveats
mamba-2 branch for potential compatibility improvements.4 months ago
Inactive