Curated list of large audio models
Top 63.7% on sourcepulse
This repository serves as a curated list of significant large models and research papers in the audio domain, covering speech, music, and sound generation and understanding. It targets researchers and engineers working with state-of-the-art audio AI, providing a centralized reference for foundational models and recent advancements.
How It Works
The project aggregates links to papers and their corresponding code repositories, categorized by application area such as spoken language models, prompt-based audio synthesis, audio language models, and self-supervised learning (SSL/UL) models. This approach offers a structured overview of the rapidly evolving landscape of large audio models.
Quick Start & Requirements
This repository is a curated list and does not have a direct installation or execution command. Users are directed to individual project repositories for specific setup and requirements.
Highlighted Details
Maintenance & Community
The list appears to be actively updated with recent publications, indicating ongoing curation. Specific community channels or contributor details are not provided in the README.
Licensing & Compatibility
The repository itself is a list of links and does not have a specific license. The licenses of the linked projects vary and must be checked individually.
Limitations & Caveats
This is a reference list and does not provide any code for direct use or experimentation. Users must navigate to individual project repositories to access code, models, and specific usage instructions.
10 months ago
1 week