Curated list of speech and audio AI research papers
Top 16.0% on sourcepulse
This repository is a curated list of academic papers covering speech and audio processing, specifically Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis (TTS), Voice Conversion (VC), Language Modeling, Singing Voice Synthesis (SVS), and Music Modeling. It serves as a valuable resource for researchers, engineers, and students in the field looking to stay updated on foundational and state-of-the-art techniques.
How It Works
The repository functions as a comprehensive bibliography, categorizing influential and recent research papers within specific sub-domains of speech and audio processing. Each entry typically includes the paper title, authors, year, and a direct link to the PDF. This structured approach allows users to quickly navigate and discover relevant literature.
Quick Start & Requirements
This is a curated list of papers; there are no installation or execution requirements. Users can directly access the papers via the provided PDF links.
Highlighted Details
Maintenance & Community
The repository is maintained by zzw922cn. Further community engagement or roadmap details are not specified in the README.
Licensing & Compatibility
The repository itself is a list of links to academic papers. The licensing of the individual papers would be governed by their respective publishers or authors. Compatibility for commercial use depends on the licenses of the linked papers.
Limitations & Caveats
This is a static list of papers and does not provide code, datasets, or implementations. The "interesting papers" section is subjective and may not be exhaustive.
1 year ago
Inactive