ASR-TTS-paper-daily by halsay

Daily AI paper updates for ASR and TTS research

Created 1 year ago

514 stars

Top 60.1% on SourcePulse

Project Summary

This repository serves as a daily curated list of recent research papers in Automatic Speech Recognition (ASR) and Text-to-Speech (TTS). It aims to keep researchers, engineers, and practitioners updated on the latest advancements in speech technology, providing a centralized resource for tracking new publications.

How It Works

The project is a continually updated index of ASR and TTS research papers. Paper information is collected by scraping or by hand, and each entry records the publication date, title, and authors, with links to the PDF and code where available. This keeps the overview focused and current.
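The entry layout described above can be pictured with a small Python sketch. It is an illustration only: the source does not say how the repository stores or generates its list, so the `PaperEntry` class, the helper functions, and the sample entries (placeholder titles, authors, and example.org URLs) are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from itertools import groupby
from typing import Optional


@dataclass(frozen=True)
class PaperEntry:
    """One row of the index (hypothetical structure, not the repository's own)."""
    published: date
    title: str
    authors: tuple[str, ...]
    pdf_url: str
    code_url: Optional[str] = None  # None when the authors have not published code


def newest_first(entries):
    """Order entries by publication date, newest first, ties broken by title."""
    by_title = sorted(entries, key=lambda e: e.title)
    # sorted() is stable, so the title order survives within each date.
    return sorted(by_title, key=lambda e: e.published, reverse=True)


def group_by_date(entries):
    """Map each publication date to its entries, newest date first."""
    ordered = newest_first(entries)
    return {day: list(group) for day, group in groupby(ordered, key=lambda e: e.published)}


def format_entry(entry):
    """Render one entry as a single plain-text line."""
    code = entry.code_url if entry.code_url else "no code link"
    authors = ", ".join(entry.authors)
    return f"{entry.published.isoformat()} | {entry.title} | {authors} | {entry.pdf_url} | {code}"


# Placeholder data for illustration only; these are not real papers.
SAMPLE = [
    PaperEntry(date(2024, 1, 2), "Placeholder TTS paper", ("A. Author",),
               "https://example.org/tts.pdf"),
    PaperEntry(date(2024, 1, 3), "Placeholder ASR paper", ("B. Author", "C. Author"),
               "https://example.org/asr.pdf", "https://example.org/asr-code"),
]

if __name__ == "__main__":
    for day, papers in group_by_date(SAMPLE).items():
        for paper in papers:
            print(format_entry(paper))
```

Keeping `code_url` optional mirrors the fact that code links appear only when authors make them public.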

Quick Start & Requirements

Access: No installation required; access via the GitHub repository.
Requirements: A web browser and internet connection.
Links: The README provides direct links to papers and associated code where available.

Highlighted Details

Daily updates ensure the most current research is captured.
Comprehensive coverage of both ASR and TTS subfields.
Links to code repositories facilitate reproducibility and further research.
Organized by publication date for easy tracking of recent trends.

Maintenance & Community

The project is maintained by halsay, with updates appearing daily. Community interaction is primarily through GitHub's issue and pull request features.

Licensing & Compatibility

The repository itself contains only curated links and metadata. Lists of this kind are typically released under a permissive license such as MIT, which allows broad reuse, but no specific license is confirmed here. Individual papers retain their original licenses.

Limitations & Caveats

The primary limitation is that this is a curated list, not a functional tool. It depends on external sources being available and accurate, and a code link can be included only when the authors have made their code public.

Explore Similar Projects

speech-recognition-uk by egorsmkv

awesome-russian-speech by alphacep

Habibi-TTS by SWivid

kugelaudio-open by Kugelaudio

INTERSPEECH-2023-24-Papers by DmitryRyumin

FireRedTTS by FireRedTeam

parrots by shibing624

speech-synthesis-paper by wenet-e2e

ichigo by janhq

FunASR by modelscope

espnet by espnet

GPT-SoVITS by RVC-Boss