Voice dataset list for voice/sound computing
Top 22.7% on sourcepulse
This repository provides a comprehensive, curated list of over 95 open-source datasets for voice and sound computing, targeting researchers and developers in speech recognition, emotion detection, and audio analysis. It serves as a centralized resource to discover and access diverse audio data, accelerating development in the field.
How It Works
The project acts as a directory, categorizing datasets into "Speech datasets" and "Audio events and music datasets." Each entry includes a brief description, size, speaker/actor information, emotional categories, and specific use cases like ASR, emotion recognition, or source separation. This structured approach simplifies dataset discovery for various audio processing tasks.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The repository is maintained by jim-schwoebel. Feedback and new dataset suggestions are welcomed via a provided link.
Licensing & Compatibility
Dataset licenses vary; users must consult individual dataset licenses for usage restrictions. The repository itself is likely under a permissive license, but the datasets it lists have diverse licensing.
Limitations & Caveats
The repository is a curated list, not a data provider; users must download datasets individually. Some datasets may have specific non-commercial use restrictions or require payment (e.g., TIMIT).
1 year ago
Inactive