voice_datasets by jim-schwoebel

Voice dataset list for voice/sound computing

Created 7 years ago

2,207 stars

Top 19.8% on SourcePulse

View on GitHub

3 Experts Love This Project

Piotr Dąbkowski

Cofounder of ElevenLabs

Omar Sanseviero

DevRel at Google DeepMind

Patrick von Platen

Author of Hugging Face Diffusers; Research Engineer at Mistral

Project Summary

This repository provides a comprehensive, curated list of over 95 open-source datasets for voice and sound computing, targeting researchers and developers in speech recognition, emotion detection, and audio analysis. It serves as a centralized resource to discover and access diverse audio data, accelerating development in the field.

How It Works

The project acts as a directory, categorizing datasets into "Speech datasets" and "Audio events and music datasets." Each entry includes a brief description, size, speaker/actor information, emotional categories, and specific use cases like ASR, emotion recognition, or source separation. This structured approach simplifies dataset discovery for various audio processing tasks.

Quick Start & Requirements

Access: Datasets are linked directly from the README. Download commands or links are provided per dataset.
Prerequisites: Varies by dataset; common requirements include sufficient disk space, internet bandwidth, and potentially specific audio processing libraries for local use.
Resources: Dataset sizes range from megabytes to hundreds of gigabytes.

Highlighted Details

Extensive coverage of speech datasets, including emotional speech, noisy environments, and diverse accents.
Includes datasets for audio event detection, music analysis, and environmental sound classification.
Links to related projects like "Voice Computing in Python" and the "Allie" framework.
Mentions datasets suitable for specific tasks like wake word detection and speaker identification.

Maintenance & Community

The repository is maintained by jim-schwoebel. Feedback and new dataset suggestions are welcomed via a provided link.

Licensing & Compatibility

Dataset licenses vary; users must consult individual dataset licenses for usage restrictions. The repository itself is likely under a permissive license, but the datasets it lists have diverse licensing.

Limitations & Caveats

The repository is a curated list, not a data provider; users must download datasets individually. Some datasets may have specific non-commercial use restrictions or require payment (e.g., TIMIT).

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

7 stars in the last 30 days