Seismic signal dataset for AI-driven earthquake research
Top 83.1% on sourcepulse
The STanford EArthquake Dataset (STEAD) provides a comprehensive global collection of seismic signals for AI-driven earthquake analysis. It targets researchers and engineers in seismology and machine learning, offering a large, curated dataset to train and validate models for earthquake detection, characterization, and related tasks.
How It Works
STEAD organizes seismic waveforms into HDF5 files, with associated metadata in CSV files. This structure allows for efficient data access and filtering. The dataset includes both earthquake and noise waveforms, with detailed metadata such as event location, magnitude, and station information. The project also provides Python scripts for accessing, filtering, and processing the data, including converting raw waveforms to displacement, velocity, or acceleration using Obspy and instrument response information.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The dataset was last updated on May 25, 2020. Bug reporting is handled via GitHub issues or email. The primary author is S. M. Mousavi. Several studies have utilized STEAD, with their code repositories available as examples.
Licensing & Compatibility
The repository license is not explicitly stated in the README, but a "LICENSE" file is mentioned. Compatibility for commercial use or closed-source linking would require clarification of the specific license terms.
Limitations & Caveats
The README notes that some back azimuths in the current version may be misplaced and can be recalculated using Obspy. Less than 4% of noise data may have identical waveforms across components due to single-channel stations. The dataset is large, requiring substantial storage and bandwidth.
2 years ago
1 week