GitHub - FRAMAX444/CNN-explainability-Earthquakes: This repository demonstrates how to use SHAP (SHapley Additive exPlanations) to interpret machine learning models trained on earthquake-related data. The goal is to provide insights into how features influence model predictions, enabling researchers and engineers to better understand and trust their models in the context of seismology.

Overview

This project explores the application of explainability techniques in deep learning models trained on seismic data. Inspired by the work of Laurenti et al., we focus on understanding what machine learning models learn when classifying foreshocks and aftershocks or predicting earthquake magnitude. Using SHAP (SHapley Additive exPlanations), we analyze the importance of input features derived from seismic waveforms, converted into spectrograms or feature vectors.

Content

Preprocessing Notebook

Purpose: Prepares the raw seismic waveforms for use in training models.
Steps:
- Combines waveforms from multiple stations.
- Converts signals into three-channel spectrograms (log-spectrograms or standard spectrograms).

CNN Training and SHAP Analysis (Foreshocks and Aftershocks)

Log-Spectrogram Notebook:
- Trains a CNN on three-channel log-spectrograms for binary classification (foreshocks vs. aftershocks).
- Uses SHAP to interpret the pixel contributions of the spectrograms to the classification.
Spectrogram Notebook:
- Trains a CNN on standard spectrograms for the same binary classification task.
- Applies SHAP to understand the role of frequency and time components.

Random Forest Regression for Magnitude Prediction

Purpose: Predicts earthquake magnitude using extracted features from raw waveforms.
Steps:
- Trains a Random Forest regression model.
- Applies SHAP to interpret feature contributions to magnitude predictions.

How to Use

Download the Dataset

We used the same dataset from Laurenti et al. article (only selecting pre and post waveforms), you can find it here.

Clone the repository

git clone https://github.com/FRAMAX444/CNN-explainability-Earthquakes
cd your_path/CNN-explainability-Earthquakes

Create and activate Virtual Environment

python3 -m venv venv
source venv/bin/activate

Install dependecies
```
pip install -r requirements.txt  
```

Results

1. Seismic Event Classification

Model: A CNN trained on spectrograms derived from seismic waveforms.
Datasets:
- NRCA dataset (station-specific, 5862 samples): Accuracy = 99.57%.
- Full dataset (multi-station, 55,617 samples): Accuracy = 95.75%.
- NRCA model applied to the full dataset: Accuracy = 50.19% (poor generalization).
SHAP Insights:
- Foreshocks: Significant features in the 10–25 Hz range shortly after P-wave arrivals.
- Aftershocks: Features more evenly distributed over time, with lower frequency contributions.

2. Earthquake Magnitude Prediction

Model: Random Forest Regressor with 100 trees.
Performance:
- R² = 0.8834
- MAE = 0.1052, RMSE = 0.1457
SHAP Insights:
- Key features: Mean spectral power, P-wave travel times, and peak counts.
- Spectral and temporal characteristics strongly influenced predictions.

Observations

Generalization Challenges:
- CNNs trained on single stations achieve high accuracy locally but fail to generalize to diverse datasets.
- Dataset diversity is crucial for robust seismic models.
Model Transparency:
- SHAP analysis provided critical insights into feature importance, bridging the gap between ML predictions and seismic domain knowledge.
Future Directions:
- Explore domain adaptation, metadata integration, and real-time monitoring applications.

References

Our Report
- What is Machine Learning Teaching Us? Explainable AI for Seismic Models
Project Presentation
- Results Presentation
Laurenti, Paolini et al. (Nature 2024)
- Probing the Evolution of Fault Properties During the Seismic Cycle with Deep Learning

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
__pycache__		__pycache__
media		media
shap_tensors		shap_tensors
trained_models		trained_models
.gitignore		.gitignore
README.md		README.md
preprocess.ipynb		preprocess.ipynb
preprocess_utils.py		preprocess_utils.py
regression.ipynb		regression.ipynb
requirements.txt		requirements.txt
training_log_spec_NRCA.ipynb		training_log_spec_NRCA.ipynb
training_log_spectrogram.ipynb		training_log_spectrogram.ipynb
training_spectrogram.ipynb		training_spectrogram.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Content

Preprocessing Notebook

CNN Training and SHAP Analysis (Foreshocks and Aftershocks)

Random Forest Regression for Magnitude Prediction

How to Use

Results

1. Seismic Event Classification

2. Earthquake Magnitude Prediction

Observations

References

About

Releases

Packages

Contributors 2

Languages

FRAMAX444/CNN-explainability-Earthquakes

Folders and files

Latest commit

History

Repository files navigation

Overview

Content

Preprocessing Notebook

CNN Training and SHAP Analysis (Foreshocks and Aftershocks)

Random Forest Regression for Magnitude Prediction

How to Use

Results

1. Seismic Event Classification

2. Earthquake Magnitude Prediction

Observations

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages