ML-LAB WS 2020/2021
The aim of this project is to implement audio signal classifiers and improve their performance. This has been done via two approaches:
- Approach 1: Denoising + Classification
- Approach 2: Data augmentation/Robust Training of end-to-end audio classifiers
This repo contains implementations of several baseline audio classifiers with a PyTorch Lightning wrapper:
- CRNN: Convolutional recurrent neural networks (From paper https://arxiv.org/abs/1609.04243)
- M11 & M18: Very Deep CNN (From paper https://arxiv.org/pdf/1610.00087.pdf)
The audio classifiers can be trained with two datasets: UrbanSound8K, which is publicly available, and BMW, which is accessible only for this project. The repo provides a Dataset and Dataloader for both datasets, plus functionality to add several types of audio transformations and audio augmentations. For certain additive augmentations, you may need to download background noise clips from the MUSAN dataset (https://www.openslr.org/17/). If a new dataset with the same folder structure as BMW is added, an annotation file with stratified k-fold cross-validation splits will be automatically generated when training on this dataset for the first time.
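An additive-noise augmentation of this kind can be sketched as follows. This is a minimal illustration, not the repo's actual implementation; the helper name `add_noise` and the SNR-based mixing are assumptions:

```python
import torch

def add_noise(clean: torch.Tensor, noise: torch.Tensor, snr_db: float) -> torch.Tensor:
    """Mix a background-noise clip (e.g. from MUSAN) into a clean
    waveform at a target signal-to-noise ratio given in dB.
    (Hypothetical sketch, not the repo's actual augmentation code.)"""
    # Tile the noise clip if it is shorter than the clean signal, then trim.
    if noise.numel() < clean.numel():
        reps = clean.numel() // noise.numel() + 1
        noise = noise.repeat(reps)
    noise = noise[: clean.numel()]
    clean_power = clean.pow(2).mean()
    noise_power = noise.pow(2).mean()
    # Scale the noise so that clean_power / noise_power matches the target SNR.
    scale = torch.sqrt(clean_power / (noise_power * 10 ** (snr_db / 10)))
    return clean + scale * noise
```

The same pattern extends to other additive augmentations by swapping the noise source.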
We implemented several robustness methods:
- Mixup: from paper https://arxiv.org/abs/1710.09412
- Label smoothing: from paper https://arxiv.org/abs/1512.00567
- SmoothADV: from paper https://arxiv.org/abs/1906.04584
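Mixup, for instance, trains on convex combinations of input pairs and their labels. A minimal sketch following the cited paper (the function name and signature here are illustrative, not the repo's API):

```python
import torch

def mixup(x: torch.Tensor, y: torch.Tensor, alpha: float = 0.2):
    """Mix a batch of inputs x and one-hot labels y with a coefficient
    sampled from Beta(alpha, alpha), as in https://arxiv.org/abs/1710.09412.
    (Illustrative sketch, not the repo's actual implementation.)"""
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    perm = torch.randperm(x.size(0))  # pair each example with a random partner
    x_mixed = lam * x + (1 - lam) * x[perm]
    y_mixed = lam * y + (1 - lam) * y[perm]
    return x_mixed, y_mixed
```

The mixed labels remain valid probability distributions, so the usual cross-entropy loss applies unchanged.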
Finally, the repo provides two approaches to evaluate model robustness: (1) Certification radius: We created a SmoothClassifier model wrapper for randomized smoothing procedures, which can also be used to calculate the certification radius. (2) Robust accuracy under adversarial attacks: L-inf and L-2 fast gradient attacks on pre-trained models with varying attack radii.
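The L-inf fast gradient attack used for the robust-accuracy evaluation can be sketched as follows. This is a minimal single-step FGSM illustration; the repo itself runs attacks via foolbox, and the helper name here is hypothetical:

```python
import torch

def fgsm_attack(model: torch.nn.Module, x: torch.Tensor,
                y: torch.Tensor, eps: float) -> torch.Tensor:
    """Single-step L-inf fast gradient attack: perturb x by eps in the
    direction of the loss gradient's sign.
    (Hypothetical sketch; the repo uses foolbox for its attacks.)"""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = torch.nn.functional.cross_entropy(model(x_adv), y)
    loss.backward()
    # The sign step keeps the perturbation inside the L-inf ball of radius eps.
    return (x_adv + eps * x_adv.grad.sign()).detach()
```

An L-2 variant replaces the sign step with a gradient normalized to unit L-2 norm and scaled by eps.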
We have created common routines to train, test and evaluate all models. Results from our experiments can be found on our wiki.
To get a local copy up and running follow these simple steps.
- pytorch-lightning. A lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
  pip install pytorch-lightning
- Optuna. An open-source hyperparameter optimization framework to automate hyperparameter search.
  pip install optuna
- foolbox. Fast adversarial attacks to benchmark the robustness of machine learning models in PyTorch, TensorFlow, and JAX.
  python3 -m pip install foolbox
  Note: to make sure that foolbox works for all models, it is better to use the modified foolbox library in the shared project folder.
- WavAugment. Provides a wrapper for data augmentation on audio data represented as PyTorch tensors.
  git clone git@github.com:facebookresearch/WavAugment.git && cd WavAugment && python setup.py develop
- Clone the repo
  git clone https://gitlab.lrz.de/ml-lab-winter-2020-21/project-1.git
- Install the audio_classification package
  cd audio_classification
  python3 -m pip install -e .
The repo provides a key-value based config system that can be used to obtain standard, common behaviors when running experiments.
Our config system is inspired by the Detectron2 framework and uses YAML. For example, crnn_bmw.yaml
lets you specify configurations to train a CRNN model on the BMW dataset.
More examples for configs can be found in /configs.
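A config in this Detectron2-inspired style might look roughly like the sketch below. The key names here are hypothetical, not copied from crnn_bmw.yaml; see /configs for the real schema:

```yaml
# Hypothetical sketch of a YAML config; actual keys live in /configs.
MODEL:
  NAME: crnn
DATASET:
  NAME: bmw
TRAIN:
  BATCH_SIZE: 32
  LR: 0.001
```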
In /notebooks, we provide several notebooks to demo the common use cases of our repo. The most important ones are:
- train_net.ipynb: Train models according to the YAML configurations.
- test_net.ipynb: Obtain the test accuracy, recall and precision with a pre-trained model. In addition, if it is a model with the SmoothClassifier wrapper, calculate the certification radius.
- HyperParamTuning_Optuna.ipynb: Hyperparameter tuning routines with Optuna.
- run_attacks.ipynb: Evaluate model robustness under adversarial attacks.
More useful notebooks can be found in /notebooks/additional/.
Project Link: https://gitlab.lrz.de/ml-lab-winter-2020-21/project-1