The goal of this project is to build a neural network capable of classifying the audio clips in the UrbanSound8K dataset.
An in-depth overview of the project is available in the project report.
The project folder is structured as follows:
- data/ contains the raw and processed data. To reproduce the results, place the dataset's fold folders inside data/raw/audio and the metadata file inside data/raw/metadata
- models/ contains the trained models, namely the scaler and the PCA model used in the project
- notebooks/ contains the Jupyter notebooks used to execute the code
- src/ contains the data, model and utils sub-folders, with the code for the different parts of the project
- report/ contains the project report, written in LaTeX
The project uses the following libraries: pandas, numpy, matplotlib, tensorflow, librosa, dask, keras_nightly, keras, scikit_learn.
You can install them with the following command:
pip install -r src/requirements.txt
The methodology followed in the project is documented in the Jupyter notebooks.
In the first notebook, audio features are extracted using the Librosa library, and scaling is applied.
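A minimal sketch of this step is shown below; the file paths, feature choice, and scaler are assumptions (MFCC means and a StandardScaler are used here for illustration), not necessarily the exact choices made in the notebook.

```python
import glob
import librosa
import numpy as np
from sklearn.preprocessing import StandardScaler

def extract_features(path, sr=22050, n_mfcc=40):
    """Load a clip and summarize its MFCCs over time into one feature vector."""
    y, _ = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)

# Example: extract features for one fold and scale them
paths = glob.glob("data/raw/audio/fold1/*.wav")
X = np.vstack([extract_features(p) for p in paths])
scaler = StandardScaler().fit(X)
X_scaled = scaler.transform(X)
```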
In the second notebook, additional features are extracted and PCA is applied to reduce the dataset's dimensionality.
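Continuing from the scaled features in the sketch above, a PCA reduction with scikit-learn could look like this (the variance threshold is an assumption):

```python
from sklearn.decomposition import PCA

# Keep enough components to explain ~95% of the variance (threshold assumed)
pca = PCA(n_components=0.95).fit(X_scaled)
X_pca = pca.transform(X_scaled)
print(pca.n_components_, "components retained")
```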
In addition, image representations of the audio clips are extracted from the dataset for later training of a CNN.
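A common way to obtain such an image representation is a fixed-size log-mel spectrogram; the sketch below assumes this choice, as well as the mel-band and frame counts.

```python
import librosa
import numpy as np

def clip_to_image(path, sr=22050, n_mels=128, frames=128):
    """Convert a clip into a fixed-size log-mel spectrogram 'image'."""
    y, _ = librosa.load(path, sr=sr)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    log_mel = librosa.power_to_db(mel, ref=np.max)
    # Pad or truncate along the time axis so every clip has the same shape
    if log_mel.shape[1] < frames:
        log_mel = np.pad(log_mel, ((0, 0), (0, frames - log_mel.shape[1])))
    return log_mel[:, :frames]
```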
To determine which training set is best suited for the project, cross-validation is performed on the initial, scaled, extended, and PCA datasets obtained in the previous steps. The results are presented in the third notebook.
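As an illustration only: UrbanSound8K ships with predefined folds, but a generic stratified k-fold comparison of the candidate feature sets could be sketched as follows (the model architecture, epoch count, and variable names are assumptions).

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from tensorflow import keras

def make_mlp(n_features, n_classes=10):
    """Simple baseline MLP used only to compare the candidate datasets."""
    model = keras.Sequential([
        keras.layers.Input(shape=(n_features,)),
        keras.layers.Dense(256, activation="relu"),
        keras.layers.Dense(n_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

def cv_accuracy(X, y, n_splits=5):
    """Mean validation accuracy over stratified folds."""
    scores = []
    for tr, va in StratifiedKFold(n_splits, shuffle=True, random_state=0).split(X, y):
        model = make_mlp(X.shape[1])
        model.fit(X[tr], y[tr], epochs=30, batch_size=32, verbose=0)
        scores.append(model.evaluate(X[va], y[va], verbose=0)[1])
    return float(np.mean(scores))

# X_scaled / X_pca and the label vector y are placeholders from the earlier sketches
for name, X in {"scaled": X_scaled, "pca": X_pca}.items():
    print(name, cv_accuracy(X, y))
```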
After selecting the best dataset based on the cross-validation results, a random search is performed to optimize the network's hyperparameters. Details about the results, as well as the test set evaluation, can be found in the fourth notebook.
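The exact search tool is not specified here; a minimal hand-rolled random search over a few hyperparameters, using hypothetical X_best/y placeholders for the selected dataset and its labels, might look like this:

```python
import random
from tensorflow import keras

def build_model(n_features, units, dropout, lr, n_classes=10):
    """Small MLP parameterized by the hyperparameters being searched."""
    model = keras.Sequential([
        keras.layers.Input(shape=(n_features,)),
        keras.layers.Dense(units, activation="relu"),
        keras.layers.Dropout(dropout),
        keras.layers.Dense(n_classes, activation="softmax"),
    ])
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=lr),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

random.seed(0)
best_params, best_acc = None, 0.0
for _ in range(20):  # number of sampled configurations is arbitrary
    params = dict(units=random.choice([64, 128, 256, 512]),
                  dropout=random.uniform(0.0, 0.5),
                  lr=random.choice([1e-2, 1e-3, 1e-4]))
    model = build_model(X_best.shape[1], **params)
    history = model.fit(X_best, y, validation_split=0.2,
                        epochs=30, batch_size=32, verbose=0)
    val_acc = max(history.history["val_accuracy"])
    if val_acc > best_acc:
        best_params, best_acc = params, val_acc
print("best hyperparameters:", best_params, "val accuracy:", best_acc)
```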
The last notebook presents the results obtained with a convolutional neural network on the image dataset produced in the second notebook. This notebook was executed on Google Colab for performance reasons.
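A small CNN of the kind that could be trained on such spectrogram images is sketched below; the input shape and layer sizes are assumptions, not the architecture used in the notebook.

```python
from tensorflow import keras

def build_cnn(input_shape=(128, 128, 1), n_classes=10):
    """Compact CNN over log-mel spectrogram 'images' (shape assumed)."""
    model = keras.Sequential([
        keras.layers.Input(shape=input_shape),
        keras.layers.Conv2D(32, 3, activation="relu"),
        keras.layers.MaxPooling2D(),
        keras.layers.Conv2D(64, 3, activation="relu"),
        keras.layers.MaxPooling2D(),
        keras.layers.Flatten(),
        keras.layers.Dense(128, activation="relu"),
        keras.layers.Dense(n_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

cnn = build_cnn()
# X_img: array of spectrogram images with a trailing channel dimension (placeholder)
# cnn.fit(X_img, y, validation_split=0.2, epochs=50, batch_size=32)
```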