Keras (Tensorflow) implementation of the [paper].
- Going really deep does not seem to help much on this dataset. We clearly overfit very easily. Adding more regularization might help. I haven't tried to use the FC layers (though it has been implemented).
- We use the
fold10
folder for the testing set and the remaining for the training set. - Models implemented:
[x] M3
[x] M5
[x] M11
[x] M18
[x] M34 (ResNet)
Dataset can be downloaded here: http://urbansounddataset.weebly.com/urbansound8k.html
git clone https://github.com/philipperemy/very-deep-convnets-raw-waveforms.git
cd very-deep-convnets-raw-waveforms
sudo pip3 install -r requirements.txt
./run_all.sh # will run M3, M5, M11, M18 and M34