Cosine-similarity penalty to discriminate sound classes in weakly-supervised sound event detection
This work is concerned with multi-label sound event detection when only weak labels are available for training. Weak annotations provide tags of audio events but do not provide temporal boundaries.
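For illustration, the two annotation levels could be represented as follows (file names, labels and timestamps are made up; the layout only loosely mimics the DCASE metadata files):

```python
# Illustrative only: weak vs. strong annotations for the same 10-second clip.
# File names, labels and timestamps are invented for this example.

weak_annotation = {
    "filename": "Y1234.wav",
    "event_labels": ["Speech", "Dog"],   # tags only, no temporal boundaries
}

strong_annotation = [                    # what SED must recover
    {"filename": "Y1234.wav", "onset": 0.5, "offset": 3.2, "event_label": "Speech"},
    {"filename": "Y1234.wav", "onset": 4.0, "offset": 6.7, "event_label": "Dog"},
]
```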
The contributions are:
- the use of a Multiple Instance Learning (MIL)-inspired loss function to perform sound event detection (SED),
- the introduction of a cosine-similarity penalty term to enhance the discriminative power of the network (see the loss sketch below).
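As a rough illustration of these two ideas, the snippet below sketches a weak-label (MIL-style) loss in TensorFlow/Keras: frame-level probabilities are max-pooled over time and compared to the clip-level tags with binary cross-entropy, and a penalty on the pairwise cosine similarity between per-class temporal activation profiles is added. This is only a sketch under assumptions (max pooling as the MIL aggregation, penalty applied to the temporal profiles, hypothetical weight `lam`); see the paper for the exact formulation.

```python
# Minimal sketch (not the authors' exact formulation) of a MIL-style loss
# with an added cosine-similarity penalty.
#   weak_labels: (batch, n_classes) clip-level tags
#   frame_probs: (batch, time, n_classes) frame-level sigmoid outputs
#   lam: hypothetical penalty weight
import tensorflow as tf

def mil_loss_with_cosine_penalty(weak_labels, frame_probs, lam=0.1):
    # MIL aggregation: max-pool frame probabilities over time -> clip-level scores
    clip_probs = tf.reduce_max(frame_probs, axis=1)            # (batch, n_classes)
    bce = tf.reduce_mean(
        tf.keras.losses.binary_crossentropy(weak_labels, clip_probs))

    # Cosine-similarity penalty between per-class temporal activation profiles
    profiles = tf.transpose(frame_probs, [0, 2, 1])            # (batch, n_classes, time)
    profiles = tf.math.l2_normalize(profiles, axis=-1)
    sim = tf.matmul(profiles, profiles, transpose_b=True)      # (batch, n_classes, n_classes)
    off_diag = sim - tf.linalg.diag(tf.linalg.diag_part(sim))  # drop self-similarity
    n_classes = tf.shape(sim)[-1]
    penalty = tf.reduce_sum(off_diag) / tf.cast(
        tf.shape(sim)[0] * n_classes * (n_classes - 1), tf.float32)

    return bce + lam * penalty
```

Such a function could be passed to `model.compile(loss=...)` for a model whose output is the frame-level probability tensor.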
This lengthy (but structured :)) Python 3 notebook lets you train and test two Convolutional Recurrent Neural Networks (CRNNs) in Keras with TensorFlow:
- the first one for audio tagging (AT), i.e. multi-label classification at recording level,
- the second one for SED, i.e. localization of the event boundaries within the recordings (an illustrative CRNN sketch is given below).
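The sketch below shows what such a CRNN could look like in Keras; the layer counts, filter sizes and pooling scheme are illustrative assumptions, not the exact architecture used in the notebook.

```python
# Illustrative CRNN sketch: conv blocks over log-mel spectrograms, a
# bidirectional GRU, frame-level sigmoid outputs, and max pooling over time
# for the clip-level (AT) output. Hyper-parameters are assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_crnn(n_frames=500, n_mels=64, n_classes=10):
    inp = layers.Input(shape=(n_frames, n_mels, 1))        # log-mel spectrogram
    x = inp
    for n_filters in (64, 64, 64):
        x = layers.Conv2D(n_filters, (3, 3), padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.Activation("relu")(x)                    # a GLU variant is sketched further below
        x = layers.MaxPooling2D((1, 2))(x)                  # pool frequency, keep time resolution
    x = layers.Reshape((n_frames, -1))(x)
    x = layers.Bidirectional(layers.GRU(64, return_sequences=True))(x)
    frame_probs = layers.TimeDistributed(
        layers.Dense(n_classes, activation="sigmoid"), name="frame_probs")(x)
    # Clip-level output for audio tagging: aggregate frame probabilities over time
    clip_probs = layers.GlobalMaxPooling1D(name="clip_probs")(frame_probs)
    return models.Model(inp, [frame_probs, clip_probs])
```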
There is also code to replace the ReLU activation functions with Gated Linear Units (GLUs), as was done in [1].
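For illustration, a GLU block can be written as two parallel convolutions, one of them sigmoid-gated, multiplied element-wise; the helper below is a sketch, not the exact code from the notebook.

```python
# Hedged sketch of a GLU convolutional block, as in [1]: one convolution gives
# the linear response, a second one (sigmoid) gives the gate, and the two are
# multiplied element-wise. Names and kernel sizes are illustrative.
from tensorflow.keras import layers

def glu_conv_block(x, n_filters, kernel_size=(3, 3)):
    linear = layers.Conv2D(n_filters, kernel_size, padding="same")(x)
    gate = layers.Conv2D(n_filters, kernel_size, padding="same",
                         activation="sigmoid")(x)
    return layers.Multiply()([linear, gate])
```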
This is a two-pass SED system: we first perform AT and then SED.
For SED, we only keep the temporal predictions of the classes that were tagged by the AT system.
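A minimal sketch of this decision rule (the thresholds are illustrative assumptions, not the tuned values from the notebook):

```python
# Two-pass decision rule: keep frame-level SED decisions only for classes
# tagged by the AT system at the clip level.
import numpy as np

def two_pass_decisions(frame_probs, clip_probs, at_threshold=0.5, sed_threshold=0.5):
    """frame_probs: (time, n_classes) SED outputs; clip_probs: (n_classes,) AT outputs."""
    tagged = clip_probs >= at_threshold              # classes kept by the AT pass
    frame_decisions = frame_probs >= sed_threshold   # raw frame-level decisions
    return frame_decisions & tagged[np.newaxis, :]   # zero out untagged classes
```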
Results on the DCASE 2018 task 4 test subset data
Feel free to contact me with any questions: thomas.pellegrini@irit.fr
If you use this code, please consider citing:
Thomas Pellegrini, Léo Cances. "Cosine-similarity penalty to discriminate sound classes in weakly-supervised sound event detection", arXiv preprint arXiv:1901.03146, 2019. Preprint: https://arxiv.org/abs/1901.03146
Reference
[1] Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley. "Large-scale weakly supervised audio classification using gated convolutional neural network", in Proc. ICASSP 2018. Preprint: https://arxiv.org/abs/1710.00343, code: https://github.com/yongxuUSTC/dcase2017_task4_cvssp