Quaternion Neural Networks for 3D Sound Source Localization: Implementation using First Order Ambisonics.
The objective of our work is to build a working deep quaternion neural network (DQNN) based network that works with First Order Ambisonics data sets. In particular, we are going to extend DQNN, adding capabilities to both support pre-existing data sets (ansim, resim, etc.) and the FOA one in a smart, modular, performing way. Therefore, other metrics have been added like the SELD score, mainly used in the 2019 paper outcomes evaluation, and a tiny library for a graphical representation of the results.
This project can be easily executed using one of the two proposed notebooks:
The latter gives you the possibility to use a pre-loaded and pre-extracted dataset (~200GB).
A quick view of our CSV files.
|
|