Skip to content

Audio tagging is the process of inferring descriptive labels from audio clips (Multi label classification task). This repository contains exploratory code/scripts for audio preprocessing and model fitting for the task of audio tagging and its applications.

Notifications You must be signed in to change notification settings

rahuls98/audio-tagging

Repository files navigation

audio-tagging

Audio tagging is the process of inferring descriptive labels from audio clips. This can be treated as a multi label classification task, involving the recognition and classification of audio samples in order to apply tags of various natures. To study the applications of audio tagging for real world problems, there is a need to develop a good general purpose audio tagging model. Different methods of audio preprocessing must also be explored for the same. This repository contains exploratory code/scripts for audio preprocessing and model fitting on the Freesound dataset provided by kaggle as a part of its FSD Audio Tagging Challenge (2019). In an effort to build a good pipeline for the process of audio tagging and it's applications, research is being done into multiple alternatives for the same. Post which, transfer learning can be performed using the developed model and preprocessing techniques, on applications that require real time audio processing and classification.


Datasets

About

Audio tagging is the process of inferring descriptive labels from audio clips (Multi label classification task). This repository contains exploratory code/scripts for audio preprocessing and model fitting for the task of audio tagging and its applications.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published