This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
-
Updated
Dec 20, 2023 - Jupyter Notebook
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
Exploration of different audio features and CNN-based architectures for building an effective Speech Emotion Recognition (SER) system. The goal is to improve the accuracy of detecting emotions embedded in speech signals. The repository contains code, notebook, and detailed explanations of the experiments conducted.
Whisper AI is an automated speech recognition (ASR) system. It is open source and can be access via GitHub or HuggingFace. This is the simplest way to implement Whisper AI via Github using python Google Colab Notebook.
Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.
To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."