CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
-
Updated
Dec 16, 2024 - Python
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Phoneme segmentation using pre-trained speech models
Pretrained models, tools and resources for Persian ASR
The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.
Deep Learning Utilities for Audio Segmentation
Add a description, image, and links to the speech-segmentation topic page so that developers can more easily learn about it.
To associate your repository with the speech-segmentation topic, visit your repo's landing page and select "manage topics."