speech-activity-detection

Here are 16 public repositories matching this topic...

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Nov 29, 2024
Jupyter Notebook

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

ina-foss / inaSpeechSegmenter

Star

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Updated Dec 2, 2024
Python

RicherMans / GPV

Star

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

machine-learning pytorch voice-activity-detection speech-activity-detection noise-robust-asr sound-activity

Updated Aug 3, 2023
Python

RicherMans / Datadriven-GPVAD

Star

The codebase for Data-driven general-purpose voice activity detection.

machine-learning pytorch voice-activity-detection speech-activity-detection noise-robust

Updated Aug 3, 2023
Python

HHousen / speaker-change-detection

Star

Speaker change detection using SincNet and an LSTM/Transformer

machine-learning transformers pytorch lstm audio-processing speech-activity-detection speaker-change-detection speaker-segmentation

Updated Jun 30, 2024
Jupyter Notebook

bigcash / awesome-vad

Star

A curated list of awesome voice activity detection

list awesome speech vad sad voice-activity-detection speech-activity-detection

Updated Nov 22, 2024

vimalmanohar / kaldi

Star

Fork of the official kaldi.

semi-supervised-learning transfer-learning domain-adaptation multilingual-speech-recognition speech-activity-detection lightly-supervised-training

Updated Mar 22, 2022
Shell

idiap / zff_vad

Star

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

machine-learning signal-processing audio-processing voice-activity-detection speech-activity-detection noise-robust

Updated Oct 19, 2023
Python

AmirHoseein99 / Depression-Engine

Star

Detecting depressed Patient based on Speech Activity, Pauses in Speech and Using Deep learning Approach

deep-learning hybrid-model sad speech-activity-detection depression-detection daic-woz

Updated Jan 5, 2023
Python

dangvansam / pyannote-onnx

Star

PyAnnote Voice Activity Detection (ONNX version)

vad audio-segmentation speech-separation onnx speech-activity-detection audio-split audio-splitter pyannote voice-ac

Updated Sep 9, 2023
Jupyter Notebook

rafaelgreca / voxseg-pytorch

Star

The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.

python speech cnn torch pytorch vad speech-processing voice-activity-detection bilstm speech-activity-detection speech-segmentation voxseg

Updated Oct 18, 2023
Python

ina-foss / InaGVAD

Star

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

aditya-joglekar / FS02_Scoring_Toolkit

Star

Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks

speech-recognition speech-processing speaker-diarization speaker-identification speech-activity-detection scoring-code

Updated Feb 16, 2021
Python

KF-R / turk-chat

Star

Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.

Updated Mar 3, 2024
Python

sajR / V-SAD

Star

machine-learning neural-network visual-speech-recognition speech-activity-detection

Updated Mar 4, 2021
Python

Improve this page

Add a description, image, and links to the speech-activity-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-activity-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-activity-detection

Here are 16 public repositories matching this topic...

pyannote / pyannote-audio

jtkim-kaist / VAD

ina-foss / inaSpeechSegmenter

RicherMans / GPV

RicherMans / Datadriven-GPVAD

HHousen / speaker-change-detection

bigcash / awesome-vad

vimalmanohar / kaldi

idiap / zff_vad

AmirHoseein99 / Depression-Engine

dangvansam / pyannote-onnx

rafaelgreca / voxseg-pytorch

ina-foss / InaGVAD

aditya-joglekar / FS02_Scoring_Toolkit

KF-R / turk-chat

sajR / V-SAD

Improve this page

Add this topic to your repo