IMPORTANT NOTE

I am no longer working on this project. If you are curious about voice activity detection, I advise looking at more modern approaches based on deep learning.

Voice Activity Detector

Python code that applies a voice activity detector to a wave file. The detector is based on the ratio between the energy in the speech band and the total energy.

Requirements

numpy
scipy
matplotlib
tkinter (sudo apt install python3-tk)

Basic Idea

The input audio data is processed as follows:

Convert stereo to mono.
Move a window of 20ms along the audio data.
Calculate the ratio between the energy in the speech band and the total energy for each window.
If the ratio is above the threshold (0.6 by default), label the window as speech.
Apply median filter with length of 0.5s to smooth detected speech regions.
Represent speech regions as intervals of time.
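
The steps above can be sketched in a few lines of numpy and scipy. This is a minimal illustration, not the code in vad.py: the function names, the 300-3000 Hz speech band, and the use of non-overlapping windows are assumptions made for the example. Only the 20 ms window, the 0.6 threshold, and the 0.5 s median filter come from the description above.

```python
import numpy as np
from scipy.signal import medfilt


def to_mono(samples):
    """Average the channels if the signal is stereo (n_samples x n_channels)."""
    samples = np.asarray(samples, dtype=float)
    return samples.mean(axis=1) if samples.ndim == 2 else samples


def band_energy_ratio(window, rate, band=(300.0, 3000.0)):
    """Ratio of spectral energy inside `band` (Hz) to the total energy.

    The default band is an assumption made for this sketch.
    """
    spectrum = np.abs(np.fft.rfft(window)) ** 2
    freqs = np.fft.rfftfreq(len(window), d=1.0 / rate)
    total = spectrum.sum()
    if total == 0.0:
        return 0.0  # silent window: treat as non-speech
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    return spectrum[in_band].sum() / total


def detect_speech_intervals(samples, rate, window_s=0.02, threshold=0.6,
                            smooth_s=0.5, band=(300.0, 3000.0)):
    """Return a list of (start_s, end_s) tuples for detected speech regions."""
    mono = to_mono(samples)
    win = int(rate * window_s)
    n_windows = len(mono) // win if win > 0 else 0
    if n_windows == 0:
        return []

    # Label each non-overlapping window (assumption: no window overlap).
    ratios = np.array([
        band_energy_ratio(mono[i * win:(i + 1) * win], rate, band)
        for i in range(n_windows)
    ])
    labels = (ratios > threshold).astype(float)

    # Median filter over roughly `smooth_s` seconds; medfilt needs an odd kernel.
    kernel = max(1, int(round(smooth_s / window_s)))
    kernel += 1 - kernel % 2
    smoothed = medfilt(labels, kernel_size=kernel).astype(int)

    # Convert runs of 1s into time intervals.
    edges = np.diff(np.concatenate(([0], smoothed, [0])))
    starts = np.flatnonzero(edges == 1)
    ends = np.flatnonzero(edges == -1)
    return [(float(s * window_s), float(e * window_s))
            for s, e in zip(starts, ends)]
```

The `samples` and `rate` arguments could come from `scipy.io.wavfile.read`, which returns the sample rate and the data array. The median filter removes isolated windows that were mislabeled, so short dropouts inside speech and short bursts inside silence do not fragment the resulting intervals.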

How To

Basic usage:

Import the vad module.
Create an instance of the VoiceActivityDetector class with the full path to a wave file.
Run the method that detects speech regions.
Optionally, plot the original wave data and the detected speech regions.

Example Python script that saves the speech intervals to a JSON file:

./detectVoiceInWave.py ./wav-sample.wav ./results.json

Example Python code that plots the detected speech regions:

from vad import VoiceActivityDetector

filename = '/Users/user/wav-sample.wav'
v = VoiceActivityDetector(filename)
v.plot_detected_speech_regions()

Alexander USOLTSEV 2015 (c) MIT License
