Skip to content

Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Linde-Buzo-Gray (LBG) clustering algorithm

Notifications You must be signed in to change notification settings

miselaytes-anton/whospeaks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Who speaks?

Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Linde-Buzo-Gray (LBG) clustering algorithm. The implementation is based on this matlab tutorial. It gives about ~65% of correct results using this data set.

Prepare audio files

bash scripts/preprocess-audio.sh

Creates audio-preprocessed folder

Train

./src/cli.js train ./audio-preprocessed/train/

Creates training-results.json

Recognize

Requires training-results.json

./src/cli.js recognize ./audio-preprocessed/test/F101_test_1.wav

Test performance

Requires training-results.json

./src/cli.js test-performance ./audio-preprocessed/test/

About

Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Linde-Buzo-Gray (LBG) clustering algorithm

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published