Skip to content

o2buzzle/SR-Homework2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SR-Homework2: Isolated Word Speech Recognition

Within src/ is a full "toolchain" for performing speech recognition

label_fixer.py - Prepare labels to the correct format
sound_segmenter.py - Segment audio files using Audacity's simple label format
mfcc.py - MFCCs extraction from audio files
dtw.py - Simple naive implementation of the DTW algorithm for speech recognition
GMM-HMM.ipynb - Simple implementation of a GMMHMM model for speech recognition

To run:

  • Clone the repository
  • Install requirements - pip install -r requirements.txt
  • Put audio and label data into data/
  • (Optional) Fix labels if they are in the extended format
  • Segment the audio data (sound_segmenter.py)
  • Run the algorithms

Demo:

Demo

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published