Within `src/` is a full "toolchain" for performing speech recognition

SR-Homework2: Isolated Word Speech Recognition

Within `src/` is a full "toolchain" for performing speech recognition

label_fixer.py - Prepare labels to the correct format
sound_segmenter.py - Segment audio files using Audacity's simple label format
mfcc.py - MFCCs extraction from audio files
dtw.py - Simple naive implementation of the DTW algorithm for speech recognition
GMM-HMM.ipynb - Simple implementation of a GMMHMM model for speech recognition

To run:

Clone the repository
Install requirements - pip install -r requirements.txt
Put audio and label data into data/
(Optional) Fix labels if they are in the extended format
Segment the audio data (sound_segmenter.py)
Run the algorithms

Demo:

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
media		media
src		src
.gitignore		.gitignore
README.md		README.md
pyvenv.cfg		pyvenv.cfg
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SR-Homework2: Isolated Word Speech Recognition

Within `src/` is a full "toolchain" for performing speech recognition

About

Releases

Packages

Languages

o2buzzle/SR-Homework2

Folders and files

Latest commit

History

Repository files navigation

SR-Homework2: Isolated Word Speech Recognition

Within src/ is a full "toolchain" for performing speech recognition

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Within `src/` is a full "toolchain" for performing speech recognition

Packages