Neural Acoustic Word Embeddings for Switchboard

Overview:

This is a recipe for learning neural acoustic word embeddings for a subset of Switchboard. The models are explained in greater detail in Settle & Livescu, 2016 as well as Settle et al., 2017:

S. Settle and K. Livescu, "Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches," in Proc. SLT, 2016.
S. Settle, K. Levin, H. Kamper, and K. Livescu, "Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings," in Proc. Interspeech, 2017.

Steps:

Ensure access to installed dependencies.
- Python 3.6
- Tensorflow 1.5 (and numpy/scipy)
- kaldi
- kaldi-io-for-python
Clone repo.
Check that $KALDI_ROOT variable points to the location of installed/compiled kaldi. This can be set in your ~/.bashrc or in kaldi/path.sh.
Update kaldi/run.sh:
- set $swbd variable to your local switchboard datapath
- set $nj to number of desired jobs (default=8)
- set $stage to desired stage in feature creation (default=1)
- set $min_word_length to desired minimum length character sequence allowed for included words (default=6)
- set $min_audio_duration to minimum audio duration (in frames) allowed for included audio (default=50)
- set $min_train_occurrence_count to limit how common training words must have been (default=2, note: this must be >= 2 or siamese training will not work)
Navigate to kaldi directory and run "./run.sh". Now you should have the desired features.
Navigate to code directory and run "python main.py". This will train, evaluate, and save models.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
code		code
kaldi		kaldi
partitions		partitions
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Acoustic Word Embeddings for Switchboard

Overview:

Contents:

Steps:

About

Releases

Packages

Languages

shane-settle/neural-acoustic-word-embeddings

Folders and files

Latest commit

History

Repository files navigation

Neural Acoustic Word Embeddings for Switchboard

Overview:

Contents:

Steps:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages