-
Notifications
You must be signed in to change notification settings - Fork 5.3k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[egs] wake-word recipes: use python3, add README (#4052)
- Loading branch information
Showing
11 changed files
with
37 additions
and
16 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
|
||
The Mobvoi dataset is a ~67-hour corpus of wake word corpus | ||
in Chinese covering 523 speakers. It is currently not publicly available. | ||
The wake word is "Hi Xiaowen" (in Pinyin). | ||
Each speaker’s collection includes positive utterances and negative utterances | ||
recorded with different speaker-to-microphone distance and different | ||
signal-to-noise (SNR) ratio where noises are from typical home environments. | ||
The dataset is provided by Mobvoi. Inc. | ||
|
||
The recipe is in v1/ | ||
|
||
The E2E LF-MMI recipe does not require any prior alignments for training | ||
LF-MMI, making the alignment more flexible during training. It can be optionally | ||
followed by a regular LF-MMI training to further improve the performance. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
|
||
The SNIPS dataset is a ~54-hour corpus of wake word corpus covering 3300 speakers. | ||
The wake word is "Hey Snips" pronounced with no pause between the two words. | ||
It contains a large variety of English accents and recording environments. | ||
Negative samples have been recorded in the same conditions than wake-word utterances. | ||
To download the dataset you need to follow the instructions on | ||
https://github.com/snipsco/keyword-spotting-research-datasets. It is provided | ||
by Snips, Paris, France (https://snips.ai) | ||
|
||
The recipe is in v1/ | ||
|
||
The E2E LF-MMI recipe does not require any prior alignments for training | ||
LF-MMI, making the alignment more flexible during training. It can be optionally | ||
followed by a regular LF-MMI training to further improve the performance. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters