Multi-condition multi-database English LVCSR recipe #870

vijayaditya · 2016-06-27T23:38:46Z

This is an extension to the Multi-database English LVCSR recipe being tracked in #699 .

Previously we found that the ASpIRE models performed better than the fisher_english models when used off-the-shelf on new test sets. This is due to the data augmentation used in the ASpIRE recipe.

As the multi-database English LVCSR recipe is shaping up, I think it would be better to extend this recipe to have multi-condition training. This recipe would reside in the same directory as multi_en recipe ( #699 ). It would involve the creation of a new subdirectory local/multicondition which will house the scripts to download the data necessary for simulating reverberation and noise conditions (see #552 ) and nnet3 recipes (xent, xent+sMBR and chain) for acoustic model training.
This issue has been created to track the progress of this recipe.

The models trained using this recipe could most probably be our best off-the-shelf models, so the person involved in this project might learn about a lot of interesting research problems when these models are used by the community-at-large.

It would involve:

- coordination with people working on WIP: Multi-database English LVCSR recipe #771 and Enhancements to wav-reverberate command #716 to use the new data reverberation tools.
- data preparation for new test sets (e.g. CHiME, REVERB-2014, AMI, Aurora-4) which will be decoded using these models.


tomkocse · 2016-06-30T03:25:30Z

I have started working on this issue.

tomkocse · 2016-06-30T05:39:12Z

@vijayaditya I am going to work on @guoguo12's branch #771 and put my stuff in local/multicondition.
Should I commit to his branch once I create the PR?

For the RIRs, I can let the user choose whether they want real RIRs or simulated RIRs.
For the noise, @vimalmanohar could you please point me to the noise dataset you are using?

guoguo12 · 2016-06-30T07:23:59Z

@tomkocse: Let me know if you need any help getting the multi_en stuff set up!

vijayaditya · 2016-06-30T13:23:16Z

@tomkocse cloning @guoguo12 's branch is dangerous as he will be rebasing his branch to ensure that his commits are not interleaved with other commits. This will enable him to do a squash merge when he wants to merge his branch.

I would recommend the following. Please continue working on other aspects of this issue as long as you can. Once @guoguo12 's branch is relatively stable we will merge his recipe into the master, as it will not break anything else. You could then start working on this project. Does that sound fine to you ?

@tomkocse In the meantime, could you please complete #716 and #552? I will try to complete the reviews of these PRs, or at least ask someone else to do it if I am unable to.

vijayaditya · 2016-06-30T13:37:34Z

@tomkocse Also remember that we decided to just download all the simulated RIRs rather than preparing them in the recipe, so you need not write any scripts for the RIR preparation.

tomkocse · 2016-06-30T13:57:07Z

I am supposed to place the reverberation stuff in a new directory local/multicondition, so I think the chance of conflicting with @guoguo12's existing files will be low.
I have completed #716 (reverberate_data_dir.py) and #552 (wav-reverberate.cc) based on all the suggestions raised in the issue. (Actually, I have been using them in my own task.) I need some user feedback, e.g. if @vijayaditya or @vimalmanohar can pass me a pair of rir-list and noise-list that fit your tasks, I can run a more complicated test of my script.

vijayaditya · 2016-06-30T14:35:16Z

@tomkocse Sorry for the delay w.r.t. the RIR and noise lists. Could you please create these lists so that they cover a combination of the different Kaldi non-table and table IO types, e.g. a wav file, a command to create a wav file, an index into an ark file, etc.?
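
As a rough illustration of the three kinds of list entry mentioned above, here is a minimal Python sketch. It assumes Kaldi's usual input-specifier conventions (a trailing `|` marks a command whose output is read, a trailing `:<digits>` marks a byte offset into an archive). The helper name `classify_rxfilename` and all paths are hypothetical and are not taken from reverberate_data_dir.py or wav-reverberate.

```python
import re


def classify_rxfilename(entry: str) -> str:
    """Roughly classify a Kaldi-style input specifier (hypothetical helper)."""
    entry = entry.strip()
    if entry in ("", "-"):
        return "stdin"        # empty string or "-" conventionally means standard input
    if entry.endswith("|"):
        return "pipe"         # a command whose stdout supplies the wav data
    if re.search(r":\d+$", entry):
        return "ark_offset"   # file name followed by a byte offset into an archive
    return "file"             # plain path to a wav file


# Hypothetical entries that a test RIR or noise list might mix together.
examples = [
    "/data/noise/babble.wav",
    "sox /data/noise/babble.wav -r 8000 -t wav - |",
    "/data/noise/noises.ark:1234",
]

for e in examples:
    print(classify_rxfilename(e), e)
```

A test list that contains at least one entry of each kind would exercise all of these input paths.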

tomkocse · 2016-07-04T08:46:09Z

@vijayaditya The RIR and noise entries are wave files, so yes, different IO types such as the location of a wave file or a pipeline that creates the wave file should be supported, but I don't understand why "an index into an ark file" is needed.
So could you please clarify whether you want me to create the RIR and noise lists, or whether you are going to pass me one?

vijayaditya · 2016-07-04T18:46:56Z

@tomkocse I thought support for ark files would be desirable, as we might have a lot of very short noise files which can be better stored in ark format. However, this is not immediately necessary, as the current noise database we have access to (NOISEX) has a comparatively small number of noise files.

Could you please generate the files yourself? I am a bit swamped and @vimalmanohar might be busy too. Once you run this test I will check this PR.

vijayaditya · 2016-09-16T00:29:59Z

@tomkocse #771 has been merged. Would you be able to work on the nnet3-chain recipes and add data perturbation with reverberation as an optional stage ? This script can reside in local/chain/.

tomkocse · 2016-09-16T00:53:44Z

OK

stale · 2020-06-19T10:52:07Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale · 2020-07-19T05:24:52Z

This issue has been automatically closed by a bot strictly because of inactivity. This does not mean that we think that this issue is not important! If you believe it has been closed hastily, add a comment to the issue and mention @kkm000, and I'll gladly reopen it.

vijayaditya added the enhancement and help wanted labels Jun 27, 2016

vijayaditya mentioned this issue Jun 30, 2016

WIP : egs/aspire : added nnet3 xent, sMBR and chain recipes #875

Merged

vijayaditya mentioned this issue Jul 18, 2016

WIP: Multi-database English LVCSR recipe #771

Merged

vijayaditya mentioned this issue Sep 16, 2016

Multi-database English LVCSR recipe #699

Closed

stale bot added the stale label Jun 19, 2020

stale bot closed this as completed Jul 19, 2020

kkm000 removed the help wanted and stale labels Jul 19, 2020
