Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-condition multi-database English LVCSR recipe #870

Closed
vijayaditya opened this issue Jun 27, 2016 · 13 comments
Closed

Multi-condition multi-database English LVCSR recipe #870

vijayaditya opened this issue Jun 27, 2016 · 13 comments

Comments

@vijayaditya
Copy link
Contributor

vijayaditya commented Jun 27, 2016

This is an extension to the Multi-database English LVCSR recipe being tracked in #699 .

Previously we found that the ASpIRE models performed better than fisher_english models when used off-the-shelf on new test sets. This is due to the data augmentation being used the ASpIRE recipe.

As the multi-database English LVCSR recipe is shaping up, I think it would be better to extend this recipe to have multi-condition training. This recipe would reside in the same directory as multi_en recipe ( #699 ). It would involve the creation of a new subdirectory local/multicondition which will house the scripts to download the data necessary for simulating reverberation and noise conditions (see #552 ) and nnet3 recipes (xent, xent+sMBR and chain) for acoustic model training.
This issue has been created to track the progress of this recipe.

The models trained using this recipe could most probably be our best off-the-shelf models, so the person involved in this project might learn about a lot of interesting research problems when these models are used by the community-at-large.

It would involve

  1. coordination with people working on WIP: Multi-database English LVCSR recipe #771 and Enhancements to wav-reverberate command #716 to use the new data reverberation tools.
  2. data preparation for new test sets (e.g. CHiME, REVERB-2014, AMI, Aurora-4) which will be decoded using these models
@vijayaditya vijayaditya added enhancement help wanted Please help us with this issue! labels Jun 27, 2016
@tomkocse
Copy link
Contributor

I have started working on this issue.

@tomkocse
Copy link
Contributor

@vijayaditya I am going to work on @guoguo12 's branch #771 and put my stuff in local/multicondition
Should i commit to his branch once i create the PR ?

For the RIRs, I can let the user to choose if they want real RIRs or simulated RIRs.
For the noise, @vimalmanohar could you please refer to me the noise dataset you are using ?

@guoguo12
Copy link
Contributor

@tomkocse: Let me know if you need any help getting the multi_en stuff set up!

@vijayaditya
Copy link
Contributor Author

@tomkocse cloning @guoguo12 's branch is dangerous as he will be rebasing his branch to ensure that his commits are not interleaved with other commits. This will enable him to do a squash merge when he wants to merge his branch.

I would recommend the following. Please continue working on other aspects of this issue as long as you can. Once @guoguo12 's branch is relatively stable we will merge his recipe into the master, as it will not break anything else. You could then start working on this project. Does that sound fine to you ?

@tomkocse In the mean time could you please complete #716 and #552 . I will try to complete the reviews of these PRs or at least request some one else to do it, if I am unable to do it.

@vijayaditya
Copy link
Contributor Author

@tomkocse Also remember that we decided to just download all the simulated RIRs rather than preparing them in the recipe, so you need not write any scripts for the RIR preparation.

@tomkocse
Copy link
Contributor

I am supposed to place the reverberation stuff in a new directory local/multicondition so i think the possibility of crashing with @guoguo12 's existing files will be low.
I have completed #716 (reverberate_data_dir.py) and #552 (wav-reverberate.cc) based on all the suggestions raised in the issue. (Actually i have been using them in my own task). I need some user feedback, e.g. if @vijayaditya or @vimalmanohar can pass me a pair of rir-list and noise-list that fit to your tasks so that i can make a more complicated test to my script.

@vijayaditya
Copy link
Contributor Author

@tomkocse Sorry for the delay w.r.t. the RIR and noise lists. Could you please create these lists to have a combination of all the different Kaldi non-table and table IO types
e.g. a wav file, a command to create a wav file, an index into an ark file, etc.

@tomkocse
Copy link
Contributor

tomkocse commented Jul 4, 2016

@vijayaditya The rir and noise are wave files, so yes, different IO types like the location of the wave file or a pipeline to create the wave file should be supported, but i don't understand why " an index into an ark file" is needed.
So could you please clarify whether you want me to create the RIR and noise lists or you are going to pass me one ?

@vijayaditya
Copy link
Contributor Author

vijayaditya commented Jul 4, 2016

@tomkocse I thought support for ark files would be desirable as we might have a lot of very short noise files which can be better stored in ark format. However this is not immediately necessary as the current noise database we have access to (NOISEX) has comparatively small number of noise files.

Could you please generate the files yourself. I am a bit swamped and @vimalmanohar might be busy too. Once you run this test I will check this PR.

@vijayaditya
Copy link
Contributor Author

@tomkocse #771 has been merged. Would you be able to work on the nnet3-chain recipes and add data perturbation with reverberation as an optional stage ? This script can reside in local/chain/.

@tomkocse
Copy link
Contributor

OK

@stale
Copy link

stale bot commented Jun 19, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale Stale bot on the loose label Jun 19, 2020
@stale
Copy link

stale bot commented Jul 19, 2020

This issue has been automatically closed by a bot strictly because of inactivity. This does not mean that we think that this issue is not important! If you believe it has been closed hastily, add a comment to the issue and mention @kkm000, and I'll gladly reopen it.

@stale stale bot closed this as completed Jul 19, 2020
@kkm000 kkm000 removed help wanted Please help us with this issue! stale Stale bot on the loose labels Jul 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants