adding frame level dropout to TDNN+LSTM on AMI SDM1 #1264

GaofengCheng · 2016-12-14T14:09:10Z

No description provided.

GaofengCheng · 2016-12-14T14:10:02Z

@danpovey Could you help me merge this into ... #1248

danpovey · 2016-12-14T22:06:33Z

Submit your own PR into which you will merge, or have already merged, #1248.

danpovey · 2016-12-14T22:08:08Z

@vimalmanohar, can you please review this?

On Wed, Dec 14, 2016 at 6:09 AM, Gaofeng Cheng wrote:

Commit Summary
- dropout_schedule: Adding dropout schedule to scripts
- dropout_schedule: Add set-dropout-proportion in nnet3 utils
- dropout_schedule: Print dropout info
- dropout_schedule: Adding more comments and fixing bug
- dropout_schedule: Bug fix
- dropout_schedule: Fixed bug
- dropout_schedule: Fixing logging
- dropout_schedule: Not printing shrinkage when its 1.0
- change dropout_parser strategy
- adding frame level dropout to TDNN+LSTM on AMI SDM1 #1248

File Changes
- A egs/ami/s5b/local/chain/tuning/run_tdnn_lstm_1i_dp.sh (295)
- M egs/wsj/s5/steps/libs/nnet3/train/chain_objf/acoustic_model.py (20)
- M egs/wsj/s5/steps/libs/nnet3/train/common.py (233)
- M egs/wsj/s5/steps/libs/nnet3/train/frame_level_objf/common.py (18)
- M egs/wsj/s5/steps/libs/nnet3/xconfig/lstm.py (25)
- M egs/wsj/s5/steps/nnet3/chain/train.py (31)
- M egs/wsj/s5/steps/nnet3/train_dnn.py (14)
- M egs/wsj/s5/steps/nnet3/train_raw_dnn.py (14)
- M egs/wsj/s5/steps/nnet3/train_raw_rnn.py (29)
- M egs/wsj/s5/steps/nnet3/train_rnn.py (29)
- M src/nnet3/nnet-utils.cc (33)
- M src/nnet3/nnet-utils.h (3)
- M src/nnet3bin/nnet3-copy.cc (12)

Patch Links:
- https://github.com/kaldi-asr/kaldi/pull/1264.patch
- https://github.com/kaldi-asr/kaldi/pull/1264.diff

vimalmanohar · 2016-12-14T23:27:23Z

egs/ami/s5b/local/chain/tuning/run_tdnn_lstm_1i_dp.sh

+ihm_gmm=tri3  # the gmm for the IHM system (if --use-ihm-ali true).
+num_threads_ubm=32
+nnet3_affix=_cleaned  # cleanup affix for nnet3 and chain dirs, e.g. _cleaned
+dropout_schedule='0,0@0.20,0.5@0.50,0@0.50,0'


This will now crash with the new change. But it seems like in some cases this step change is required. So should I change the script to accept the step changes and retain the order?
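To make the step change concrete, here is a minimal sketch of how a schedule string like the one above could be parsed and checked. It is not the parser in common.py: the function names are hypothetical, and it assumes the first entry applies at data fraction 0 and the last at data fraction 1.

```python
def parse_dropout_schedule(schedule):
    """Parse 'p0,p1@f1,...,pN' into a list of (data_fraction, proportion).

    Assumption: the first value applies at data fraction 0.0 and the last
    at 1.0; every entry in between must be written as proportion@fraction.
    """
    parts = schedule.strip().split(",")
    if len(parts) < 2:
        raise ValueError("need at least a start and an end value: " + schedule)
    points = [(0.0, float(parts[0]))]
    for part in parts[1:-1]:
        fields = part.split("@")
        if len(fields) != 2:
            raise ValueError("expected proportion@data_fraction, got: " + part)
        points.append((float(fields[1]), float(fields[0])))
    points.append((1.0, float(parts[-1])))
    return points


def find_step_changes(points):
    """Return the data fractions shared by two consecutive entries."""
    return [cur[0] for prev, cur in zip(points, points[1:])
            if cur[0] == prev[0]]


points = parse_dropout_schedule('0,0@0.20,0.5@0.50,0@0.50,0')
steps = find_step_changes(points)
```

For the schedule in this script, `steps` contains 0.5, because both `0.5@0.50` and `0@0.50` sit at the same data fraction. A strict checker would reject that; a tolerant one has to decide which of the two values comes first.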

vimalmanohar · 2016-12-14T23:33:06Z

egs/ami/s5b/local/chain/tuning/run_tdnn_lstm_1i_dp.sh

+  for decode_set in dev eval; do
+      (
+      steps/nnet3/decode.sh --acwt 1.0 --post-decode-acwt 10.0 \
+          --nj $nj --cmd "$decode_cmd" \


Check that nj does not exceed the number of speakers in dev and eval, for both SDM and IHM; otherwise decoding may crash.
Use nj_dev = min(number of speakers, nj).
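As an illustration of the suggested cap, the sketch below counts speakers from the lines of a Kaldi `spk2utt` file (one speaker per line) and takes the minimum with `nj`. The helper names are hypothetical and the sample lines are made up for the example.

```python
def count_speakers(spk2utt_lines):
    """Count speakers given the lines of a spk2utt file.

    Each non-empty line is '<speaker-id> <utt-id> <utt-id> ...'.
    Accepts any iterable of strings, e.g. an open file handle.
    """
    return sum(1 for line in spk2utt_lines if line.strip())


def capped_num_jobs(nj, spk2utt_lines):
    """Return min(number of speakers, nj), as suggested in the review."""
    return min(nj, count_speakers(spk2utt_lines))


# Made-up example data: three speakers in the decode set.
example_lines = [
    "spk1 spk1_utt1 spk1_utt2\n",
    "spk2 spk2_utt1\n",
    "spk3 spk3_utt1 spk3_utt2 spk3_utt3\n",
]
nj_dev = capped_num_jobs(30, example_lines)
```

In the recipe itself the same value would be computed per decode set and passed to `--nj` in place of `$nj`.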

vimalmanohar · 2016-12-14T23:36:14Z

egs/wsj/s5/steps/libs/nnet3/train/common.py

-        dropout_values.append((0, float(parts[0])))
-
+        dropout_values.append((0, float(parts[0]))) 
+        data_fraction_one_previous='' # used to control situations like: 0.2@0.75,0@0.75


Can you make a pull request wrt my branch? I think we made conflicting changes.

vimalmanohar · 2016-12-14T23:37:19Z

egs/wsj/s5/steps/libs/nnet3/xconfig/lstm.py

@@ -339,6 +345,8 @@ def generate_lstm_config(self):
                                abs(delay)))
        affine_str = self.config['ng-affine-options']
        pes_str = self.config['ng-per-element-scale-options']
+        lstm_dropout_value = self.config['dropout-proportion']
+        lstm_dropout_str = 'dropout-proportion='+str(self.config['dropout-proportion'])


Use 'dropout-proportion={0}'.format( )
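A small sketch of the suggested form follows. The `config` dict is a hypothetical stand-in for `self.config`, and the value 0.2 is chosen only for illustration.

```python
# Hypothetical stand-in for self.config in the xconfig layer class.
config = {'dropout-proportion': 0.2}

# str.format converts the value itself, so no explicit str() or '+' is needed.
lstm_dropout_str = 'dropout-proportion={0}'.format(config['dropout-proportion'])
```

This yields the same string as the concatenation in the diff while keeping the option name and its value in one template.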

vimalmanohar · 2016-12-14T23:38:34Z

egs/wsj/s5/steps/libs/nnet3/xconfig/lstm.py

@@ -745,6 +761,7 @@ def set_default_configs(self):
                        'ng-affine-options' : ' max-change=1.5',
                        'zeroing-interval' : 20,
                        'zeroing-threshold' : 15.0
+


Remove extra line.

vimalmanohar · 2016-12-14T23:42:47Z

I think the best way is that @GaofengCheng changes this pull request to be wrt my branch in #1248. I will merge that PR to my branch and that will be merged to master in #1248.
I think this way it will be easier for someone to search for specific issues or PRs.

danpovey · 2016-12-14T23:56:54Z

I think it's better if he changes the function specification to not have a step change. I expect he meant @0.8 or something like that, anyway.

vimalmanohar · 2016-12-15T00:07:05Z

He changed the dropout parser to use num_archives + 1 if the number of archives is the same for two consecutive dropout specifications. That seems to work, at least for his case.
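One possible reading of that strategy is sketched below; it is not the code from this PR or #1248. Data fractions are mapped to archive counts, an entry that lands on the same count as its predecessor is moved one archive later, and the proportion is interpolated linearly in between. All names are hypothetical.

```python
def resolve_archive_counts(points, num_archives_to_process):
    """Map (data_fraction, proportion) pairs to (archive_index, proportion).

    If an entry lands on the same archive index as the previous one, it is
    moved to index + 1 so the order of the step is retained.
    """
    resolved = []
    for fraction, proportion in points:
        index = int(round(fraction * num_archives_to_process))
        if resolved and index <= resolved[-1][0]:
            index = resolved[-1][0] + 1
        resolved.append((index, proportion))
    return resolved


def dropout_at(resolved, archives_processed):
    """Linearly interpolate the dropout proportion at an archive count."""
    if archives_processed <= resolved[0][0]:
        return resolved[0][1]
    for (x0, y0), (x1, y1) in zip(resolved, resolved[1:]):
        if x0 <= archives_processed <= x1:
            return y0 + (y1 - y0) * (archives_processed - x0) / float(x1 - x0)
    return resolved[-1][1]


# The schedule '0,0@0.20,0.5@0.50,0@0.50,0' as (data_fraction, proportion).
points = [(0.0, 0.0), (0.2, 0.0), (0.5, 0.5), (0.5, 0.0), (1.0, 0.0)]
resolved = resolve_archive_counts(points, 100)
```

With 100 archives, the second `@0.50` entry moves from archive 50 to archive 51, so the proportion ramps up to 0.5 at archive 50 and drops to 0 one archive later instead of producing a zero-width interval.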

GaofengCheng · 2016-12-15T00:36:05Z

This PR has been merged into #1248.

vimalmanohar · 2016-12-15T00:44:25Z

@GaofengCheng I added a new commit in #1248 for the same data fraction issue. See if that works.
Also you still need to address some of the other comments. I didn't fix those while merging into #1248.

GaofengCheng · 2016-12-15T00:52:29Z

@vimalmanohar Thanks, Vimal. I'll address the remaining issues later.

vimalmanohar and others added 13 commits December 6, 2016 14:41

dropout_schedule: Adding dropout schedule to scripts

e97df65

dropout_schedule: Add set-dropout-proportion in nnet3 utils

8d26ce0

Changing option

1424c57

dropout_schedule: Print dropout info

818d495

dropout_schedule: Adding more comments and fixing bug

3342dd8

dropout_schedule: Bug fix

f17b0fc

Conflicts: egs/wsj/s5/steps/libs/nnet3/train/common.py

dropout_schedule: Fixed bug

5a6a9b1

dropout_schedule: Fixing logging

4ece089

dropout_schedule: Not printing shrinkage when its 1.0

0dd66c1

Merging

f6d25a2

Merge branch 'master' of github.com:kaldi-asr/kaldi into dropout_sche…

635bb6e

…dule

change dropout_parser strategy

7109c43

adding frame level dropout to TDNN+LSTM on AMI SDM1 kaldi-asr#1248

5435f23

GaofengCheng changed the title ~~adding frame level dropout to TDNN+LSTM on AMI SDM1 #1248~~ adding frame level dropout to TDNN+LSTM on AMI SDM1 Dec 14, 2016

dropout_schedule: Add strict checking of dropout schedule

7899760

vimalmanohar reviewed Dec 14, 2016

Merge branch 'dropout_schedule' into nnet3-dropout

18404a9

GaofengCheng closed this Dec 15, 2016

GaofengCheng added a commit to GaofengCheng/kaldi that referenced this pull request Dec 15, 2016

fix issue kaldi-asr#1264

6c56b4c

GaofengCheng mentioned this pull request Dec 15, 2016

fix issue #1264 vimalmanohar/kaldi#7

Closed
