Add gradient filter for tdnn_lstm_ctc #565

Open
wants to merge 7 commits into base: master

Conversation

yaozengwei
Collaborator

@yaozengwei yaozengwei commented Sep 5, 2022

This PR adds the gradient filter for tdnn_lstm_ctc recipe. You could see #564 for details.
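The filter itself is described in #564; as a rough sketch of the general idea (the function name and the median-based reference norm here are illustrative assumptions, not the PR's actual implementation), a per-utterance gradient-norm filter zeroes out the gradient contribution of utterances whose norm is far above the rest of the batch:

```python
import math

def filter_batch_grads(grads, threshold):
    """Sketch of a gradient filter (illustrative, not the PR's code).

    grads: list of per-utterance gradient vectors (lists of floats).
    Any utterance whose gradient norm exceeds `threshold` times the
    batch median norm has its gradient zeroed, so a few pathological
    utterances cannot dominate the parameter update.
    """
    norms = [math.sqrt(sum(g * g for g in vec)) for vec in grads]
    ref = sorted(norms)[len(norms) // 2]  # median norm (upper-median for even n)
    return [vec if n <= threshold * ref else [0.0] * len(vec)
            for vec, n in zip(grads, norms)]
```

With a high threshold such as the grad_norm_threshold=100 used below, only extreme outliers are filtered, so training dynamics should be nearly unchanged unless a batch produces a truly anomalous gradient.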

@danpovey
Collaborator

danpovey commented Sep 6, 2022

@huangruizhe you can see whether this resolves your problem.

@huangruizhe
Contributor

huangruizhe commented Sep 9, 2022

Hi, I've tested the tdnn_lstm_ctc2 recipe with grad_norm_threshold=100, but the model behaves much like it did before the gradient filter was added: it diverges when the learning rate is 1e-3 (the recipe default), and only starts to converge when lr=1e-4.

Here is the tensorboard:

  1. Running this recipe directly (with grad_norm_threshold=100): tdnn_lstm_ctc2/train.py
    tensorboard

  2. Running the above configuration, and shuffling the whole librispeech train cuts.
    tensorboard

  3. The recipe before adding the gradient filter, and shuffling the whole librispeech train cuts: tdnn_lstm_ctc/train.py
    tensorboard

@danpovey
Collaborator

danpovey commented Sep 9, 2022

It will be hard to diagnose what's really going on here without looking at the diagnostics files (obtained by restarting from intermediate epochs and adding the flag --print-diagnostics=True; it should take about 5 minutes).

@yaozengwei
Collaborator Author

This recipe does not support using flag --print-diagnostics=True.

@danpovey
Collaborator

danpovey commented Sep 9, 2022

Ruizhe can figure out how to add the code from other recipes, and make a PR.
