Reload weights after plateau #3245

DanBmh · 2020-08-12T14:52:24Z

Reload checkpoint weights after reaching a plateau that we use the best_dev weights again

community-tc-integration · 2020-08-12T14:52:42Z

No Taskcluster jobs started for this pull request

The `allowPullRequests` configuration for this repository (in `.taskcluster.yml` on the
default branch) does not allow starting tasks for this pull request.

lissyx · 2020-08-13T00:08:15Z

@DanBmh Thanks, can you elaborate a little bit? My mind is kind of somewhere else, so I'm unsure I get the point here.

DanBmh · 2020-08-13T08:20:51Z

Currently training looks like this:

epoch 5: val_loss=62
e6: vl=59
e7: vl=60
e8: vl=61
Reached a plateau, LearningRate:=LR*0.1
e9: vl=60       <- Here we're using the weights from e8, with the suggested changes we're using e6 instead
e10: vl=58   <- We have an improvement but the network has to do some more work to fix the errors from e7+e8

The old approach did still work well but I think we can make it even better by reloading the weights from the best_dev checkpoint.

DanBmh · 2020-08-14T08:27:27Z

Not ready yet!
Found an error when using --drop_source_layers flag

DanBmh · 2020-08-14T09:13:01Z

Working again:)

lissyx

LGTM, I'd like Reuben's opinion.

reuben · 2020-08-19T12:56:07Z

training/mozilla_voice_stt_training/train.py

+                        # Reload checkpoint that we use the best_dev weights again
+                        load_or_init_graph_for_training(session, allow_drop_layers=False)


Please keep this function unchanged and add a new explicitly load_best_checkpoint function, we don't want this call to load last silently for example.

reuben · 2020-08-19T15:25:48Z

Re-opened as #3261 to run tests.

Tests #3245 Reload weights after plateau

reuben · 2020-08-20T07:48:41Z

Merged in #3261

Reload weights after plateau.

73e3309

DanBmh mentioned this pull request Aug 13, 2020

Freeze layers for transfer learning #3247

Open

DanBmh changed the title ~~Reload weights after plateau~~ WIP: Reload weights after plateau Aug 14, 2020

Don't drop layers in rlrop reload.

a862717

DanBmh changed the title ~~WIP: Reload weights after plateau~~ Reload weights after plateau Aug 14, 2020

lissyx requested a review from reuben August 18, 2020 16:07

lissyx approved these changes Aug 18, 2020

View reviewed changes

reuben suggested changes Aug 19, 2020

View reviewed changes

Reload graph with extra function.

b8b6ca8

DanBmh force-pushed the reload_rlrop branch from 435a2ef to b8b6ca8 Compare August 19, 2020 15:17

reuben approved these changes Aug 19, 2020

View reviewed changes

reuben added a commit that referenced this pull request Aug 20, 2020

Merge pull request #3261 from mozilla/reload-weights-plateau-tests

d14c2b2

Tests #3245 Reload weights after plateau

reuben closed this Aug 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reload weights after plateau #3245

Reload weights after plateau #3245

DanBmh commented Aug 12, 2020

community-tc-integration bot commented Aug 12, 2020

lissyx commented Aug 13, 2020

DanBmh commented Aug 13, 2020

DanBmh commented Aug 14, 2020

DanBmh commented Aug 14, 2020

lissyx left a comment

reuben Aug 19, 2020

reuben commented Aug 19, 2020

reuben commented Aug 20, 2020

		# Reload checkpoint that we use the best_dev weights again
		load_or_init_graph_for_training(session, allow_drop_layers=False)

Reload weights after plateau #3245

Reload weights after plateau #3245

Conversation

DanBmh commented Aug 12, 2020

community-tc-integration bot commented Aug 12, 2020

lissyx commented Aug 13, 2020

DanBmh commented Aug 13, 2020

DanBmh commented Aug 14, 2020

DanBmh commented Aug 14, 2020

lissyx left a comment

Choose a reason for hiding this comment

reuben Aug 19, 2020

Choose a reason for hiding this comment

reuben commented Aug 19, 2020

reuben commented Aug 20, 2020