
Fix incorrect handling of on_batch_end edge cases in run_training_batch #509

Merged
3 commits merged into Lightning-AI:master from jeffling:patch-2 on Nov 19, 2019

Conversation

@jeffling (Contributor) commented Nov 15, 2019

Before submitting

  • Was this discussed/approved via a GitHub issue? (not needed for typos or doc improvements)
  • Did you read the contributor guideline?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

This fixes a bug:

`ValueError: not enough values to unpack (expected 3, got 2)`

The error was raised when returning -1 from `on_batch_start` to exit early, and also when the batch was empty.
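As a minimal sketch of the failure mode and the fix (hypothetical standalone functions, not the actual Lightning source; the real code is in `run_training_batch`, linked below):

def run_training_batch_buggy(batch, early_stop_signal):
    """Sketch of the old behavior: the early-exit path returns only two values."""
    all_log_metrics = []
    if batch is None or early_stop_signal == -1:
        # BUG: callers unpack three values, so this raises
        # "ValueError: not enough values to unpack (expected 3, got 2)"
        return 0, all_log_metrics
    grad_norm_dic = {}
    # ... run the actual optimizer step, gather metrics, etc. ...
    return 0, grad_norm_dic, all_log_metrics

def run_training_batch_fixed(batch, early_stop_signal):
    """Sketch of the fix: every return path yields the same 3-tuple shape."""
    grad_norm_dic = {}
    all_log_metrics = []
    if batch is None or early_stop_signal == -1:
        return 0, grad_norm_dic, all_log_metrics
    # ... run the actual optimizer step, gather metrics, etc. ...
    return 0, grad_norm_dic, all_log_metrics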

For future work, we should have a test for this regression.
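Such a regression test could look roughly like the following (hypothetical, written against the sketch above rather than the real Trainer API):

def test_early_exit_returns_three_values():
    # Early exits must produce the same 3-tuple shape as the happy path,
    # so the caller's unpacking never fails.
    result = run_training_batch_fixed(batch=None, early_stop_signal=-1)
    assert len(result) == 3
    batch_result, grad_norm_dic, all_log_metrics = result
    assert batch_result == 0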

https://github.com/williamFalcon/pytorch-lightning/blob/8f8cea1c5759524c6fc2eb33b972bba64e5ce0f4/pytorch_lightning/trainer/train_loop_mixin.py#L100

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

Commit: "This fixes a bug: `ValueError: not enough values to unpack (expected 3, got 2)`"

@jeffling jeffling changed the title from "Fix returning only 2 values (3 needed) on an early exit." to "Fix incorrect handling of on_batch_end edge cases in run_training_batch" on Nov 15, 2019

Commit: "The return value was actually a dict even though that variable is initialized as a list."
@jeffling (Contributor, Author) commented Nov 15, 2019

It's also very unfortunate that we initialize `all_log_metrics` as a list and turn it into a dict somewhere. Ideally, a variable keeps more or less the same type throughout. However, I don't have time to clean it up now; just noting this.
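To illustrate the type drift being described (a hypothetical reduction, not the actual code):

all_log_metrics = []                  # starts as a list of per-step metric dicts
all_log_metrics.append({'loss': 0.5})
all_log_metrics.append({'acc': 0.9})

# ... later the same name is rebound to a single merged dict ...
all_log_metrics = {k: v for d in all_log_metrics for k, v in d.items()}
# downstream code can no longer rely on one consistent type for this variable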

@Borda (Member) left a comment


I would even change its call from

output = self.run_training_batch(batch, batch_nb)
batch_result, grad_norm_dic, batch_step_metrics = output

to

batch_result, grad_norm_dic, batch_step_metrics = \
    self.run_training_batch(batch, batch_nb)

@jeffling (Contributor, Author) commented

I think we can do that as a follow-up; I'd like to fix the nit I pointed out as well. Right now this is breaking documented functionality, so we should merge this in as a hotfix and do a neatness refactor later.

@williamFalcon williamFalcon merged commit 619143a into Lightning-AI:master Nov 19, 2019
@williamFalcon (Contributor) commented

@jeffling thanks! please follow up on the issue you mentioned :)

@jeffling jeffling deleted the patch-2 branch November 22, 2019 17:16