
[RFC] Add self.lr_schedulers() to LightningModule for manual optimization #6567

Merged
merged 4 commits into master from feat/add-lr_schedulers-in-manopt on Apr 9, 2021

Conversation

@akihironitta (Contributor) commented Mar 17, 2021

What does this PR do?

Part of #6379.

Before disabling lr_scheduler.step() in manual optimization in #6379, this PR adds self.lr_schedulers() so that users can call lr_scheduler.step() in LightningModule at arbitrary intervals in manual optimization.

Example:

class Model(LightningModule):
    def __init__(self):
        super().__init__()
        self.automatic_optimization = False

    def training_step(self, batch, batch_idx):
        # single scheduler
        scheduler = self.lr_schedulers()

        # multiple schedulers
        scheduler1, scheduler2 = self.lr_schedulers()
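
For example, a user could then step a scheduler at an arbitrary interval. A minimal sketch (the linear layer, loss, and the every-10-batches interval are illustrative, not part of this PR; note that until the follow-up PR disables automatic stepping, Lightning would still also call step() on its own):

import torch
from pytorch_lightning import LightningModule

class ManualModel(LightningModule):
    def __init__(self):
        super().__init__()
        self.automatic_optimization = False
        self.layer = torch.nn.Linear(32, 2)

    def training_step(self, batch, batch_idx):
        opt = self.optimizers()
        sch = self.lr_schedulers()

        loss = self.layer(batch).sum()
        opt.zero_grad()
        self.manual_backward(loss)
        opt.step()

        # step the scheduler every 10 batches instead of once per epoch
        if (batch_idx + 1) % 10 == 0:
            sch.step()

    def configure_optimizers(self):
        optimizer = torch.optim.SGD(self.parameters(), lr=0.1)
        scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=1)
        return [optimizer], [scheduler]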

TODO

  • [n/a] Update the docs — I'll update the docs in the follow-up PR that disables lr_scheduler.step() in manual optimization, since this PR by itself doesn't enable users to call step() in manual optimization.
  • Add a test

Before submitting

  • [RFC] Was this discussed/approved via a GitHub issue? (not for typos and docs)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • [n/a] Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • Did you update the CHANGELOG? (not for typos, docs, test updates, or internal minor changes/refactorings)

PR review

Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing, make sure you have read the Review guidelines. In short, see the following bullet list:

  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

Related to #6825.

@akihironitta added the feature (Is an improvement or enhancement) label Mar 17, 2021
@akihironitta added this to the 1.3 milestone Mar 17, 2021
Comment on lines +126 to +127
# ignore other keys "interval", "frequency", etc.
lr_schedulers = [s["scheduler"] for s in self.trainer.lr_schedulers]
@akihironitta (Contributor, Author) commented Mar 17, 2021

self.lr_schedulers() is supposed to be used in manual optimization, so even when dict keys like "interval" and "monitor" are defined in configure_optimizers(), this line ignores all of the keys except "scheduler". Related docs: https://pytorch-lightning.readthedocs.io/en/latest/common/optimizers.html#learning-rate-scheduling
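
For reference, a hypothetical configure_optimizers() illustrating this behavior (the optimizer, scheduler, and key values here are made up):

def configure_optimizers(self):
    optimizer = torch.optim.Adam(self.parameters(), lr=1e-3)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=1)
    return {
        "optimizer": optimizer,
        "lr_scheduler": {
            "scheduler": scheduler,
            "interval": "step",  # ignored by self.lr_schedulers()
            "frequency": 2,      # ignored by self.lr_schedulers()
        },
    }

# in manual optimization, self.lr_schedulers() returns just `scheduler`,
# not the surrounding dict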

@akihironitta changed the title from "[RFC] Add self.lr_schedulers() to LightningModule for manual optimization" to "[RFC] Add self.lr_schedulers() to LightningModule for manual optimization [WIP]" Mar 17, 2021
@akihironitta force-pushed the feat/add-lr_schedulers-in-manopt branch from b5128e0 to 5767487 on April 2, 2021 09:28
@codecov bot commented Apr 2, 2021

Codecov Report

Merging #6567 (07d25e9) into master (1bd5f36) will decrease coverage by 5%.
The diff coverage is 71%.

@@           Coverage Diff           @@
##           master   #6567    +/-   ##
=======================================
- Coverage      91%     87%    -5%     
=======================================
  Files         192     192            
  Lines       12190   12256    +66     
=======================================
- Hits        11144   10635   -509     
- Misses       1046    1621   +575     

@pep8speaks commented Apr 4, 2021

Hello @akihironitta! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-04-04 20:34:31 UTC

@akihironitta force-pushed the feat/add-lr_schedulers-in-manopt branch from 6cd0f73 to 5767487 on April 4, 2021 18:44
@akihironitta changed the title from "[RFC] Add self.lr_schedulers() to LightningModule for manual optimization [WIP]" to "[RFC] Add self.lr_schedulers() to LightningModule for manual optimization" Apr 4, 2021
@akihironitta marked this pull request as ready for review April 4, 2021 20:31
@carmocca added the ready (PRs ready to be merged) label Apr 7, 2021
@tchaton (Contributor) left a comment:

Looks neat!

@tchaton merged commit 5e4dfd7 into master Apr 9, 2021
@tchaton deleted the feat/add-lr_schedulers-in-manopt branch April 9, 2021 09:32
@maxoppelt (Contributor) commented May 19, 2021

I think this update introduced a new bug:

When automatic_optimization is disabled and you are using schedulers that require a metric, e.g. th.optim.lr_scheduler.ReduceLROnPlateau, an error is raised:

pytorch_lightning.utilities.exceptions.MisconfigurationException: The lr scheduler dict must include a monitor when a "ReduceLROnPlateau" scheduler is used. For example: {"optimizer": optimizer, "lr_scheduler": {"scheduler": scheduler, "monitor": "your_loss"}}

As soon as you define a monitor you get a warning:

/pytorch_lightning/utilities/distributed.py:69: RuntimeWarning: The lr scheduler dict contains the key(s) ['monitor'], but the keys will be ignored. You need to call "lr_scheduler.step()" manually in manual optimization.

PyTorch Lightning version 1.3.1

Edit: I do not know what the preferred solution might be, but changing the condition at L153 in optimizers.py to

    if scheduler['reduce_on_plateau'] and scheduler.get('monitor', None) is None and not is_manual_optimization:

solves this issue for me.
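
For context, a minimal configure_optimizers() that reproduces the problem (the Adam optimizer and learning rate are illustrative):

def configure_optimizers(self):
    # self.automatic_optimization = False is set in __init__
    optimizer = torch.optim.Adam(self.parameters(), lr=1e-3)
    scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer)
    # without a "monitor" key this raises the MisconfigurationException above;
    # adding one avoids the error but triggers the RuntimeWarning instead
    return {
        "optimizer": optimizer,
        "lr_scheduler": {"scheduler": scheduler},
    }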

@awaelchli (Contributor) commented:

@maxoppelt The lr schedulers need to be manually stepped in manual optimization. For schedulers that require the val_loss, that means they need to be stepped in the validation_epoch_end hook. Therefore, the monitor key in the dict must be omitted, and we have to turn off the error message in manual optimization. Is that correct?
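
For example, a sketch of manual stepping in validation_epoch_end (assuming validation_step returns a dict containing a "val_loss" tensor):

def validation_epoch_end(self, outputs):
    sch = self.lr_schedulers()
    # compute the monitored metric ourselves and pass it to the plateau
    # scheduler directly, instead of relying on a "monitor" key
    val_loss = torch.stack([out["val_loss"] for out in outputs]).mean()
    sch.step(val_loss)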

@maxoppelt (Contributor) commented:

Yes, that is a possible solution: disable raising the MisconfigurationException when automatic_optimization is False.

Another design choice could be: Disable the warning and provide access to the monitor key in training_epoch_end/validation_epoch_end.

Minor remark on the documentation: https://pytorch-lightning.readthedocs.io/en/latest/common/optimizers.html#learning-rate-scheduling-manual is misleading. Most schedulers have an epoch argument in their step method, i.e. they are meant to be stepped once per epoch, so one should not call scheduler.step() in training_step(). And passing epoch as an argument to the scheduler's step() emits an EPOCH_DEPRECATION_WARNING.
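
For illustration (a sketch; scheduler is any torch.optim.lr_scheduler instance):

scheduler.step()       # preferred
scheduler.step(epoch)  # emits EPOCH_DEPRECATION_WARNING in recent torch versions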

This could lead to misunderstandings when reading the docs. However, calling the scheduler in training_epoch_end() might be problematic when using multiple dataloaders or DDP training?

@awaelchli (Contributor) commented:

> Disable raising the MisconfigurationException when automatic_optimization is False.

My preference.

> Another design choice could be: Disable the warning and provide access to the monitor key in training_epoch_end/validation_epoch_end.

There is already a pattern for this: returning the value from the step method, or using torchmetrics. So I think we don't need another way. Or what, concretely, would your suggestion be?

> However, calling the scheduler in training_epoch_end() might be problematic when using multiple dataloaders or DDP training?

For multiple dataloaders, training_epoch_end() will receive the outputs for all of them, so I see no big problem here.
In DDP training we update the scheduler in each process, but when a metric is required, we probably want to update with the same metric value in all processes. We need to think of something here that requires minimal code changes from the user.
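
One possible pattern, sketched here under the assumption that each process computes a "val_loss", would be to all-reduce the metric before stepping, so every process sees the same value:

def validation_epoch_end(self, outputs):
    val_loss = torch.stack([out["val_loss"] for out in outputs]).mean()
    # average over all DDP processes so each one steps the scheduler
    # with an identical metric value
    if torch.distributed.is_available() and torch.distributed.is_initialized():
        torch.distributed.all_reduce(val_loss, op=torch.distributed.ReduceOp.SUM)
        val_loss /= torch.distributed.get_world_size()
    self.lr_schedulers().step(val_loss)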

Are you interested in sending a PR for the error message handling / doc improvements?
