
multiplier works weird #26

Open
alexwongdl opened this issue May 7, 2022 · 2 comments
@alexwongdl

When I modify the example code like this:

import torch
from torch.optim.lr_scheduler import StepLR, ExponentialLR
from torch.optim.sgd import SGD

from warmup_scheduler import GradualWarmupScheduler


if __name__ == '__main__':
    model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
    optim = SGD(model, 0.0001)

    # scheduler_warmup is chained with scheduler_steplr
    scheduler_steplr = StepLR(optim, step_size=10, gamma=0.1)
    scheduler_warmup = GradualWarmupScheduler(optim, multiplier=10, total_epoch=5, after_scheduler=scheduler_steplr)

    # this zero gradient update is needed to avoid a warning message, issue #8.
    optim.zero_grad()
    optim.step()

    for epoch in range(1, 20):
        scheduler_warmup.step(epoch)
        print(epoch, optim.param_groups[0]['lr'])

        optim.step()    # backward pass (update network)

I get an unexpected result; the learning rate at the sixth epoch is strange (a quick check of the expected schedule follows the output):

1 0.00028
2 0.00045999999999999996
3 0.00064
4 0.00082
5 0.001
6 0.0001    
7 0.001
8 0.001
9 0.001
10 0.001
11 0.001
12 0.001
13 0.001
14 0.001
15 0.0001
16 0.0001
17 0.0001
18 0.0001
19 0.0001
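
For reference, here is a small standalone sketch of what I assume the schedule should look like: a linear ramp from base_lr to multiplier * base_lr over total_epoch epochs, then StepLR(step_size=10, gamma=0.1) applied to the scaled LR. It reproduces every printed value except epoch 6, which should presumably be 0.001:

# Quick standalone check of the expected schedule (assumption: linear warmup
# from base_lr to multiplier * base_lr, then StepLR decay on the scaled LR).
base_lr, multiplier, total_epoch, step_size, gamma = 0.0001, 10, 5, 10, 0.1

for epoch in range(1, 20):
    if epoch <= total_epoch:
        # linear warmup ramp
        lr = base_lr * ((multiplier - 1.0) * epoch / total_epoch + 1.0)
    else:
        # StepLR decay applied to the warmed-up LR
        lr = base_lr * multiplier * gamma ** ((epoch - total_epoch) // step_size)
    print(epoch, lr)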
@lucasbrynte

I can confirm this behavior. I think line 31 in warmup_scheduler/scheduler.py is troublesome, and that

return self.after_scheduler.get_last_lr()

should rather be:

return self.after_scheduler.get_lr()

I do, however, think the whole scheduler would be easier and less error-prone to implement using the built-in PyTorch LR scheduler LinearLR for the warmup part, optionally chained with one or more other schedulers (the equivalent of "after_scheduler") using SequentialLR; see the sketch below.
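
A minimal sketch of that alternative (LinearLR and SequentialLR need a reasonably recent PyTorch, >= 1.10 or so; the parameter values below are my attempt to mirror multiplier=10, total_epoch=5 from the example above, not something prescribed by the library):

import torch
from torch.optim import SGD
from torch.optim.lr_scheduler import LinearLR, StepLR, SequentialLR

model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
optim = SGD(model, 0.001)  # the post-warmup target LR (0.0001 * 10)

# warmup: 0.1 * 0.001 -> 0.001 over 5 epochs, then hand off to StepLR
warmup = LinearLR(optim, start_factor=0.1, end_factor=1.0, total_iters=5)
steplr = StepLR(optim, step_size=10, gamma=0.1)
scheduler = SequentialLR(optim, schedulers=[warmup, steplr], milestones=[5])

for epoch in range(1, 20):
    optim.step()        # backward pass (update network) would go here
    scheduler.step()
    print(epoch, optim.param_groups[0]['lr'])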

@lucasbrynte

Just to nuance my comment: for some reason, users do not actually seem to be supposed to call .get_lr() themselves. It generates a warning message if called from anywhere other than .step(), where the call is wrapped in a with _enable_get_lr_call(self): statement.
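
A tiny demo of that warning, in case it helps (the exact message wording may differ between PyTorch versions):

import warnings
import torch
from torch.optim import SGD
from torch.optim.lr_scheduler import StepLR

params = [torch.nn.Parameter(torch.randn(2, 2))]
optim = SGD(params, lr=0.1)
sched = StepLR(optim, step_size=10, gamma=0.1)

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    sched.get_lr()        # called outside .step() -> warns to use get_last_lr()
    sched.get_last_lr()   # no warning
print([str(w.message) for w in caught])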
