-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimizer module overhaul #396
Conversation
* Before ```julia NadamScheduler(; mu0 = 0.99, delta = 0.004, gamma = 0.5, alpha = 0.96) ``` * After ```julia NadamScheduler(; μ₀ = 0.99, δ = 0.004, γ = 0.5, α = 0.96) ```
Blocker: #394 renames: * `get_momentum` -> `getmomentum` * `get_momentum_scheduler` -> `getmomsched` * `Momentum.Fixed.momentum` -> `Momentum.Fixed.μ`
Decouple learning rate update from OptimizationState, give more control for user to trigger update via `update!`
Let user can control it. And provide default value `1/batch_size` in high-level API `fit!` only.
[ci skip]
Ready for review. The motivation of this PR is building more elegant APIs than Python (thanks to good Unicode support of Julia's REPL and editor plugin), |
Thanks for the efforts.
|
I agree the your point of view on gradient clipping and rescaling. I want to hear what naming you want. We can list all of them here, then vote. |
Unfortunately, I don't have better naming suggestion apart from the more verbose |
Add another options:
|
Is it okay to omit the |
I did the renaming, please check it out. |
good to go? |
Sorry for the late reply. I still prefer with |
Thanks a lot. About the |
Alias sounds good! |
will sort out summary later...