This repository has been archived by the owner on Jul 3, 2024. It is now read-only.

RMSProp epsilon=1.0, why? #19

Open
TimDettmers opened this issue May 16, 2020 · 0 comments


Thank you so much for this codebase. It helps a lot to make NAS more reproducible.

I have a question regarding RMSProp. I do not see RMSProp often in computer vision, but I guess that is fine, since the differences between optimizers are not that large. However, I see that you used epsilon=1.0, which I find odd: this is the constant that usually prevents division-by-zero errors, and you set it to a very high value. A value that high introduces a systematic bias in the variance estimate. Do you have any references for other public results that use this epsilon in conjunction with that high learning rate, or is there a particular reason for epsilon=1.0?
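
To make the concern concrete, here is a minimal sketch in plain Python. It is not taken from this codebase: the function names, the learning rate of 0.1, the decay of 0.9, and the choice to add epsilon outside the square root are all assumptions for illustration (some implementations add it inside the square root instead). It feeds a constant gradient into a scalar RMSProp update and compares the resulting step for a conventional small epsilon against epsilon=1.0.

```python
import math


def rmsprop_update(grad, sq_avg, lr=0.1, alpha=0.9, eps=1e-8):
    """One scalar RMSProp step (hypothetical helper, no momentum).

    Returns the parameter update and the new running average of
    squared gradients. Assumption: eps is added outside the sqrt.
    """
    sq_avg = alpha * sq_avg + (1.0 - alpha) * grad * grad
    update = lr * grad / (math.sqrt(sq_avg) + eps)
    return update, sq_avg


def steady_state_update(grad, eps, lr=0.1, alpha=0.9, steps=500):
    """Apply the same gradient repeatedly and return the final update."""
    sq_avg = 0.0
    update = 0.0
    for _ in range(steps):
        update, sq_avg = rmsprop_update(grad, sq_avg, lr, alpha, eps)
    return update


if __name__ == "__main__":
    for grad in (0.01, 0.02):
        small = steady_state_update(grad, eps=1e-8)
        large = steady_state_update(grad, eps=1.0)
        print(f"grad={grad}: eps=1e-8 -> {small:.6f}, eps=1.0 -> {large:.6f}")
```

Under these assumptions, the small-epsilon update settles close to the learning rate regardless of the gradient's magnitude, which is the normalization RMSProp is meant to provide. With epsilon=1.0 and gradients much smaller than 1, the denominator is dominated by epsilon, so the update is roughly lr * grad and the optimizer behaves much like plain SGD.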
