Scalability: Increase resync period or make parameterizable #100
Comments
I guess the problem is that the external-provisioner uses resync as its retry mechanism, so fixing this probably requires substantial changes to external-provisioner.
@msau42 Any idea what the standard is? How high can we go? 12 hours? It's 15 seconds for no better reasons than that it was copied from the PV controller (where it is so low only because the controller needs it to do its "transactions" in a timely manner) and, I guess, paranoia over imaginary scenarios in which missed events lead to data deletion (???)
Anyway, I can definitely try to decouple resync from retries; cleaning up the retry logic may solve #101 at the same time.
I'm not sure if there's a standard. Searching here, I see kubelet uses 15 minutes, and the component config default seems to be 12 hours. I think once we decouple resync from retries, we could at least increase it to something on the order of minutes.
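To illustrate what decoupling resync from retries could look like, here is a minimal, language-agnostic sketch in Python. It is not the external-provisioner's code: the `RetryQueue` class, its method names, and the delay values are all hypothetical. The idea is that a failed item is rescheduled with its own exponential backoff, so the global resync period no longer has to be short enough to act as the retry timer.

```python
import heapq
import itertools


class RetryQueue:
    """Per-item retries with exponential backoff, independent of any resync timer.

    Hypothetical sketch: names and defaults are assumptions, not a real API.
    """

    def __init__(self, base_delay=1.0, max_delay=300.0):
        self.base_delay = base_delay
        self.max_delay = max_delay
        self._failures = {}            # key -> consecutive failure count
        self._heap = []                # entries are (ready_at, seq, key)
        self._seq = itertools.count()  # tie-breaker so the heap never compares keys

    def add_after_failure(self, key, now):
        """Schedule key for a retry; the delay doubles per consecutive failure."""
        failures = self._failures.get(key, 0)
        delay = min(self.base_delay * (2 ** failures), self.max_delay)
        self._failures[key] = failures + 1
        heapq.heappush(self._heap, (now + delay, next(self._seq), key))
        return delay

    def forget(self, key):
        """Reset the backoff state for key after a successful sync."""
        self._failures.pop(key, None)

    def pop_ready(self, now):
        """Return the keys whose retry time has arrived, oldest first."""
        ready = []
        while self._heap and self._heap[0][0] <= now:
            ready.append(heapq.heappop(self._heap)[2])
        return ready
```

With something like this in place, the resync period only has to cover genuinely missed watch events, which is why a value on the order of minutes becomes plausible.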
Thanks, I've opened a PR here if you/anyone is interested: kubernetes-retired/external-storage#825. I did some basic testing and 15 minutes seems fine. Retries are now handled by the
/close
Will reconsider after we have better scale testing in place.
The default resync period of 15 seconds is IMO too short. Once there are many PVs in the system and multiple CSI plugins, there will be a lot of unnecessary churn. API watches are supposed to be more reliable now and shouldn't drop events that frequently.
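As a rough illustration of the "make parameterizable" half of the request, the sketch below exposes the resync period as a command-line option. It is only a sketch in Python: the `--resync-period` flag name, the `parse_duration` helper, and the 15-minute default are assumptions made for this example, not the provisioner's actual interface.

```python
import argparse
import re
from datetime import timedelta

# Supported unit suffixes (assumed for this sketch): seconds, minutes, hours.
_UNITS = {"s": "seconds", "m": "minutes", "h": "hours"}


def parse_duration(text):
    """Parse strings such as '15s', '15m' or '12h' into a timedelta."""
    match = re.fullmatch(r"(\d+)([smh])", text)
    if not match:
        raise argparse.ArgumentTypeError(f"invalid duration: {text!r}")
    value, unit = match.groups()
    return timedelta(**{_UNITS[unit]: int(value)})


def build_parser():
    """Build a parser with a hypothetical --resync-period option."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--resync-period",
        type=parse_duration,
        default=timedelta(minutes=15),  # assumed default, not an upstream value
        help="how often to relist objects, e.g. 15s, 15m, 12h",
    )
    return parser
```

An operator running many PVs and several CSI plugins could then pass a longer value such as `--resync-period 12h`, while small clusters keep the default.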