-
-
Notifications
You must be signed in to change notification settings - Fork 109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Timeout and restart #72
Comments
I would say that any kind of logic related to this may need to be developed upstream as part of the github cache action / library. |
From practice, retying the job manually actually works. These issues can be caused by a variety of reasons, and a good portion of them is solvable by retry (since HTTP retry will likely land on a different load-balancer, might connect to a healthier upstream, etc). I've also considered these features to be implemented at upstream, but, in the end, the implementation on their end will be what, like a loop and a racing timer? And the configuration for both will have to be provided by the user of the code - thus it will have to pass through this action's config. Might make sense to just implement it here, it'll also certainly be quicker. Ultimately, I figured it'd be easier to address this here first, just in case you'd be ok with the implementation being here, at least at first. It can then be moved out into an external re-usable package, for instance. My main rationale for going here instead of the upstream is because I figured it would be quicker to land. |
Figured I'd go ahead and try to do it, ended up with #75. The downsides of this implementation are that:
I also noticed my editor reformats the source a bit via |
I have that as well, so I’m surprised it uses different formatting? |
I think I am running into the same problem. Here is an example: Link here: https://github.com/libp2p/rust-libp2p/actions/runs/3992478158/jobs/6848975863 The restore job in the screenshot has been running for ~45minutes. |
Ditto - https://github.com/streamnative/pulsar-rs/actions/runs/6709247198/job/18231865369 In my case the cache should be optional (best effort) so I'd like a timeout to give up restore cache and just start building. |
GitHub cache server sometimes crashes, and sometimes it gets super slow, emitting the cache (which is about 2 GB for us) at the <1mbps.
In both of these behaviors, the step crashes eventually (with an error in the first case, and with an overall job timeout in the second).
It would be nice to extend the logic of this action with:
Thanks for the great action btw, it does wonders for us already!
The text was updated successfully, but these errors were encountered: