Set a timeout on the builds #866
Comments
Could this be mitigated by resource limits/quotas, to prevent any particular pod from hogging too much? I feel this issue is unlikely because of the testing Paketo buildpacks provide around their dependencies, and because a bad actor who can create an image or build would also be able to set the timeout for that image or build. That said, I would be curious to hear more about how this issue came up and whether it is something we should attempt to solve, potentially with your proposed solution.
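For reference, a minimal sketch of the resource-side mitigation mentioned above, assuming builds run in a dedicated namespace (the namespace name and the specific limits are illustrative, not a recommendation):

```yaml
# Illustrative only: a LimitRange that caps CPU/memory per container in a
# hypothetical "build" namespace, plus a ResourceQuota bounding total usage.
# This limits how many resources a runaway build pod can hog, but it does
# not bound how long that pod keeps running.
apiVersion: v1
kind: LimitRange
metadata:
  name: build-limits
  namespace: build
spec:
  limits:
    - type: Container
      default:
        cpu: "1"
        memory: 2Gi
      defaultRequest:
        cpu: 250m
        memory: 512Mi
---
apiVersion: v1
kind: ResourceQuota
metadata:
  name: build-quota
  namespace: build
spec:
  hard:
    requests.cpu: "8"
    requests.memory: 16Gi
    limits.cpu: "16"
    limits.memory: 32Gi
```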
This might still happen accidentally, and even Paketo is not able to sidestep such an issue. An example is the Python pip buildpack: pip currently doesn't terminate when resolving an unresolvable dependency tree, causing a build that never ends.
It seems the accidental case is actually the more likely one, simply because of the mitigations a cluster operator can already take on the resource-usage side. I can see this being valuable given that there is currently no way to cancel a hanging build without deleting it, and deleting it loses history that might be useful for identifying issues over time.
Is this something you experienced during your usage of kpack, or is this more theoretical?
I have definitely seen multiple tenants and builds caught out by a single bad package they happen to use. It causes long-running builds that users can be entirely unaware of until they look closely at their CI/CD pipelines. When the kp CLI is used with the wait feature, this also causes long-standing watches to be created on the API server.
Do you see this being more valuable as a field on the image spec, or as a config that an operator can set cluster- or namespace-wide?
It would be better to have timeouts and fail early than a never-ending stuck build, which also prevents users from submitting further builds until they cancel it. Some background on fields we could expose: https://kubernetes.io/docs/concepts/workloads/controllers/job/#handling-pod-and-container-failures. Both activeDeadlineSeconds and ttlSecondsAfterFinished would be nice to have (see the sketch below). As for setting the policy, I would imagine a good course of action would be to expose the field on the image spec and have it both defaulted and validated using webhooks. This would allow some flexibility and sane defaults for users while letting the operator set upper limits.
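For context, the two Job-level fields referenced in the linked docs look like this (the values and Job name are illustrative); kpack would need its own plumbing, but the semantics are what is being proposed:

```yaml
# Illustrative Kubernetes Job showing the two fields mentioned above.
# activeDeadlineSeconds fails the Job once it has been active this long;
# ttlSecondsAfterFinished garbage-collects the Job after it completes or fails.
apiVersion: batch/v1
kind: Job
metadata:
  name: example-build
spec:
  activeDeadlineSeconds: 3600      # fail the build after 1 hour
  ttlSecondsAfterFinished: 86400   # clean it up a day after it finishes
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: build
          image: example.com/builder:latest
```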
Currently it's possible for a malicious actor, or an improper dependency downloaded by a buildpack, to create a cluster DoS by hoarding resources with a never-ending build. kpack should expose activeDeadlineSeconds as a way to kill builds that run longer than a certain amount of time.
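A rough sketch of how such a field might look on the Image spec. This is purely hypothetical: the `build.activeDeadlineSeconds` field shown here is not part of the current kpack API, and the tag, builder, and source values are placeholders.

```yaml
# Hypothetical sketch only: "activeDeadlineSeconds" under spec.build is the
# proposed field, not an existing kpack Image field.
apiVersion: kpack.io/v1alpha2
kind: Image
metadata:
  name: sample-image
spec:
  tag: registry.example.com/sample
  builder:
    kind: ClusterBuilder
    name: default
  source:
    git:
      url: https://github.com/example/app
      revision: main
  build:
    activeDeadlineSeconds: 1800   # proposed: kill any build running longer than 30 minutes
```

A defaulting webhook could fill this in when it is omitted, and a validating webhook could reject values above an operator-configured ceiling, matching the defaulted-and-validated approach suggested in the comments above.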