Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] pub webhooks unexpectedly return error when PUB is NOT FOUND #1567

Closed
Spground opened this issue Apr 11, 2024 · 1 comment · Fixed by #1579
Closed

[BUG] pub webhooks unexpectedly return error when PUB is NOT FOUND #1567

Spground opened this issue Apr 11, 2024 · 1 comment · Fixed by #1579
Assignees
Labels
kind/bug Something isn't working kind/good-first-issue Good for newcomers

Comments

@Spground
Copy link
Contributor

What happened:

PUB may unexpectedly interrupt Pod gc issued by KCM, which can lead Pod leak if KCM gc did not retry or retry many hours later.

What you expected to happen:

PUB webhooks never interrupt Pod gc.

How to reproduce it (as minimally and precisely as possible):

Delete workload let's say Sts or CloneSet, then Pod will be deleted by KCM gc later. Sometimes, Pod to delete will be leaking there for a lone time.

Anything else we need to know?:

The root cause is we return error when PUB CR is deleted in RetryOnConflict. Related codes is here,

The solution is simple, just check error type as we can , ignore it if it is NotFound error.

Environment:

  • Kruise version:
  • Kubernetes version (use kubectl version):
  • Install details (e.g. helm install args):
  • Others:
@Spground Spground added the kind/bug Something isn't working label Apr 11, 2024
@zmberg zmberg assigned zmberg and unassigned FillZpp Apr 15, 2024
@zmberg
Copy link
Member

zmberg commented Apr 15, 2024

@Spground You are right, can you fix the bug?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working kind/good-first-issue Good for newcomers
Projects
None yet
3 participants