Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

deduper: Track job existence instead of running state #440

Merged
merged 1 commit into from
Dec 4, 2024

Conversation

DrJosh9000
Copy link
Contributor

@DrJosh9000 DrJosh9000 commented Nov 28, 2024

What

Reduce the deduper to only tracking job creation and deletion.

Why

A finished job (complete or failed) still exists in the cluster for some time afterwards. So (for deduplication purposes) it should still be treated as in-flight all the way until it is deleted.

This fixes a gap in #438: suppose a job fails without starting a pod. The controller will try to recreate it and fail. With #438 the job will remain in the deduping map and won't be removed, meaning that even once the job is cleaned up through TTL, the controller won't try to create it again. With this PR, the deduping map no longer contains the job after deletion.

@DrJosh9000 DrJosh9000 merged commit c7388e5 into main Dec 4, 2024
1 check passed
@DrJosh9000 DrJosh9000 deleted the deduper-job-existence branch December 4, 2024 00:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants