Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fail fast if we cannot build indexes #265

Merged
merged 1 commit into from
Aug 23, 2023

Conversation

danielvegamyhre
Copy link
Contributor

Fixes #263

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Aug 23, 2023
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: danielvegamyhre

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot requested a review from ahg-g August 23, 2023 17:15
@k8s-ci-robot k8s-ci-robot added approved Indicates a PR has been approved by an approver from all required OWNERS files. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Aug 23, 2023
@danielvegamyhre
Copy link
Contributor Author

cc @kannon92 @alculquicondor is this an appropriate way of handling the case where indexes cannot be built, or what is the typical approach here? I notice in the Kueue controller when setting up indexes, errors are only logged but program execution continues uninterrupted: https://github.com/kubernetes-sigs/kueue/blob/main/main.go#L178

However, as you can see in issue #263 in the case where indexes fail to build, JobSet controller is hitting a reconciliation error here and never starting the jobs due to these indexes not existing, so we need to handle this somehow. Perhaps if we get an error listing child jobs here, we should simply retry the call but without specifying an index (so it will be slower but at least work)?

Copy link
Contributor

@kannon92 kannon92 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 23, 2023
@k8s-ci-robot k8s-ci-robot merged commit 5fd390f into kubernetes-sigs:main Aug 23, 2023
k8s-ci-robot added a commit that referenced this pull request Aug 23, 2023
…-#265-upstream-release-0.2

Automated cherry pick of #265: fail fast if we cannot build indexes
@danielvegamyhre danielvegamyhre mentioned this pull request Dec 12, 2023
20 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bug: unable to setup indexes  
3 participants