Bug: unable to setup indexes   #263

Closed
Tracked by #239
danielvegamyhre opened this issue Aug 22, 2023 · 5 comments · Fixed by #265
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@danielvegamyhre
Contributor

Users are reporting that when they apply a JobSet, the JobSet object is created but never starts running. After examining the logs, we found the following error:

2023-08-22T21:19:59Z    ERROR    Reconciler error    {"controller": "jobset", "controllerGroup": "jobset.x-k8s.io", "controllerKind": "JobSet", ....  "error": "Index with name field:.metadata.controller does not exist"}

The index on .metadata.controller is created in SetupIndexes, which is called from main.go here. If the indexes cannot be built, we would expect to see an "unable to setup indexes" error logged. Looking earlier in the logs, we do indeed see it:

2023-08-21T14:37:20Z    ERROR    setup    unable to setup indexes    {"error": "failed to get API group resources: unable to retrieve the complete list of server APIs: batch/v1: Get \"https://10.72.0.1:443/apis/batch/v1\": dial tcp 10.72.0.1:443: i/o timeout"}

/kind bug

@k8s-ci-robot added the kind/bug label Aug 22, 2023
@danielvegamyhre
Contributor Author

Perhaps adding retry logic with some kind of exponential backoff policy around the SetupIndexes call would be sufficient.

@danielvegamyhre
Contributor Author

@kannon92 I notice Kueue wraps its SetupIndexes call in the utilruntime.Must(...) helper. Perhaps we should implement something similar? This bug has been popping up a lot lately, so I want to fix it asap.
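
For readers unfamiliar with the pattern, the sketch below shows the fail-fast behavior being proposed. `utilruntime.Must` (from k8s.io/apimachinery/pkg/util/runtime) panics when given a non-nil error; the local `must` function imitates that so the example runs with the standard library alone. `setupIndexes` and its `apiServerReachable` parameter are hypothetical placeholders, not the real JobSet or Kueue code.

```go
package main

import (
	"errors"
	"fmt"
)

// must panics on a non-nil error. It is a local stand-in that imitates the
// fail-fast behavior of utilruntime.Must, written out here for illustration.
func must(err error) {
	if err != nil {
		panic(err)
	}
}

// setupIndexes is a hypothetical placeholder for the real SetupIndexes.
// It fails when the (simulated) API server cannot be reached.
func setupIndexes(apiServerReachable bool) error {
	if !apiServerReachable {
		return errors.New("failed to get API group resources: i/o timeout")
	}
	return nil
}

func main() {
	// If index setup fails, must panics and the process stops here,
	// instead of continuing with a controller that has no indexes.
	must(setupIndexes(true))
	fmt.Println("indexes ready, starting manager")
}
```

With this approach a failed index setup crashes the process at startup rather than leaving a controller that logs "Index with name ... does not exist" on every reconcile. In a cluster, the container would typically be restarted and try again.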

@kannon92
Contributor

kannon92 commented Aug 23, 2023

Yeah, I think that is a good idea. I think @vsoch actually ran into this but wasn't sure how to reproduce it.

@alculquicondor

Interesting. I wouldn't have expected SetupIndexes to fail.
For whatever reason, we haven't hit these issues. From the message, it looks like the apiserver was down.

Indeed it seems best to exit.

@vsoch
Contributor

vsoch commented Aug 23, 2023

Correct. I've seen this several times, usually when I install or uninstall something, and the only fix is to delete and re-create the entire cluster.
