Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Accommodate for tainted nodes to match kubeflow pipeline pods #2681

Closed
Shreyanand opened this issue Apr 26, 2022 · 1 comment · Fixed by #2848
Closed

Accommodate for tainted nodes to match kubeflow pipeline pods #2681

Shreyanand opened this issue Apr 26, 2022 · 1 comment · Fixed by #2848
Assignees
Labels
component:pipeline-editor pipeline editor component:pipeline-runtime issues related to pipeline runtimes e.g. kubeflow pipelines kind:enhancement New feature or request
Milestone

Comments

@Shreyanand
Copy link

Is your feature request related to a problem? Please describe.
Environment: Elyra UI on the Open Data Hub Jupyterhub

Feature: When we try to run a Kubeflow pipeline, a notebook that uses GPU fails to run. If we look at the scheduled pod it shows an error that it couldn't find the right node because of the taints on the nodes.

Error:
0/13 nodes are available: 3 Insufficient nvidia.com/gpu, 3 node(s) had taint {node-role.kubernetes.io/infra: }, that the pod didn't tolerate, 3 node(s) had taint {node-role.kubernetes.io/master: }, that the pod didn't tolerate, 4 node(s) had taint {odh/notebook: true}, that the pod didn't tolerate.

Describe the solution you'd like
There should be a way to add toleration to the pods scheduled for running gpu workloads. This would make sure that the pod is able to find the right tainted node.

Describe alternatives you've considered
Another way to solve this would be to remove the taints from the nodes but that is not a recommended practice.

cc @HumairAK

@ptitzler
Copy link
Member

Please subscribe to #2823 for updates.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:pipeline-editor pipeline editor component:pipeline-runtime issues related to pipeline runtimes e.g. kubeflow pipelines kind:enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants