Pipeline cannot find some services and directory. #11383
Comments
This error usually means MLMD issues. Please provide logs from
It has been a while. I actually reinstalled Kubeflow (after deleting it first, of course). Here are the pods:
metadata-grpc-deployment-c568bd446-krltx
metadata-writer-747d764c6d-m5hzq
profiles-deployment-5cdb548b74-nhsdt
workflow-controller-859c5ff4d8-rtnw2
dex-c9d5654fb-6dgld (from auth namespace)
cert-manager-cainjector-7cdfb576c5-mvk8k (from cert-manager namespace)
All of these pods seem to have the same problem: they either cannot find some endpoint or cannot retrieve anything from it.
I installed Kubeflow Pipelines in multi-user mode on a kind cluster, following the README.md in the manifests repo, and it worked fine: I could run pipelines on it successfully. Then my host machine restarted unexpectedly, and I have had a problem ever since.
(I have listed the pods below.) As you can see, the Kubeflow pods and their related pods such as dex and cert-manager are running, but my pipelines fail with the following error (the pod named arka-h2o-test-48gfr-system-dag-driver-4203517700 belongs to a workflow created from a pipeline):
Error logs
Pods
Services in the profile namespace
According to the manifest
Latest
Impacted by this bug? Give it a 👍.