Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

volcano-admission restart error, failed calling webhook "mutatepod.volcano.sh" #2610

Closed
nature1995 opened this issue Dec 21, 2022 · 6 comments
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@nature1995
Copy link
Contributor

What happened:
volcano-admission pod cannot be restarted. After the restart, the pod cannot be scheduled, and an error is reported:

Internal error occurred: failed calling webhook "mutatepod.volcano.sh": 
Post "https://volcano-admission-service.volcano-system.svc:443/pods /mutate?timeout=10s": 
no endpoints available for service "volcano-admission-service

What you expected to happen:
Pod restarts normally

How to reproduce it (as minimally and precisely as possible):
delete the volcano-admission pod or restart the volcano-admission deployments

Anything else we need to know?:

Environment:

  • Volcano Version: v1.6
  • Kubernetes version (use kubectl version): 1.20.8
  • Cloud provider or hardware configuration: baidu
  • OS (e.g. from /etc/os-release): cent7.5
  • Kernel (e.g. uname -a):
  • Install tools:
  • Others:
@nature1995 nature1995 added the kind/bug Categorizes issue or PR as related to a bug. label Dec 21, 2022
@nature1995
Copy link
Contributor Author

This problem can be solved by delete the volcano-admission-service-pods-mutate and volcano-admission-service-pods-validate

kubectl delete mutatingwebhookconfigurations volcano-admission-service-pods-mutate
kubectl delete validatingwebhookconfigurations volcano-admission-service-pods-validate

refer the from #2346

We can find the namespaceSelector on volcano-admission-service-pods-mutate and namespaceSelector. But the selector is not work unless you add the label to the namespace. So, the promblem can be solved by add a label to the namespace's yaml.

    namespaceSelector:
      matchExpressions:
        - key: kubernetes.io/metadata.name
          operator: NotIn
          values:
            - volcano-system
            - kube-system
apiVersion: v1
kind: Namespace
metadata:
  name: volcano-system
  labels:
    kubernetes.io/metadata.name: volcano-system
---
apiVersion: v1
kind: Namespace
metadata:
  name: volcano-monitoring
  labels:
    kubernetes.io/metadata.name: volcano-monitoring

@hzxuzhonghu
Copy link
Collaborator

@hwdef #2346 cause this, ptal

@nature1995
Copy link
Contributor Author

@hwdef #2346 cause this, ptal

We use Baidu CCE. It not work on the old version, such as 1.20.8.

@hwdef
Copy link
Member

hwdef commented Dec 24, 2022

@hwdef #2346 cause this, ptal

We use Baidu CCE. It not work on the old version, such as 1.20.8.

yes, Your PR looks like it could fix this.

@hwdef
Copy link
Member

hwdef commented Jan 6, 2023

/close

@volcano-sh-bot
Copy link
Contributor

@hwdef: Closing this issue.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug.
Projects
None yet
Development

No branches or pull requests

4 participants