Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Temporary name resolution failures on plugin BOM builds #3559

Closed
MarkEWaite opened this issue May 2, 2023 · 5 comments
Closed

Temporary name resolution failures on plugin BOM builds #3559

MarkEWaite opened this issue May 2, 2023 · 5 comments

Comments

@MarkEWaite
Copy link

Service(s)

ci.jenkins.io

Summary

The plugin bill of materials generates a new release when a maintainer (like me) runs the build on the master branch. Unfortunately, the master branch has failed the last few builds with an agent message "Temporary failure in name resolution"

apache/airflow#22319 (comment) suggests that we may need to configure the kubedns component with more replicas or we may need to configure it to autoscale when heavily loaded. I don't know how to do that, but the Kubernetes documentation at https://kubernetes.io/docs/tasks/administer-cluster/dns-horizontal-autoscaling/ may be helpful

Reproduction steps

  1. Open https://ci.jenkins.io/job/Tools/job/bom/job/master/1668/ or https://ci.jenkins.io/job/Tools/job/bom/job/master/1666/ and confirm that the job failed due to a temporary name resolution failure
@MarkEWaite MarkEWaite added the triage Incoming issues that need review label May 2, 2023
@lemeurherve lemeurherve added this to the infra-team-sync-2023-05-09 milestone May 2, 2023
@lemeurherve
Copy link
Member

lemeurherve commented May 2, 2023

I've manually restarted a build with withArtifactCachingProxy(false) to unblock it until I have the time to check this error, also noted by @dduportal in #3481 (comment)

@lemeurherve lemeurherve added artifact-caching-proxy and removed triage Incoming issues that need review artifact-caching-proxy labels May 2, 2023
@lemeurherve
Copy link
Member

I've removed the artifact caching proxy label from this issue as it doesn't seem related since the restarted build failed with the same error without it:

[ERROR] Unresolveable build extension: Plugin org.jenkins-ci.tools:maven-hpi-plugin:3.41 or one of its dependencies could not be resolved: Failed to read artifact descriptor for org.jenkins-ci.tools:maven-hpi-plugin:jar:3.41: The following artifacts could not be resolved: org.jenkins-ci.tools:maven-hpi-plugin:pom:3.41 (absent): Could not transfer artifact org.jenkins-ci.tools:maven-hpi-plugin:pom:3.41 from/to repo.jenkins-ci.org (https://repo.jenkins-ci.org/public/): repo.jenkins-ci.org: Temporary failure in name resolution: Unknown host repo.jenkins-ci.org: Temporary failure in name resolution -> [Help 2]

@dduportal
Copy link
Contributor

@lemeurherve can you check the logs of the core-dns application on the cluster cik8s ? Also check its metrics in datadog for this cluster.

Looks like the cluster's internal DNS is not robust

@dduportal
Copy link
Contributor

As compensating measures:

@dduportal
Copy link
Contributor

The builkds https://ci.jenkins.io/job/Tools/job/bom/job/master/1683/ and https://ci.jenkins.io/job/Tools/job/bom/job/master/1678/ ran without DNS errors: we can close this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants