Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use gs://k8s-release-dev for kubernetes CI builds #22840

Merged
merged 7 commits into from
Jul 9, 2021

Conversation

spiffxp
Copy link
Member

@spiffxp spiffxp commented Jul 9, 2021

Part of umbrella issue to migrate the project away from the google.com-owned gs://kubernetes-release-dev GCS bucket: kubernetes/k8s.io#2318

I broke this up into multiple commits for ease of revert. Each commit message includes a list of jobs potentially impacted

Since one of them involves a kubekins-e2e change, a followup bump PR will be required for all changes to take effect.

@k8s-ci-robot k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. approved Indicates a PR has been approved by an approver from all required OWNERS files. area/config Issues or PRs related to code in /config area/images area/jobs area/kubetest area/release-eng Issues or PRs related to the Release Engineering subproject area/scenarios size/L Denotes a PR that changes 100-499 lines, ignoring generated files. sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. sig/release Categorizes an issue or PR as relevant to SIG Release. labels Jul 9, 2021
@k8s-ci-robot k8s-ci-robot added the sig/testing Categorizes an issue or PR as relevant to SIG Testing. label Jul 9, 2021
@spiffxp spiffxp force-pushed the jobs-use-k8s-release-dev branch 2 times, most recently from 323f6e8 to b5d82b4 Compare July 9, 2021 16:34
jobs affected:
- ci-kubernetes-build-deprecated
- ci-kubernetes-build-deprecated-1-21
- ci-kubernetes-build-1-20-deprecated
- ci-kubernetes-build-1-19-deprecated
jobs impacted (anything using --extract but not --extract-ci-bucket):
- ci-benchmark-kube-dns-master
- ci-benchmark-nodelocal-dns-master
- ci-containerd-e2e-cos-gce
- ci-containerd-e2e-cos-gce-1-4
- ci-containerd-e2e-cos-gce-1-4-canary
- ci-containerd-e2e-cos-gce-canary
- ci-containerd-e2e-ubuntu-gce
- ci-containerd-e2e-ubuntu-gce-canary
- ci-containerd-soak-cos-gce
- ci-cos-containerd-e2e-cos-gce
- ci-cos-containerd-e2e-cos-gce-canary
- ci-cos-containerd-e2e-ubuntu-gce
- ci-cos-containerd-e2e-ubuntu-gce-canary
- ci-cri-containerd-e2e-cos-gce
- ci-cri-containerd-e2e-cos-gce-alpha-features
- ci-cri-containerd-e2e-cos-gce-alpha-features-canary
- ci-cri-containerd-e2e-cos-gce-canary
- ci-cri-containerd-e2e-cos-gce-flaky
- ci-cri-containerd-e2e-cos-gce-flaky-canary
- ci-cri-containerd-e2e-cos-gce-ingress
- ci-cri-containerd-e2e-cos-gce-ip-alias
- ci-cri-containerd-e2e-cos-gce-ip-alias-canary
- ci-cri-containerd-e2e-cos-gce-proto
- ci-cri-containerd-e2e-cos-gce-proto-canary
- ci-cri-containerd-e2e-cos-gce-reboot
- ci-cri-containerd-e2e-cos-gce-reboot-canary
- ci-cri-containerd-e2e-cos-gce-serial
- ci-cri-containerd-e2e-cos-gce-slow
- ci-cri-containerd-e2e-cos-gce-slow-canary
- ci-cri-containerd-e2e-gce-device-plugin-gpu
- ci-cri-containerd-e2e-gce-device-plugin-gpu-canary
- ci-cri-containerd-e2e-ubuntu-gce
- ci-cri-containerd-e2e-ubuntu-gce-canary
- ci-gce-pd-csi-driver-latest-k8s-master-windows-2004
- ci-gce-pd-csi-driver-latest-k8s-master-windows-2004-migration
- ci-gce-pd-csi-driver-latest-k8s-master-windows-2019
- ci-gce-pd-csi-driver-latest-k8s-master-windows-2019-migration
- ci-gce-pd-csi-driver-latest-k8s-master-windows-20h2
- ci-gce-pd-csi-driver-latest-k8s-master-windows-20h2-migration
- ci-ingress-gce-e2e
- ci-ingress-gce-e2e-canary
- ci-ingress-gce-e2e-release-1-6
- ci-ingress-gce-e2e-scale
- ci-kubemci-ingress-conformance
- ci-kubernetes-e2e-autoscaling-hpa-cpu
- ci-kubernetes-e2e-autoscaling-vpa-actuation
- ci-kubernetes-e2e-autoscaling-vpa-admission-controller
- ci-kubernetes-e2e-autoscaling-vpa-full
- ci-kubernetes-e2e-autoscaling-vpa-recommender
- ci-kubernetes-e2e-autoscaling-vpa-updater
- ci-kubernetes-e2e-gce-alpha-api
- ci-kubernetes-e2e-gce-beta-stable1-downgrade-cluster
- ci-kubernetes-e2e-gce-beta-stable1-downgrade-cluster
- ci-kubernetes-e2e-gce-beta-stable1-downgrade-cluster-parallel
- ci-kubernetes-e2e-gce-beta-stable1-downgrade-cluster-parallel
- ci-kubernetes-e2e-gce-beta-stable1-gci-kubectl-skew
- ci-kubernetes-e2e-gce-beta-stable1-gci-kubectl-skew
- ci-kubernetes-e2e-gce-beta-stable1-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-beta-stable1-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-canary
- ci-kubernetes-e2e-gce-canary
- ci-kubernetes-e2e-gce-coredns-performance
- ci-kubernetes-e2e-gce-coredns-performance-nodecache
- ci-kubernetes-e2e-gce-gci-serial-sig-cli
- ci-kubernetes-e2e-gce-gpu-beta-stable1-cluster-downgrade
- ci-kubernetes-e2e-gce-gpu-beta-stable1-cluster-downgrade
- ci-kubernetes-e2e-gce-gpu-master-stable1-cluster-downgrade
- ci-kubernetes-e2e-gce-gpu-master-stable1-cluster-downgrade
- ci-kubernetes-e2e-gce-gpu-stable1-beta-cluster-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-beta-cluster-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-beta-master-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-beta-master-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-master-cluster-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-master-cluster-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-master-master-upgrade
- ci-kubernetes-e2e-gce-gpu-stable1-master-master-upgrade
- ci-kubernetes-e2e-gce-gpu-stable2-stable1-cluster-upgrade
- ci-kubernetes-e2e-gce-gpu-stable2-stable1-cluster-upgrade
- ci-kubernetes-e2e-gce-gpu-stable2-stable1-master-upgrade
- ci-kubernetes-e2e-gce-gpu-stable2-stable1-master-upgrade
- ci-kubernetes-e2e-gce-iscsi
- ci-kubernetes-e2e-gce-iscsi-serial
- ci-kubernetes-e2e-gce-kubedns-performance
- ci-kubernetes-e2e-gce-kubedns-performance-nodecache
- ci-kubernetes-e2e-gce-master-new-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-master-new-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-network-metric-measurement
- ci-kubernetes-e2e-gce-new-master-gci-kubectl-skew
- ci-kubernetes-e2e-gce-new-master-gci-kubectl-skew
- ci-kubernetes-e2e-gce-new-master-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-new-master-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-node-containerd-throughput
- ci-kubernetes-e2e-gce-node-throughput
- ci-kubernetes-e2e-gce-scalability-canary
- ci-kubernetes-e2e-gce-stable1-beta-gci-kubectl-skew
- ci-kubernetes-e2e-gce-stable1-beta-gci-kubectl-skew
- ci-kubernetes-e2e-gce-stable1-beta-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-stable1-beta-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster-new
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster-new
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster-new-parallel
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster-new-parallel
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster-parallel
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-cluster-parallel
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-master
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-master
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-master-parallel
- ci-kubernetes-e2e-gce-stable1-beta-upgrade-master-parallel
- ci-kubernetes-e2e-gce-stable1-stable2-downgrade-cluster
- ci-kubernetes-e2e-gce-stable1-stable2-downgrade-cluster
- ci-kubernetes-e2e-gce-stable1-stable2-downgrade-cluster-parallel
- ci-kubernetes-e2e-gce-stable1-stable2-downgrade-cluster-parallel
- ci-kubernetes-e2e-gce-stable1-stable2-gci-kubectl-skew
- ci-kubernetes-e2e-gce-stable1-stable2-gci-kubectl-skew
- ci-kubernetes-e2e-gce-stable1-stable2-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-stable1-stable2-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-stable2-stable1-gci-kubectl-skew
- ci-kubernetes-e2e-gce-stable2-stable1-gci-kubectl-skew
- ci-kubernetes-e2e-gce-stable2-stable1-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-stable2-stable1-gci-kubectl-skew-serial
- ci-kubernetes-e2e-gce-stable2-stable1-upgrade-cluster
- ci-kubernetes-e2e-gce-stable2-stable1-upgrade-cluster
- ci-kubernetes-e2e-gce-stable2-stable1-upgrade-cluster-new
- ci-kubernetes-e2e-gce-stable2-stable1-upgrade-cluster-new
- ci-kubernetes-e2e-gce-stable2-stable1-upgrade-master
- ci-kubernetes-e2e-gce-stable2-stable1-upgrade-master
- ci-kubernetes-e2e-gce-taint-evict
- ci-kubernetes-e2e-gci-gce-alpha-enabled-default
- ci-kubernetes-e2e-gci-gce-autoscaling
- ci-kubernetes-e2e-gci-gce-autoscaling-hpa-cm
- ci-kubernetes-e2e-gci-gce-autoscaling-migs
- ci-kubernetes-e2e-gci-gce-autoscaling-migs-hpa
- ci-kubernetes-e2e-gci-gce-containerd
- ci-kubernetes-e2e-gci-gce-coredns
- ci-kubernetes-e2e-gci-gce-coredns-nodecache
- ci-kubernetes-e2e-gci-gce-flaky
- ci-kubernetes-e2e-gci-gce-flaky-repro
- ci-kubernetes-e2e-gci-gce-ingress-manual-network
- ci-kubernetes-e2e-gci-gce-ip-alias
- ci-kubernetes-e2e-gci-gce-ipvs
- ci-kubernetes-e2e-gci-gce-kube-dns
- ci-kubernetes-e2e-gci-gce-kube-dns-nodecache
- ci-kubernetes-e2e-gci-gce-network-proxy-grpc
- ci-kubernetes-e2e-gci-gce-network-proxy-http-connect
- ci-kubernetes-e2e-gci-gce-proto
- ci-kubernetes-e2e-gci-gce-scalability-networkpolicies
- ci-kubernetes-e2e-gci-gce-scalability-node-killer
- ci-kubernetes-e2e-gci-gce-serial
- ci-kubernetes-e2e-gci-gce-serial-kube-dns
- ci-kubernetes-e2e-gci-gce-serial-kube-dns-nodecache
- ci-kubernetes-e2e-gci-gce-sig-cli
- ci-kubernetes-e2e-gci-gce-single-flake-attempt
- ci-kubernetes-e2e-gci-gce-slow
- ci-kubernetes-e2e-gci-gce-statefulset
- ci-kubernetes-e2e-kops-aws-sig-cli
- ci-kubernetes-e2e-prow-canary
- ci-kubernetes-e2e-snapshot
- ci-kubernetes-e2e-storage-selinux
- ci-kubernetes-e2e-ubuntu-gce-network-policies
- ci-kubernetes-e2e-windows-containerd-gce
- ci-kubernetes-e2e-windows-containerd-gce-1-21
- ci-kubernetes-e2e-windows-gce-2004
- ci-kubernetes-e2e-windows-gce-2004-1-21
- ci-kubernetes-e2e-windows-gce-2019
- ci-kubernetes-e2e-windows-gce-2019-1-18
- ci-kubernetes-e2e-windows-gce-2019-1-19
- ci-kubernetes-e2e-windows-gce-2019-1-20
- ci-kubernetes-e2e-windows-gce-2019-1-21
- ci-kubernetes-e2e-windows-gce-20h2
- ci-kubernetes-e2e-windows-gce-20h2-1-21
- ci-kubernetes-e2e-windows-gce-alpha-features
- ci-kubernetes-e2e-windows-gce-serial
- ci-kubernetes-e2e-windows-node-throughput
- ci-kubernetes-e2e-windows-node-throughput-iis
- ci-kubernetes-kubemark-100-canary
- ci-kubernetes-kubemark-100-gce
- ci-kubernetes-kubemark-100-gce-scheduler
- ci-kubernetes-kubemark-100-gce-scheduler-highqps
- ci-kubernetes-kubemark-500-gce
- ci-kubernetes-kubemark-500-gce-1-20
- ci-kubernetes-kubemark-500-gce-1-21
- ci-kubernetes-kubemark-500-gce-canary
- ci-kubernetes-kubemark-500-gce-stable2
- ci-kubernetes-kubemark-500-gce-stable3
- ci-kubernetes-kubemark-gce-scale
- ci-kubernetes-kubemark-gce-scale-canary
- ci-kubernetes-kubemark-gce-scale-scheduler
- ci-kubernetes-kubemark-gce-scale-scheduler-canary
- ci-kubernetes-kubemark-high-density-100-gce
- ci-kubernetes-local-e2e
- ci-kubernetes-soak-gce-gci
- ci-kubernetes-soak-gci-gce-beta
- ci-kubernetes-soak-gci-gce-stable1
- ci-kubernetes-soak-gci-gce-stable2
- ci-kubernetes-soak-gci-gce-stable3
- ci-kubernetes-storage-scalability
- ci-poseidon-e2e-gce
- pull-csi-driver-smb-e2e-gce
- pull-gcp-compute-persistent-disk-csi-driver-e2e-win2019
- pull-gcp-compute-persistent-disk-csi-driver-e2e-windows-20h2
- pull-kube-storage-version-migrator-disruptive
- pull-kube-storage-version-migrator-fully-automated-e2e
- pull-kube-storage-version-migrator-manually-launched-e2e
- pull-kubernetes-csi-csi-driver-host-path-windows
- pull-perf-tests-100-adhoc
- pull-perf-tests-benchmark-kube-dns
- pull-perf-tests-clusterloader2
- pull-perf-tests-clusterloader2-kubemark
- sig-auth-serviceaccount-admission-controller-migration
- sig-auth-serviceaccount-admission-controller-migration
job configs affected:
- ci-kubernetes-build-deprecated
- ci-kubernetes-build-deprecated-1-21
- ci-kubernetes-build-1-20-deprecated
- ci-kubernetes-build-1-19-deprecated
- anything that uses --scenario=kubernetes_bazel
jobs affected:
- anything that doesn't explicitly set KOPS_KUBE_RELEASE_URL that uses
  kops-e2e-runner.sh
job configs affected:
- e2e-kops-gce-latest
- e2e-kops-grid-scenario-ipv6-conformance
- e2e-kops-grid-scenario-ipv6-calico
- e2e-kops-grid-scenario-ipv6-cilium
- e2e-kops-aws-misc-arm64-ci
- e2e-kops-aws-misc-arm64-conformance
- e2e-kops-aws-misc-amd64-conformance
- e2e-kops-aws-k8s-latest
- pull-kops-e2e-k8s-ci
- pull-kops-e2e-k8s-ci-ha
- pull-kops-e2e-ipv6-conformance
- pull-kops-verify-cloudformation
- kops-postsubmit
jobs affected:
- ci-kubernetes-coverage-conformance
- ci-kubernetes-coverage-e2e-gci-gce
jobs affected:
- periodic-kubernetes-bazel-build-1-19
- ci-kubernetes-bazel-build-1-19
- periodic-kubernetes-bazel-build-1-20
- ci-kubernetes-bazel-build-1-20
@k8s-ci-robot k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Jul 9, 2021
@spiffxp spiffxp changed the title [WIP] use gs://k8s-release-dev for CI builds use gs://k8s-release-dev for kubernetes CI builds Jul 9, 2021
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: justaugustus, spiffxp

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit 5168b64 into kubernetes:master Jul 9, 2021
@k8s-ci-robot k8s-ci-robot added this to the v1.22 milestone Jul 9, 2021
@k8s-ci-robot
Copy link
Contributor

@spiffxp: Updated the job-config configmap in namespace default at cluster test-infra-trusted using the following files:

  • key kops-periodics-gce.yaml using file config/jobs/kubernetes/kops/kops-periodics-gce.yaml
  • key kops-periodics-misc2.yaml using file config/jobs/kubernetes/kops/kops-periodics-misc2.yaml
  • key kops-periodics-versions.yaml using file config/jobs/kubernetes/kops/kops-periodics-versions.yaml
  • key kops-presubmits-e2e.yaml using file config/jobs/kubernetes/kops/kops-presubmits-e2e.yaml
  • key kops-presubmits.yaml using file config/jobs/kubernetes/kops/kops-presubmits.yaml
  • key kubernetes-builds.yaml using file config/jobs/kubernetes/sig-release/kubernetes-builds.yaml
  • key 1.19.yaml using file config/jobs/kubernetes/sig-release/release-branch-jobs/1.19.yaml
  • key 1.20.yaml using file config/jobs/kubernetes/sig-release/release-branch-jobs/1.20.yaml
  • key 1.21.yaml using file config/jobs/kubernetes/sig-release/release-branch-jobs/1.21.yaml
  • key coverage.yaml using file config/jobs/kubernetes/sig-testing/coverage.yaml

In response to this:

Part of umbrella issue to migrate the project away from the google.com-owned gs://kubernetes-release-dev GCS bucket: kubernetes/k8s.io#2318

I broke this up into multiple commits for ease of revert. Each commit message includes a list of jobs potentially impacted

Since one of them involves a kubekins-e2e change, a followup bump PR will be required for all changes to take effect.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@spiffxp spiffxp deleted the jobs-use-k8s-release-dev branch July 9, 2021 21:14
@spiffxp
Copy link
Member Author

spiffxp commented Jul 9, 2021

This kicked off two image build jobs:

Before being so bold as to bump all the jobs to pickup the new kubekins, I'd like to ensure it's been built using the bootstrap image that was just pushed. So, a bump PR just for bootstrap image references: #22849

@spiffxp
Copy link
Member Author

spiffxp commented Jul 9, 2021

Now, a bump PR for kubekins-e2e: #22852

@spiffxp spiffxp mentioned this pull request Jul 9, 2021
@spiffxp
Copy link
Member Author

spiffxp commented Jul 9, 2021

Timeline:

@justaugustus
Copy link
Member

FYI @spiffxp @kubernetes/release-engineering -- the build-deprecated-** jobs have been failing

@CecileRobertMichon
Copy link
Member

CecileRobertMichon commented Jul 12, 2021

FYI all the Cluster API Azure/AWS/Openstack (possibly others too) conformance jobs on k8s main started failing over the weekend, I believe due to this PR.

kubernetes-sigs/cluster-api-provider-azure#1510

@spiffxp
Copy link
Member Author

spiffxp commented Jul 12, 2021

@justaugustus

FYI @spiffxp @kubernetes/release-engineering -- the build-deprecated-** jobs have been failing

I opened kubernetes/kubernetes#103647 to track, and this PR should address the issue: #22864

@spiffxp
Copy link
Member Author

spiffxp commented Jul 12, 2021

@CecileRobertMichon from what I can gather this was due to references hardcoded in your repo? or was this due to the deprecated build jobs falling stale?

Looking at the linked issue it seems like y'all addressed with kubernetes-sigs/cluster-api-provider-azure#1512. Let me know if there's anything more to do

@CecileRobertMichon
Copy link
Member

@CecileRobertMichon from what I can gather this was due to references hardcoded in your repo? or was this due to the deprecated build jobs falling stale?

The former, we have custom scripts that fetch artifacts to test Kubernetes CI versions which started failing because they were referencing gs://kubernetes-release-dev and gcr.io/kubernetes-ci-images. We should be good now and have a handle on the fixes needed from our side, just wanted to give a heads up in case you were tracking the blast radius of the change.

@spiffxp
Copy link
Member Author

spiffxp commented Jul 12, 2021

@CecileRobertMichon

We should be good now and have a handle on the fixes needed from our side, just wanted to give a heads up in case you were tracking the blast radius of the change.

kubernetes/k8s.io#2318 is the tracking issue, if "our side" includes the other cluster-api projects I will hold off on PR'ing changes to those, I was going to proceed with kops next

@spiffxp
Copy link
Member Author

spiffxp commented Jul 12, 2021

FWIW I think the change that actually broke you was 10 days ago when we flipped dl.k8s.io to point to k8s-release-dev kubernetes/k8s.io#1857 (comment), I see there are bits of code that reference dl.k8s.io in your repo, e.g. https://github.com/kubernetes-sigs/cluster-api-provider-azure/blob/99748d4eadcbb98d7df50e9afa043048e5b1c7b0/scripts/ci-entrypoint.sh#L75

We just never noticed at that time because we happened to be lucky that the deprecated build job was keeping parity with the regular build job (which was my intent, seems like there's a ~90% parity between the two these days). I think kubernetes/kubernetes#103647 is what allowed you to notice that buckets had changed.

Apologies for the bump there.

@spiffxp
Copy link
Member Author

spiffxp commented Jul 12, 2021

Surveying for kops:

So it's not really easy to get a sense of "did everything break compared to before". I can maybe try throwing a query at the k8s-gubernator:builds dataset and see what comes back.

Having said that, I'm pretty sure I should expect to see a /kops/ subdir here in the new bucket and I don't

$ gsutil ls -l gs://kubernetes-release-dev/kops/ci/*/linux/amd64/kops | sort -k2
# ...
 163099887  2021-07-07T03:28:33Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+442e5eacbd/linux/amd64/kops
 163115795  2021-07-07T11:48:18Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+1a23c7dfef/linux/amd64/kops
 163115795  2021-07-07T16:10:30Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+b58caf6130/linux/amd64/kops
 163115795  2021-07-07T23:56:31Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+8aefbb3e29/linux/amd64/kops
 163139492  2021-07-08T12:29:14Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+53c7849d97/linux/amd64/kops
 163139492  2021-07-08T16:19:41Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+098a4a91ee/linux/amd64/kops
 163135412  2021-07-08T21:42:34Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+b358037896/linux/amd64/kops
 163135412  2021-07-09T12:12:50Z  gs://kubernetes-release-dev/kops/ci/1.22.0-alpha.2+e6ce40c8e4/linux/amd64/kops
$ gsutil ls -l gs://k8s-release-dev
                                 gs://k8s-release-dev/ci/

Searching around it seems like this job started failing: https://testgrid.k8s.io/kops-presubmits#kops-postsubmit

From the first failure:

>ResumableUploadAbortException: 403 pr-kubekins@kubernetes-jenkins-pull.iam.gserviceaccount.com does not have storage.objects.create access to k8s-release-dev/kops/ci/1.22.0-alpha.2+4fe31321db/darwin/amd64/kops.

Yeah... that makes sense. This job is not running on the community-owned prow build cluster, and so far only the community cluster has privileges to write to community-iwned GCS buckets. I really wish I could get a list of everything else that's trying and failing to write to this bucket.

The quick fix would be to give pr-kubekins write access. But pr-kubekins is used to run more than just kubernetes community jobs (ref: #12863), so we could see things land in there that might need to be purged.

The longer fix would be to migrate this job, and any job trying and failing to write to this bucket, to the community-owned cluster. This then bumps into the can of worms that right now, merge-blocking and release-blocking jobs don't actually have their own dedicated node-pools. They've been getting by on the fact that nothing else substantial is on the community-owned cluster.

I'm gonna go ahead and say the "quick fix" is the one to take here, and we can gameday or schedule access revocation to see what truly needs it.

@spiffxp
Copy link
Member Author

spiffxp commented Jul 13, 2021

Opened kubernetes/k8s.io#2333 for the quick fix approach

@spiffxp
Copy link
Member Author

spiffxp commented Jul 14, 2021

New problem for kops

Building synchronization state...
Starting synchronization...
Copying file:///home/prow/go/src/k8s.io/kops/.bazelbuild/upload/kops/1.22.0-alpha.2+963149791f/darwin/amd64/kops [Content-Type=application/octet-stream]...
ResumableUploadAbortException: 400 Cannot insert legacy ACL for an object when uniform bucket-level access is enabled. Read more at https://cloud.google.com/storage/docs/uniform-bucket-level-access

Which, yeah, I'm holding the line on that. It needs to be changed to work with a UBLA bucket.

@spiffxp
Copy link
Member Author

spiffxp commented Jul 15, 2021

kubernetes/kops#11994 should fix the kops job

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/config Issues or PRs related to code in /config area/images area/jobs area/kubetest area/release-eng Issues or PRs related to the Release Engineering subproject area/scenarios cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. sig/release Categorizes an issue or PR as relevant to SIG Release. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

10 participants