
Cleanup usage of kubernetes-release-pull in kubernetes presubmits #18789

Closed
amwat opened this issue Aug 11, 2020 · 35 comments
Assignees
Labels
area/jobs
help wanted: Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.
kind/cleanup: Categorizes issue or PR as related to cleaning up code, process, or technical debt.
lifecycle/active: Indicates that an issue or PR is actively being worked on by a contributor.
lifecycle/frozen: Indicates that an issue or PR should not be auto-closed due to staleness.
priority/important-soon: Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
sig/k8s-infra: Categorizes an issue or PR as relevant to SIG K8s Infra.
sig/testing: Categorizes an issue or PR as relevant to SIG Testing.

Comments

@amwat
Contributor

amwat commented Aug 11, 2020

What should be cleaned up or changed:
We stage builds to gs://kubernetes-release-pull in almost every presubmit job, but from what I can tell nothing actually consumes those builds, since the jobs also use --extract=local.
Uploading the release tars in every presubmit is non-trivial overhead, so we should remove all the non-required usages.
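
For concreteness, the combination being flagged looks roughly like this in a presubmit's kubetest args (an illustrative sketch; the exact bucket suffix and surrounding flags vary per job):

    --stage=gs://kubernetes-release-pull/ci/pull-kubernetes-e2e-gce
    --extract=local

--stage uploads the freshly built release tars on every run, while --extract=local makes the job read the build from the local output directory, so the uploaded copy goes unused.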

Provide any links for context:
https://cs.k8s.io/?q=kubernetes-release-pull&i=nope&files=&repos=

case local:
	// kubetest's --extract=local path: read the staged release from the local
	// make output directory (_output/gcs-stage) rather than from GCS.
	url := util.K8s("kubernetes", "_output", "gcs-stage")
	files, err := ioutil.ReadDir(url)
	if err != nil {
		return err
	}
	var release string
	for _, file := range files {
		r := file.Name()
		if strings.HasPrefix(r, "v") {
			release = r
			break
		}
	}
	if len(release) == 0 {
		return fmt.Errorf("No releases found in %v", url)
	}
	return getKube(fmt.Sprintf("file://%s", url), release, extractSrc)

Random GCE provider job: https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-e2e-gce/1293275406807339008#1:build-log.txt%3A903

/cc @spiffxp @BenTheElder @MushuEE


EDIT(@spiffxp): I made a list of the offending jobs based on the criteria --extract=local and --stage=gs://kubernetes-release-pull/*

  • if the job triggers for a single branch it's labeled as job@branch
  • if the job triggers for all branches it's labeled as job
  • there are no presubmits that trigger for N branches (where all > N > 1)
  • there are no periodics or postsubmits that touch gs://kubernetes-release-pull
  • this picks up some --provider=aws jobs (kops), it remains to be seen whether they need --stage or not

EDIT(@BenTheElder): I removed the outdated checklist and instead I'm going to provide a search: https://github.com/search?q=repo%3Akubernetes%2Ftest-infra+%22--stage%3Dgs%3A%2F%2Fkubernetes-release-pull%22&type=code

@amwat amwat added the kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. label Aug 11, 2020
@BenTheElder
Member

we should test this in a canary just because this stuff is old and brittle and I can't remember why we were doing this anymore 🙃

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 26, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Dec 26, 2020
@BenTheElder
Member

still worth doing?

@spiffxp
Member

spiffxp commented Jan 8, 2021

/remove-lifecycle rotten
I think so. The other option is to continue as-is, meaning jobs that use this bucket need to switch to use k8s-release-pull as they migrate to k8s-infra.

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jan 8, 2021
@amwat
Contributor Author

amwat commented Jan 8, 2021

Sadly, it looks like those GCS links have been garbage-collected.
It seems that one of the steps of --stage is copying the artifacts from the bazel output path to the make output path (_output/gcs-stage) and then uploading them to GCS.

Our presubmit jobs are configured with --extract=local instead of --extract=bazel while using --build=bazel,
so they were relying on the artifacts being in the make output path.
https://github.com/kubernetes/test-infra/blob/master/config/jobs/kubernetes/sig-cloud-provider/gcp/gcp-gce.yaml#L48
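
Putting that together, the dependency chain for these jobs is roughly as follows (a sketch; flag order is illustrative and the bucket path is elided):

    --build=bazel                               # build with bazel
    --stage=gs://kubernetes-release-pull/...    # copies artifacts into _output/gcs-stage, then uploads them to GCS
    --extract=local                             # reads the release back from _output/gcs-stage

So --stage was doing double duty here: the GCS upload itself is unused, but the local copy into _output/gcs-stage is exactly what --extract=local depends on.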

testing out in the canary job: #20427

@spiffxp
Member

spiffxp commented Jan 21, 2021

/milestone v1.21
/sig testing
/wg-k8s-infra

@k8s-ci-robot k8s-ci-robot added the sig/testing Categorizes an issue or PR as relevant to SIG Testing. label Jan 21, 2021
@k8s-ci-robot k8s-ci-robot added this to the v1.21 milestone Jan 21, 2021
@amwat
Contributor Author

amwat commented Jan 21, 2021

We have a successful run at https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-e2e-gce-no-stage/1352340847076577280

Not sure why the total test duration is higher compared to
https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-e2e-gce/1351850610516824064

but we at least saved 154 seconds of stage time (which should be the only delta here)

https://storage.googleapis.com/kubernetes-jenkins/pr-logs/pull/92316/pull-kubernetes-e2e-gce-no-stage/1352340847076577280/artifacts/junit_runner.xml

as compared to

https://storage.googleapis.com/kubernetes-jenkins/pr-logs/pull/97894/pull-kubernetes-e2e-gce/1351850610516824064/artifacts/junit_runner.xml

and 1.84 GiB of unnecessary GCS uploads

$ gsutil du -sh gs://kubernetes-release-pull/ci/pull-kubernetes-e2e-gce/v1.18.16-rc.0.3+9f5c61d324a62b
1.84 GiB     gs://kubernetes-release-pull/ci/pull-kubernetes-e2e-gce/v1.18.16-rc.0.3+9f5c61d324a62b

@spiffxp spiffxp added this to Backlog (infra to migrate) in sig-k8s-infra Jan 22, 2021
@spiffxp
Member

spiffxp commented Jan 22, 2021

/priority important-soon

@k8s-ci-robot k8s-ci-robot added the priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. label Jan 22, 2021
@spiffxp
Member

spiffxp commented Jan 26, 2021

/assign @amwat @spiffxp
Assigning to us for now. If we think this is eligible for /help or don't have time to do it ourselves, we can write up how to proceed.

@jbpratt
Contributor

jbpratt commented Oct 23, 2022

Based on #22892 (comment) (and the changes being reverted), how should we proceed with this? I started working through this and realized I was re-doing @spiffxp's changes 😄

@SD-13
Contributor

SD-13 commented Dec 9, 2023

Hi, I am interested in working on this issue, but I have a few questions:

  1. It seems the list of jobs to be fixed is outdated.
  2. Please help me understand the fix we need to follow here.
    I don't think the fix @amwat mentioned here applies, because we are no longer using bazel to build.
    So, as discussed in config/jobs: run no-stage on k8s-infra, drop extract #24238 (comment), can we now remove extract and stage?
    Please feel free to correct me if I am wrong.

cc @spiffxp @BenTheElder @ameukam

@BenTheElder
Member

Sorry, a couple of the people you pinged don't work on this anymore and I'm kinda buried.

I've lost context on this one.

@BenTheElder
Member

I'm not sure we ever got no-stage working? It's hard to follow at this point.

@BenTheElder
Member

#28176 renamed the test job, testing in kubernetes/kubernetes#126563

@BenTheElder
Member

It does; it will stage to a generated bucket under the rented boskos project (which the boskos janitors should clean up, if they don't already), so we can carefully start dropping these I think ... very belatedly.

@BenTheElder
Member

Beginning bulk migration in #33259, starting with a subset of optional, non-blocking, not always_run jobs.

We have to drop both --extract=local and --stage at the same time. We don't need to locally extract what we just built; it's running fine and uploading to a bucket under the boskos project.
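
Concretely, the per-job change is to delete both of these args together (a sketch; the bucket suffix varies per job):

    --extract=local
    --stage=gs://kubernetes-release-pull/...

With both gone, the job still tests the build it just produced, and staging goes to a generated bucket under the boskos project instead of the shared kubernetes-release-pull bucket.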

You can see sample runs in kubernetes/kubernetes#126563

Inspect these logs:
https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/126563/pull-kubernetes-e2e-gce-cos-no-stage/1820909457454927872
https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/126563/pull-kubernetes-e2e-gce-pull-through-cache/1821254221408768000

@BenTheElder
Member

If anyone wants to help:

  • Break these up into easily reverted commits
  • Make sure to drop both extract=local and stage at the same time
  • Make sure the jobs you're touching are not required for merge, we'll do those last
  • You must agree to follow up to make sure you didn't break anything shortly after merging these, and definitely before doing any more. If you're not confident in or familiar with this part, I'd ask that you pick a different issue: we need to get this sorted out as part of migrating to the community infra in the immediate future, but we don't want to break CI, especially at this point in the release cycle, and I expect to be done before we reach the safer periods of the cycle.

NOTE: spiffxp and amwat don't work on Kubernetes anymore. I'm taking over this problem.

@BenTheElder
Member

#33278 does everything but the one remaining PR-blocking job, for which we'll wait a bit and check a few more things.

@BenTheElder
Member

Once we have test results we can do #33280, and then I'll delete the bucket

@BenTheElder
Member

This is done; I just need to follow up on eliminating that bucket.

@BenTheElder
Member

Done!

sig-k8s-infra automation moved this from Backlog (infra to migrate) to Done Aug 13, 2024
sig-testing issues automation moved this from Backlog to Done Aug 13, 2024