Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Develop a prow migration plan #752

Open
spiffxp opened this issue Apr 15, 2020 · 21 comments
Open

Develop a prow migration plan #752

spiffxp opened this issue Apr 15, 2020 · 21 comments
Assignees
Labels
area/infra Infrastructure management, infrastructure design, code in infra/ priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra. sig/testing Categorizes an issue or PR as relevant to SIG Testing.
Milestone

Comments

@spiffxp
Copy link
Member

spiffxp commented Apr 15, 2020

Break out into issues when we've gotten consensus

Current sketch is:

  • decide whether we want a build cluster, and/or how many we we want
  • allow prow.k8s.io to schedule jobs on our build cluster(s)
  • setup a prow instance (in aaa?)
  • figure out how we want to migrate/cut-over github traffic

Some concerns include:

  • jobs that run in same cluster as prow.k8s.io to take advantage of the ghproxy instance that lives there
  • using a new github bot user/token so as not to conflict with k8s-ci-robot while prow.k8s.io continues to use it
  • setting up boskos in our build cluster(s)
  • setting up greenhouse in our build cluster(s)
  • setting up projects for boskos to manage
  • migration of google-owned buckets like kubernetes-jenkins
  • on-call
@spiffxp
Copy link
Member Author

spiffxp commented Apr 15, 2020

/wg k8s-infra
/sig testing
/area cluster-infra
/assign

@k8s-ci-robot k8s-ci-robot added wg/k8s-infra sig/testing Categorizes an issue or PR as relevant to SIG Testing. area/infra Infrastructure management, infrastructure design, code in infra/ labels Apr 15, 2020
@spiffxp spiffxp added this to In Progress in sig-k8s-infra Apr 27, 2020
@spiffxp
Copy link
Member Author

spiffxp commented Apr 27, 2020

I prototyped a single build cluster here #806

I have it hooked up to my personal prow instance here: https://prow.bashfire.dev

@spiffxp
Copy link
Member Author

spiffxp commented Apr 28, 2020

Trying to work through what the plan looks like, it's unlikely we'll get to discussing the actual "migrate prow.k8s.io" part in much detail this week.

https://docs.google.com/document/d/1g7RBDhbTAH1VBMl2kJZR71OfzM6cJF277zbysd2AoCQ/edit#

@spiffxp
Copy link
Member Author

spiffxp commented Apr 29, 2020

Discussed during today's meeting, came away with:

  • We'll do two build clusters, each in their own project (trusted, untrusted)
  • Untrusted build cluster will use same machine type as used by prow.k8s.io's existing build cluster (n1-highmem-8)
  • Will use regional clusters
  • Will start with e2e project pool that is not sig/job-specific
    • Adjust when costs get to unknown/unexplained size
    • Consider per-sig project pools
  • Naming Is Hard:
    • k8s-infra-prow-build - untrusted build cluster (and project, serviceaccount, etc)
    • k8s-infra-prow-build-trusted - trusted build cluster (and project, serviceaccount, etc)
    • k8s-infra-e2e-{suffix} - e2e projects intended to be pinned to a job
    • k8s-infra-e2e-boskos-{nnn} - e2e projects intended to be managed by boskos
  • What I'm aiming for in two weeks:
    • clusters hooked up to prow.k8s.io
    • confirmed migration of:
      • ci-kubernetes-node-kubelet
      • ci-kubernetes-e2e-gci-gce
      • (an image pushing job?)
    • dev/test cycle for migrating jobs

@spiffxp
Copy link
Member Author

spiffxp commented May 6, 2020

#830 implements all of the above except dev/test cycle

@spiffxp
Copy link
Member Author

spiffxp commented May 6, 2020

Opened up some more issues to represent followup work:

@MHBauer
Copy link

MHBauer commented May 27, 2020

Learning about sig-node ci, can I help with or shadow along changes to ci-kubernetes-node-kubelet?

@spiffxp
Copy link
Member Author

spiffxp commented Jun 12, 2020

@MHBauer add yourself to the k8s-infra-prow-viewers@kubernetes.io group in groups/groups.yaml and let me know if you can see https://console.cloud.google.com/monitoring/dashboards?project=k8s-infra-prow-build

@helenfeng737
Copy link
Contributor

I can help with the migration work too if needed. Already added myself in k8s-infra-prow-viewers

@spiffxp
Copy link
Member Author

spiffxp commented Oct 28, 2020

About ready to enumerate:

  • setting up a staging prow instance to experiment with running in community infra
  • how to shift traffic from prow.k8s.io

@spiffxp
Copy link
Member Author

spiffxp commented Jan 20, 2021

/assign @ameukam
since also working on staging prow (ref: #1394)

@spiffxp
Copy link
Member Author

spiffxp commented Jan 22, 2021

/priority important-longterm
I don't know how much time I'll have to work on completing a plan by end of v1.21

@spiffxp spiffxp moved this from In Progress to Blocked in sig-k8s-infra Feb 19, 2021
@fejta-bot
Copy link

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 22, 2021
@ameukam
Copy link
Member

ameukam commented Apr 23, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 23, 2021
@spiffxp
Copy link
Member Author

spiffxp commented Jun 9, 2021

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jun 9, 2021
@spiffxp
Copy link
Member Author

spiffxp commented Jul 16, 2021

/milestone v1.23

@k8s-ci-robot k8s-ci-robot added this to the v1.23 milestone Jul 16, 2021
@spiffxp spiffxp added this to Backlog in sig-testing issues Jul 27, 2021
@spiffxp
Copy link
Member Author

spiffxp commented Sep 15, 2021

/remove-priority important-longterm
/priority important-soon

@k8s-ci-robot k8s-ci-robot added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. and removed priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. labels Sep 15, 2021
@spiffxp
Copy link
Member Author

spiffxp commented Sep 29, 2021

The proposal linked in this issue likely needs to be refreshed, but dropping a few thoughts before I forget.

Blockers to migrating all jobs over to a community-run build cluster include:

Blockers to shutting down prow.k8s.io and then standing it all up over in kubernetes-public include:

  • all of the existing kubernetes/test-infra tooling / manifests assume deployment to its own cluster vs. a shared cluster
  • allowing a community-owned prow to trigger builds in the google.com-owned "default" build cluster (if we fail to migrate all jobs over
  • allowing a community-owned prow to write/update gs://k8s-testgrid
  • allowing a community-owned prow to write/update gs://kubernetes-jenkins
  • allowing a community-owned prow to write to the google.com-owned k8s-prow project
  • prow.k8s.io currently runs jobs for more than the kubernetes community (Migrate non-Kubernetes repos off of prow.k8s.io test-infra#12863)
  • (not sure what else I'm missing here...)

@k8s-ci-robot k8s-ci-robot added sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra. and removed wg/k8s-infra labels Sep 29, 2021
@ameukam
Copy link
Member

ameukam commented Dec 6, 2021

/milestone v1.24

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.23, v1.24 Dec 6, 2021
@ameukam ameukam removed this from the v1.24 milestone Mar 22, 2022
@ameukam
Copy link
Member

ameukam commented Mar 3, 2024

/milestone v1.30
/lifecycle clear

Rebooted this effort. See: https://docs.google.com/document/d/1erBhuCwY26d0UfPbzt8lEj6bYT2hOUKzc2j36YHVqfM/edit?usp=sharing

@k8s-ci-robot k8s-ci-robot added this to the v1.30 milestone Mar 3, 2024
@ameukam ameukam removed the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Mar 6, 2024
@ameukam
Copy link
Member

ameukam commented Apr 19, 2024

/milestone v1.31

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.30, v1.31 Apr 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/infra Infrastructure management, infrastructure design, code in infra/ priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra. sig/testing Categorizes an issue or PR as relevant to SIG Testing.
Projects
Status: In Progress
sig-k8s-infra
  
Blocked
Development

No branches or pull requests

6 participants