remove ci-cri-containerd tests failing continually over past 400 days #18569

Merged 1 commit into kubernetes:master on Aug 6, 2020


@tpepper (Member) commented Jul 31, 2020

This set of containerd tests has failed on 100% of runs for over 400
days. This means the tests have not run successfully, or had actionably
constructive monitoring, during the lifecycle of any Kubernetes release
currently supported by the community, making their results irrelevant to
the community today.

Signed-off-by: Tim Pepper <tpepper@vmware.com>

@k8s-ci-robot added the cncf-cla: yes, size/L, area/config, area/jobs, area/testgrid, sig/node, and sig/testing labels Jul 31, 2020
@k8s-ci-robot added the needs-rebase label Aug 1, 2020
@k8s-ci-robot removed the needs-rebase label Aug 3, 2020
@tpepper (Member, Author) commented Aug 4, 2020

Looks like @kubernetes/sig-node-pr-reviews don't consider an item until it has /lgtm?

@k8s-ci-robot (Contributor) commented

@tpepper: you cannot LGTM your own PR.


@alejandrox1 (Contributor) left a comment

I checked on these jobs in https://storage.googleapis.com/k8s-gubernator/triage/index.html?date=2020-08-04&pr=1&job=ci-cri-containerd-e2e-gce-stackdriver
and, for the most part, they have been failing for years.
We should delete them.

/lgtm

@k8s-ci-robot added the lgtm label Aug 4, 2020
@MHBauer (Contributor) commented Aug 4, 2020

Would be nice to have a summary of what's deleted in the commit message.
deleting jobs:
name: ci-cri-containerd-e2e-gce-stackdriver
name: ci-cri-containerd-e2e-gci-gce-es-logging
name: ci-cri-containerd-e2e-gci-gce-sd-logging
name: ci-cri-containerd-e2e-gci-gce-sd-logging-k8s-resources
name: ci-cri-containerd-e2e-gci-gce-statefulset
name: ci-cri-containerd-node-e2e-flaky
name: ci-cri-containerd-node-e2e-serial

The only one I might be concerned about is ci-cri-containerd-node-e2e-flaky which may be expected to fail given the name.

Otherwise, if they've been failing for over a year, I think this is reasonable. We would have to reverse engineer each of them individually to find the original intent.
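
A summary like the list above could be generated from the PR diff rather than written by hand. The following is only a minimal sketch: the function name `removed_job_names`, the `ci-` prefix filter, and the sample diff (including its `interval` values and job layout) are illustrative assumptions, not taken from this PR's actual diff.

```python
import re

# Matches a removed diff line that carries a YAML "name:" key, e.g.
# "-- name: foo" or "-    - name: foo". The lookahead skips "---" file headers.
REMOVED_NAME = re.compile(r"^-(?!--)\s*(?:-\s+)?name:\s*(\S+)\s*$")


def removed_job_names(diff_text, prefix="ci-"):
    """Return names on removed lines of a unified diff, in order of appearance.

    The prefix filter is an assumption used to skip other "name:" keys
    (containers, env vars) that also appear in job configs.
    """
    names = []
    for line in diff_text.splitlines():
        match = REMOVED_NAME.match(line)
        if match and match.group(1).startswith(prefix):
            if match.group(1) not in names:
                names.append(match.group(1))
    return names


# Hypothetical diff fragment for illustration only.
SAMPLE_DIFF = """\
--- a/config/jobs/kubernetes/sig-node/containerd.yaml
+++ b/config/jobs/kubernetes/sig-node/containerd.yaml
@@ -1,9 +1,2 @@
 periodics:
-- name: ci-cri-containerd-node-e2e-flaky
-  interval: 1h
-  spec:
-    containers:
-    - name: test
-- name: ci-cri-containerd-node-e2e-serial
-  interval: 4h
 - name: ci-some-other-job
"""

if __name__ == "__main__":
    print("deleting jobs:")
    for name in removed_job_names(SAMPLE_DIFF):
        print(f"  name: {name}")
```

The output could then be pasted into the commit message body. A YAML-aware comparison of the old and new files would be more robust than line matching, but needs a third-party parser.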

@alejandrox1 (Contributor) commented

For this set of jobs, it seems like it may be more useful to start from scratch: audit the e2e jobs that were being executed by these jobs, see what's useful and what's not, and build new jobs (#18570).

/cc @spiffxp
ptal 🙏

@spiffxp (Member) commented Aug 4, 2020

/hold
/assign @vpickard @SergeyKanzhelev
I'd like to hear from you two, since you are the proposed leads of the SIG Node CI subproject that was presented at last week's SIG Node meeting.

@k8s-ci-robot added the do-not-merge/hold label Aug 4, 2020
@vpickard (Contributor) commented Aug 6, 2020

I think it makes sense to go ahead and delete these failing jobs. I looked at the test results board (https://storage.googleapis.com/k8s-gubernator/triage/index.html?date=2020-08-04&pr=1&job=ci-cri-containerd-e2e-gce-stackdriver), which is new to me. I'm not quite sure how to see the history of this job beyond the default layout, which covers about 19 days. Any pointers? This looks like a good tool, alongside testgrid, for getting a broader view of how a test is performing.

Given that there is also an open issue to come up with a test plan for containerd tests, I would expect to have these tests back online in a reasonable period of time. We just need to be sure not to let that issue linger too long, perhaps by adjusting the priority to high and adding it to the project tracking dashboard. I will do that now.

@vpickard (Contributor) commented Aug 6, 2020

/lgtm

@vpickard (Contributor) commented Aug 6, 2020

/area test

@k8s-ci-robot (Contributor) commented

@vpickard: The label(s) area/test cannot be applied, because the repository doesn't have them


@dims (Member) commented Aug 6, 2020

/approve
/lgtm

@vpickard please feel free to remove hold

@k8s-ci-robot (Contributor) commented

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dims, tpepper


@k8s-ci-robot added the approved label Aug 6, 2020
@vpickard (Contributor) commented Aug 6, 2020

/hold cancel

@k8s-ci-robot removed the do-not-merge/hold label Aug 6, 2020
@k8s-ci-robot merged commit 87e945b into kubernetes:master Aug 6, 2020
@k8s-ci-robot added this to the v1.19 milestone Aug 6, 2020
@k8s-ci-robot (Contributor) commented

@tpepper: Updated the job-config configmap in namespace default at cluster default using the following files:

  • key containerd.yaml using file config/jobs/kubernetes/sig-node/containerd.yaml


Labels: approved, area/config, area/jobs, area/testgrid, cncf-cla: yes, lgtm, sig/node, sig/testing, size/L