bugfix: get capacity grpc request should have timeout #688

bai3shuo4 · 2021-12-10T09:21:10Z

/kind bug

What this PR does / why we need it:
If the CSI driver on one node fails, or does not set any timeout for gRPC, the capacity sync gets stuck on the GetCapacity call. If only one thread is used to process CSIStorageCapacity objects, it stays stuck forever. Provisioning and deleting a PV already have a timeout; GetCapacity should have one as well.
Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

Add a timeout to GetCapacity requests so that capacity syncing cannot hang forever.

k8s-ci-robot · 2021-12-10T09:21:18Z

Hi @bai3shuo4. Thanks for your PR.

I'm waiting for a kubernetes-csi member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

pohly · 2021-12-10T10:26:28Z

cmd/csi-provisioner/csi-provisioner.go

@@ -475,6 +475,7 @@ func main() {
 			factoryForNamespace.Storage().V1beta1().CSIStorageCapacities(),
 			*capacityPollInterval,
 			*capacityImmediateBinding,
+			*operationTimeout,


Setting a timeout makes sense; I just wonder whether we should use operationTimeout for it. It's currently defined as

operationTimeout = flag.Duration("timeout", 10*time.Second, "Timeout for waiting for creation or deletion of a volume")

Perhaps change the description to "Timeout for volume operations (creation, deletion, capacity queries)"?

bai3shuo4 · 2021-12-11

Creation and deletion both use the same operationTimeout. I think we should keep this pattern; we don't need to define additional timeouts unnecessarily.

pohly · 2021-12-12

I agree, but then the description should be changed to include the new usage. Querying capacity is not "creation or deletion of a volume" (current description).

bai3shuo4 · 2021-12-13

Oh! I missed that. I will add it.

bai3shuo4 · 2021-12-13

@pohly done~

pohly

/lgtm

bai3shuo4 · 2021-12-13T12:51:24Z

/lgtm

k8s-ci-robot · 2021-12-13T12:51:25Z

@bai3shuo4: you cannot LGTM your own PR.

In response to this:

/lgtm


bai3shuo4 · 2021-12-13T12:54:23Z

@pohly Sorry, I fixed the unit test problem. Why are all the checks stalled here? How can I recover them?

bai3shuo4 · 2021-12-13T12:56:30Z

/retest

k8s-ci-robot · 2021-12-13T12:56:43Z

@bai3shuo4: Cannot trigger testing until a trusted user reviews the PR and leaves an /ok-to-test message.

In response to this:

/retest


pohly · 2021-12-13T13:23:19Z

/ok-to-test

xing-yang · 2021-12-13T20:31:41Z

Can you add a release note?

bai3shuo4 · 2021-12-14T03:13:56Z

/retest

bai3shuo4 · 2021-12-14T03:39:39Z

Can you add a release note?

@xing-yang Done

bai3shuo4 · 2021-12-14T03:41:56Z

@pohly Hi, I think it's ok to merge :)

pohly · 2021-12-14T06:52:41Z

/lgtm
/assign @xing-yang

For approval.

xing-yang · 2021-12-14T21:15:36Z

/lgtm
/approve

k8s-ci-robot · 2021-12-14T21:15:51Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: bai3shuo4, pohly, xing-yang

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [xing-yang]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added kind/bug Categorizes issue or PR as related to a bug. do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Dec 10, 2021

k8s-ci-robot added needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Dec 10, 2021

k8s-ci-robot requested review from Jiawei0227 and pohly December 10, 2021 09:21

pohly reviewed Dec 10, 2021


bai3shuo4 force-pushed the bugfix/sync-capacity-timeout branch from 332fa4b to e0bdab2 Compare December 13, 2021 03:13

k8s-ci-robot added size/S Denotes a PR that changes 10-29 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Dec 13, 2021

bai3shuo4 requested a review from pohly December 13, 2021 08:04

pohly approved these changes Dec 13, 2021


k8s-ci-robot assigned pohly Dec 13, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 13, 2021

bai3shuo4 force-pushed the bugfix/sync-capacity-timeout branch from e0bdab2 to 83eb8c7 Compare December 13, 2021 11:42

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 13, 2021

bai3shuo4 closed this Dec 13, 2021

bai3shuo4 reopened this Dec 13, 2021

bai3shuo4 force-pushed the bugfix/sync-capacity-timeout branch from 83eb8c7 to 95d34c0 Compare December 13, 2021 12:52

bai3shuo4 requested a review from pohly December 13, 2021 12:55

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Dec 13, 2021

pohly mentioned this pull request Dec 13, 2021

Add changelog for v3.1.0 #686

Merged

bugfix: get capacity grpc request should have timeout

dcae33f

bai3shuo4 force-pushed the bugfix/sync-capacity-timeout branch from 95d34c0 to dcae33f Compare December 14, 2021 03:12

k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. labels Dec 14, 2021

k8s-ci-robot assigned xing-yang Dec 14, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 14, 2021

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 14, 2021

k8s-ci-robot merged commit 0b71727 into kubernetes-csi:master Dec 14, 2021
