Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cinder-CSI Plugin reports 0 healthy nodes/controllers after node decommissioning #10073

Closed
ksitnik-tc opened this issue Feb 23, 2021 · 4 comments
Assignees
Labels
stage/accepted Confirmed, and intend to work on. No timeline committment though. theme/storage type/bug
Milestone

Comments

@ksitnik-tc
Copy link

ksitnik-tc commented Feb 23, 2021

Nomad version

Nomad v1.0.2

Issue

We run our client nodes with the cinder-csi plugin for persistent storage.

When we decommission client nodes, Nomad reports plugin health status of 0.

For example, prior to stopping the clients,

nomad plugin status
Container Storage Interface
ID        Provider                  Controllers Healthy/Expected  Nodes Healthy/Expected
dev  cinder.csi.openstack.org  6/6                           6/6

After removing the old clients,

nomad plugin status
Container Storage Interface
ID        Provider                  Controllers Healthy/Expected  Nodes Healthy/Expected
dev  cinder.csi.openstack.org  4/0                           4/0

Re-applying the cinder-csi plugin job does not remedy this issue.

@ksitnik-tc ksitnik-tc changed the title Cinder-CSI Plugin reports 0 healthy nodes/controllers after partial node decommissioning Cinder-CSI Plugin reports 0 healthy nodes/controllers after node decommissioning Feb 23, 2021
@apollo13
Copy link
Contributor

Probably related/similar cause to #9810

@shoenig shoenig added this to Needs Triage in Nomad - Community Issues Triage via automation Feb 24, 2021
@shoenig shoenig moved this from Needs Triage to Needs Roadmapping in Nomad - Community Issues Triage Feb 24, 2021
@shoenig shoenig added stage/accepted Confirmed, and intend to work on. No timeline committment though. theme/storage type/bug labels Feb 24, 2021
@tgross tgross removed this from Needs Roadmapping in Nomad - Community Issues Triage Mar 4, 2021
@tgross
Copy link
Member

tgross commented Feb 3, 2022

In #11758 (comment) I've noted what's probably the cause of and fix for this issue:

The plugin counts don't accurately match the state of allocations. I'm working up a patch the resolves plugin counts in a similar fashion to what we did for volume claims in #11890

@tgross
Copy link
Member

tgross commented Feb 8, 2022

I've just opened #12027 which should close out this issue.

@github-actions
Copy link

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Oct 11, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
stage/accepted Confirmed, and intend to work on. No timeline committment though. theme/storage type/bug
Projects
None yet
Development

No branches or pull requests

4 participants