Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added
rpk cluster health
command #4295Added
rpk cluster health
command #4295Changes from all commits
a7ba00d
6f64a1b
c3dcd9c
8766ee3
62432ac
e43a5a2
0319db5
7f864d0
4ddb6b2
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is the idea here that any node can serve a health report, and that it will be refreshed if it is stale? I wonder if we should always retrieve health reports from the controller?
I see that
ec
is used to determineis_healthly
. Presumably it said the cluster isn't healthy if we couldn't refresh?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
exactly, i assumed that if we can not refresh cluster is in bad condition.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hmm I'm not very familiar with this code so please tell me if I have some wrong assumptions.
but I think if the cluster was healthy, but we couldn't refresh, then we may have
cluster_health_overview
with no nodes down, all nodes have leaders, butis_healthy
is false, which may be confusing.Maybe we should add a field like
obsolete cluster state
to thecluster_health_overview
?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also this comment doesn't mention that cluster is not healthy if we couldn't refresh health information (does that mean that some nodes are not responding?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is implicitly covered in
all cluster nodes are responding
, I've changed the comment to fix error in the fist item