Need the following status alerts:
Node status (Up/Down)
Container status (Up/Down)
Gluster peer in cluster status (Connected/Disconnected)
Glusterd2 service status (Up/Down)
Cluster status
@shtripat How do we get these metrics?
@JohnStrunk I was wondering whether these metrics should come from an exporter in Anthill, since it maintains up-to-date Cluster and Node health for reconciliation.
I'm hesitant to get these items from Anthill. It will have its own view of each, but we then get a dependency... If the operator is down or malfunctioning, the alerts are potentially wrong.
I would expect many of these to come via data from gluster-prometheus or health checks on labeled pods. The benefit of using g-p is that as long as 1 gd2 pod is ready, the exporter should be available through the gd2 client service.
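To make the requested alerts concrete, here is a rough sketch of how status samples could be turned into alert messages. It is only an illustration: `StatusSample`, `firing_alerts`, and the sample names are hypothetical, not part of gluster-prometheus, Glusterd2, or Anthill, and how the samples are actually collected is left open. Cluster status is omitted because the issue does not say which states it can take.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StatusSample:
    """One health observation. All field names are hypothetical."""
    kind: str      # "node", "container", "peer", or "glusterd2"
    name: str      # identifier of the monitored object
    healthy: bool  # True for Up/Connected, False for Down/Disconnected


# Peers report Connected/Disconnected; everything else reports Up/Down.
UNHEALTHY_STATE = {"peer": "Disconnected"}


def firing_alerts(samples):
    """Return one alert message per unhealthy sample, in input order."""
    alerts = []
    for sample in samples:
        if not sample.healthy:
            state = UNHEALTHY_STATE.get(sample.kind, "Down")
            alerts.append(f"{sample.kind} {sample.name}: {state}")
    return alerts


if __name__ == "__main__":
    observed = [
        StatusSample("node", "node-1", True),
        StatusSample("peer", "peer-2", False),
        StatusSample("glusterd2", "gd2-0", False),
    ]
    for message in firing_alerts(observed):
        print(message)
```

Keeping the evaluation separate from the data source means the same check would work whether the samples come from an exporter or from pod health checks.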