Report `tracing_unbounded` channel size to prometheus #1489

dmitry-markin · 2023-09-11T09:35:36Z

Fixes issue #611 by introducing a metric for the channel sizes.

Introduce the new metric substrate_unbounded_channel_size labeled by the channel name that contains the size of the channel (the number of messages in transit).

sandreim · 2023-09-11T09:54:22Z

substrate/client/utils/src/metrics.rs

 }

+pub static SENT_LABEL: &'static str = "send";


Suggested change

pub static SENT_LABEL: &'static str = "send";

pub static SENT_LABEL: &'static str = "sent";

The original label name was "send", and I'm not sure if it's a good idea to break the compatibility — there are probably a lot of automation scripts using this.

vstakhov · 2023-09-11T13:45:22Z

substrate/client/utils/src/metrics.rs

+		),
+		&["entity", "action"], // name of channel, send|received|dropped
+	).expect("Creating of statics doesn't fail. qed");
+	pub static ref UNBOUNDED_CHANNELS_SIZE: GenericGaugeVec<AtomicU64> = GenericGaugeVec::new(


I'm not quite sure that a single gauge will provide enough data here. Since a gauge is a simple counter cell, Prometheus will get just a single value on the next collection. However, we will not track any spikes of this value between the collection intervals. I would suggest to use histogram here, as it includes both Gauge functionality and buckets that will allow to track anomalities and peaks easily. On the other hand, it is more expensive.

I'll go with a simple gauge here, because it's hard to estimate the performance impact of using a histogram, considering we use channels extensively. So far this is merely a fix for the instant channel size calculation, which was done on the CI side previously using metrics not intended for this. But thanks for the suggestion anyway.

#1568) # Description Follow up for #1489. Closes #611 Before we calculated the channel size during alert expression but in #1489 a new metric was introduced that reports channel size. ## Changes: 1. updated alert rule to use new metric.

paritytech#1568) # Description Follow up for paritytech#1489. Closes paritytech#611 Before we calculated the channel size during alert expression but in paritytech#1489 a new metric was introduced that reports channel size. ## Changes: 1. updated alert rule to use new metric.

Report tracing_unbounded channel size to prometheus

c9a93f1

dmitry-markin added the T0-node This PR/Issue is related to the topic “node”. label Sep 11, 2023

dmitry-markin requested a review from altonen September 11, 2023 09:35

dmitry-markin added 2 commits September 11, 2023 12:40

minor: rustfmt

54a1220

Fix compilation

daf4e23

sandreim approved these changes Sep 11, 2023

View reviewed changes

altonen approved these changes Sep 11, 2023

View reviewed changes

vstakhov reviewed Sep 11, 2023

View reviewed changes

lexnv approved these changes Sep 11, 2023

View reviewed changes

dmitry-markin merged commit f5ca403 into master Sep 12, 2023

dmitry-markin deleted the dm-report-channel-size-to-prometheus branch September 12, 2023 11:38

dmitry-markin mentioned this pull request Sep 12, 2023

Channel mpsc_import_notification_stream #611

Closed

BulatSaif mentioned this pull request Sep 14, 2023

Update the alerts to use a new metric substrate_unbounded_channel_size #1568

Merged

kiltbot mentioned this pull request Oct 19, 2023

[AUTOMATIC] Update Polkadot dependencies from 1.1.0 to 1.2.0 KILTprotocol/kilt-node#571

Closed

ahmadkaouk mentioned this pull request Nov 16, 2023

Update polkadot-sdk from v.1.1.0 to v1.3.0 moonbeam-foundation/moonbeam#2565

Closed

bgallois pushed a commit to duniter/duniter-polkadot-sdk that referenced this pull request Mar 25, 2024

Report tracing_unbounded channel size to prometheus (paritytech#1489)

12ff34d

bkchr pushed a commit that referenced this pull request Apr 10, 2024

backport named events PR (#1489)

4929493

This was referenced Jun 5, 2024

Update polkadot-sdk from v1.7.0 to v1.11.0 moondance-labs/tanssi#573

Closed

Update polkadot-sdk from v1.10.0 to v1.11.0 moondance-labs/tanssi#577

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report `tracing_unbounded` channel size to prometheus #1489

Report `tracing_unbounded` channel size to prometheus #1489

dmitry-markin commented Sep 11, 2023 •

edited

Loading

sandreim Sep 11, 2023

dmitry-markin Sep 11, 2023 •

edited

Loading

vstakhov Sep 11, 2023

dmitry-markin Sep 11, 2023

	pub static SENT_LABEL: &'static str = "send";
	pub static SENT_LABEL: &'static str = "sent";

Report tracing_unbounded channel size to prometheus #1489

Report tracing_unbounded channel size to prometheus #1489

Conversation

dmitry-markin commented Sep 11, 2023 • edited Loading

sandreim Sep 11, 2023

Choose a reason for hiding this comment

dmitry-markin Sep 11, 2023 • edited Loading

Choose a reason for hiding this comment

vstakhov Sep 11, 2023

Choose a reason for hiding this comment

dmitry-markin Sep 11, 2023

Choose a reason for hiding this comment

Report `tracing_unbounded` channel size to prometheus #1489

Report `tracing_unbounded` channel size to prometheus #1489

dmitry-markin commented Sep 11, 2023 •

edited

Loading

dmitry-markin Sep 11, 2023 •

edited

Loading