
Remove dynamic metric names #226

Closed
dapplion opened this issue Nov 11, 2022 · 11 comments
Labels
P1 High: Likely tackled by core team if no one steps up

Comments

@dapplion
Contributor

@achingbrain Using a variable metric name is a really bad idea; it will break dashboards. Metric names should be constant and contain only _ as a special character.

status: context.metrics.registerMetric(`libp2p_tcp_${addr}_server_status`, {

I don't really see the point of splitting metrics for different ports. Once someone requests that feature we can add a label on the metric per address. But please do not do this now; adding extra labels adds cost to metrics at no one's request.

Originally posted by @dapplion in #223 (comment)
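For illustration, a minimal prom-client sketch of the two approaches under discussion; the metric and label names here are illustrative, not the transport's actual code:

import { Gauge } from 'prom-client'

// Today: the sanitized listen address is baked into the metric name, so every
// address/port combination produces a differently named time series.
const addr = '0_0_0_0_4002' // hypothetical sanitized listen address
const perAddress = new Gauge({
  name: `libp2p_tcp_${addr}_server_status`,
  help: 'TCP server status'
})
perAddress.set(1)

// Requested: one constant metric name; if per-address breakdowns are ever
// needed, expose the address as a label instead of changing the name.
const constantName = new Gauge({
  name: 'libp2p_tcp_server_status',
  help: 'TCP server status',
  labelNames: ['address'] // hypothetical label name
})
constantName.set({ address: '0.0.0.0:4002' }, 1)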

@p-shahi moved this to Weekly Candidates in js-libp2p on Nov 15, 2022
@mpetrunic added the "P0 Critical: Tackled by core team ASAP" and "P1 High: Likely tackled by core team if no one steps up" labels and removed the "P0 Critical: Tackled by core team ASAP" label on Nov 15, 2022
@achingbrain
Member

It's possible to configure multiple TCP listeners for a libp2p node; the address is in the metric name to disambiguate between them.

If you don't want this, you can supply an implementation of the Metrics interface that strips them out. Individual metrics are set up once at transport creation time so the overhead is minimal.
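As a rough sketch of that suggestion, assuming only the registerMetric(name, opts) call shown in the snippet above (the real Metrics interface has more methods a wrapper would also need to forward):

// Hypothetical wrapper: delegates to an existing metrics implementation but
// rewrites the dynamic address segment back to a constant name first.
interface MinimalMetrics {
  registerMetric: (name: string, opts?: unknown) => unknown
}

function stripAddressFromNames (inner: MinimalMetrics): MinimalMetrics {
  return {
    registerMetric (name, opts) {
      // e.g. 'libp2p_tcp_0_0_0_0_4002_server_status' -> 'libp2p_tcp_server_status'
      // (illustrative regex; real listen addresses may be formatted differently)
      const constantName = name.replace(/^libp2p_tcp_(?:\d+_)+/, 'libp2p_tcp_')
      return inner.registerMetric(constantName, opts)
    }
  }
}

// Note: with two listeners the stripped names collide and the underlying
// registry rejects the duplicate, which is exactly the problem discussed below.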

@wemeetagain
Member

Can we add the address as another label for these metrics?

@achingbrain
Member

achingbrain commented Nov 17, 2022

I don't think so as the metric names have to be unique.

If I start a node with two TCP listen addresses and use the addresses as labels, I get errors like "Error: A metric with the name libp2p_tcp_connections_count has already been registered."
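A minimal prom-client reproduction of that error, assuming the default shared registry (the metric name is taken from the message above):

import { Gauge } from 'prom-client'

// Two listeners each creating "their own" gauge under the same constant name:
// the second registration against the shared default registry throws.
new Gauge({ name: 'libp2p_tcp_connections_count', help: 'TCP connection count' })
new Gauge({ name: 'libp2p_tcp_connections_count', help: 'TCP connection count' })
// -> Error: A metric with the name libp2p_tcp_connections_count has already been registered.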

@dapplion
Contributor Author

@achingbrain Then let consumers add a metrics suffix and prefix that they choose. Again, current master is just unusable and terrible for actual deployments.

@achingbrain
Member

> Then let consumers add a metrics suffix and prefix that they choose

This makes it impossible to comply with the Prometheus naming conventions:

> A metric name...
> ...should have a (single-word) application prefix relevant to the domain the metric belongs to.
> ...should have a suffix describing the unit, in plural form.

So prefixes and suffixes are out; it has to be something in the middle of the metric name.

> Again, current master is just unusable and terrible for actual deployments

What is the actual problem this causes? The only change here is that it used to be "libp2p_tcp_server_status_info" and now it's "libp2p_tcp_0_0_0_0_4002_server_status_info" (or similar).

The metric name is stable and predictable unless you specify 0 as a TCP port.

Could you please go into a bit more detail about how this is unusable?

@dapplion
Contributor Author

> Could you please go into a bit more detail about how this is unusable?

The dashboards we commit and distribute would become deployment-specific down to the port level. That's awful. Change the port of one of your deployed nodes and the dashboards are bricked.

If you want to comply with the prefix/suffix rule, then allow consumers to inject some word in the middle:

function getMetricName (opt: { metricKeyword?: string }): string {
  if (opt.metricKeyword) {
    return `libp2p_tcp_${opt.metricKeyword}_server_status_info`
  } else {
    return 'libp2p_tcp_server_status_info'
  }
}

There is no reason to force us into dealing with libp2p_tcp_0_0_0_0_4002_server_status_info without consent. Let consumers make that decision.

@achingbrain
Member

> Can we add the address as another label for these metrics?

> I don't think so as the metric names have to be unique.

Thinking about this a bit more: if we refactor @libp2p/prometheus-metrics to globally cache created metrics (prom-client metrics are global anyway, so this isn't as crazy as it sounds) and re-use them if the same metric is created again, then we can track transport metrics as metric groups and use the socket address as the key for each metric, which seems to work well.


It gets a little complicated: for calculated metrics we'll need to turn multiple values into a single value, which may be surprising to users looking at the final metrics, but hopefully it will be obvious from the metrics that something isn't right.
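A rough sketch of that idea with prom-client; the gauge name, the address label and the printed output are illustrative assumptions, not necessarily what the eventual fix ships:

import { Gauge, register } from 'prom-client'

// One globally cached gauge per metric name; each listener only contributes a
// label value keyed on its socket address.
const serverStatus = new Gauge({
  name: 'libp2p_tcp_server_status_info',
  help: 'Whether the TCP server is listening',
  labelNames: ['address']
})

serverStatus.set({ address: '0.0.0.0:4002' }, 1)
serverStatus.set({ address: '0.0.0.0:4003' }, 1)

// The scrape output keeps one stable metric name across deployments:
//   libp2p_tcp_server_status_info{address="0.0.0.0:4002"} 1
//   libp2p_tcp_server_status_info{address="0.0.0.0:4003"} 1
register.metrics().then(console.log) // async in prom-client >= 13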

@dapplion
Contributor Author

So we'll get rid of libp2p_tcp_0_0_0_0_4002_server_status_info?

@achingbrain
Member

Yes, the address is in a label rather than the metric name

@dapplion
Contributor Author

> Yes, the address is in a label rather than the metric name

Can you do a PR?

@achingbrain
Member

It's here: #230

Repository owner moved this from Weekly Candidates/Discuss to Done in js-libp2p Nov 22, 2022