Always print connectivity report #3677

alexggh · 2024-03-13T13:23:37Z

This is printed every 10 minutes, I see no reason why it shouldn't be in all the logs, it would give us valuable information about what is going on with node connectivity when validators come-back to us to report issues.

This is printed every 10 minutes, I see no reason why it shouldn't be in all the logs, it would give us valuable information about what is going on with node connectivity when validators come-back to us to report issues. Signed-off-by: Alexandru Gheorghe <alexandru.gheorghe@parity.io>

sandreim · 2024-03-13T13:32:37Z

This can get pretty big if there are many unconnected authorities. Are we interested in exactly these? Because otherwise the metrics are always available per protocol - polkadot_parachain_peer_count

eskimor · 2024-03-13T14:15:23Z

This can get pretty big if there are many unconnected authorities. Are we interested in exactly these? Because otherwise the metrics are always available per protocol - polkadot_parachain_peer_count

Right, but then there is also some network issue which would justify a bit of verbosity every 10 minutes. If we assume that these logs would be 1Meg in the worst case, then it would still take us 7 days to fill a GB. Sounds sensible for today's hardware.

sandreim · 2024-03-13T14:26:20Z

This can get pretty big if there are many unconnected authorities. Are we interested in exactly these? Because otherwise the metrics are always available per protocol - polkadot_parachain_peer_count

Right, but then there is also some network issue which would justify a bit of verbosity every 10 minutes. If we assume that these logs would be 1Meg in the worst case, then it would still take us 7 days to fill a GB. Sounds sensible for today's hardware.

Your estimations might be right, but my question is mostly if we do this specifically to know the unconnected authorities or we are interested mainly in the connectivity ratio.

sandreim

In the end it shouldn't hurt.

alexggh · 2024-03-13T14:28:14Z

This can get pretty big if there are many unconnected authorities. Are we interested in exactly these? Because otherwise the metrics are always available per protocol

Those metrics are good to tell you something went wrong, but they will fail to tell which nodes are not connecting or some other stuff.

The flow I'm actually trying to optimise here, is people coming to us observing high level stuff is not working and providing us the log files they have, but there is literally nothing in there to tell you what went wrong. So at minimum we need to understand how well connected that node was by default.

Right, but then there is also some network issue which would justify a bit of verbosity every 10 minutes. If we assume that these logs would be 1Meg in the worst case, then it would still take us 7 days to fill a GB. Sounds sensible for today's hardware.

My thoughts as well given how rarely we print this, I don't thing there is any danger there.

alexggh · 2024-03-13T14:29:43Z

Your estimations might be right, but my question is mostly if we do this specifically to know the unconnected authorities or we are interested mainly in the connectivity ratio.

The unnconnected list proved useful here #3314 (comment), so yeah I think we want it.

alexggh · 2024-03-13T14:30:45Z

In the end it shouldn't hurt.

famous last words :D

burdges · 2024-03-13T19:07:51Z

Always possible to make a runlength encoded bitfield and then 7-bit or hex encode that, if youre worried about size.

* master: (65 commits) collator protocol changes for elastic scaling (validator side) (#3302) Contracts use polkavm workspace deps (#3715) Add elastic scaling support in ParaInherent BenchBuilder (#3690) Removes `as [disambiguation_path]` from `derive_impl` usage (#3652) fix(paseo-spec): New Paseo Bootnodes (#3674) Improve Penpal runtime + emulated tests (#3543) Staking ledger bonding fixes (#3639) DescribeAllTerminal for HashedDescription (#3349) Increase timeout for assertions (#3680) Add subsystems regression tests to CI (#3527) Always print connectivity report (#3677) Revert "FRAME: Create `TransactionExtension` as a replacement for `SignedExtension` (#2280)" (#3665) authority-discovery: Add log for debugging DHT authority records (#3668) Construct Runtime v2 (#1378) Support for `keyring` in runtimes (#2044) Add api-name in `cannot query the runtime API version` warning (#3653) Add a PolkaVM-based executor (#3458) Adds default config for assets pallet (#3637) Bump handlebars from 4.3.7 to 5.1.0 (#3248) [Collator Selection] Fix weight refund for `set_candidacy_bond` (#3643) ...

This is printed every 10 minutes, I see no reason why it shouldn't be in all the logs, it would give us valuable information about what is going on with node connectivity when validators come-back to us to report issues. Signed-off-by: Alexandru Gheorghe <alexandru.gheorghe@parity.io>

bkchr · 2024-03-30T20:16:16Z

Debug output is not to be printed using info logs. If people come to you complaining, you should either tell them which cli flags to change or come up for example with a dedicated RPC endpoint that generates debug information.

This said, the change in this pr should be reverted.

bkchr · 2024-03-30T20:20:44Z

#3913

eskimor approved these changes Mar 13, 2024

View reviewed changes

alexggh added the R0-silent Changes should not be mentioned in any release notes label Mar 13, 2024

sandreim approved these changes Mar 13, 2024

View reviewed changes

eskimor added this pull request to the merge queue Mar 13, 2024

Merged via the queue into master with commit 878b5dd Mar 13, 2024
133 of 136 checks passed

eskimor deleted the alexaggh/dump_connectivity branch March 13, 2024 14:52

stakeworld mentioned this pull request Mar 30, 2024

Regular connectivity reports make log reading more difficult #3906

Closed

alexggh mentioned this pull request Apr 1, 2024

Revert log level changes #3913

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Always print connectivity report #3677

Always print connectivity report #3677

alexggh commented Mar 13, 2024

sandreim commented Mar 13, 2024

eskimor commented Mar 13, 2024

sandreim commented Mar 13, 2024

sandreim left a comment

alexggh commented Mar 13, 2024

alexggh commented Mar 13, 2024 •

edited

Loading

alexggh commented Mar 13, 2024

burdges commented Mar 13, 2024

bkchr commented Mar 30, 2024

bkchr commented Mar 30, 2024

Always print connectivity report #3677

Always print connectivity report #3677

Conversation

alexggh commented Mar 13, 2024

sandreim commented Mar 13, 2024

eskimor commented Mar 13, 2024

sandreim commented Mar 13, 2024

sandreim left a comment

Choose a reason for hiding this comment

alexggh commented Mar 13, 2024

alexggh commented Mar 13, 2024 • edited Loading

alexggh commented Mar 13, 2024

burdges commented Mar 13, 2024

bkchr commented Mar 30, 2024

bkchr commented Mar 30, 2024

alexggh commented Mar 13, 2024 •

edited

Loading