Fix RPC name override #11813

dhaidashenko · 2024-01-18T18:32:36Z

No description provided.

github-actions · 2024-01-18T18:32:58Z

I see that you haven't updated any CHANGELOG files. Would it make sense to do so?

jmank88 · 2024-01-18T19:06:04Z

core/chains/legacyevm/chain.go

 			name := fmt.Sprintf("eth-sendonly-rpc-%d", i)
+			if node.Name != nil && *node.Name != "" {
+				name = *node.Name
+			}
 			rpc := evmclient.NewRPCClient(lggr, empty, (*url.URL)(node.HTTPURL), name, int32(i), chainID,
 				commonclient.Secondary)


It is safe to pass the nodeName directly. We validate that all nodes are named. Notice that we already pas it without nil check on the line below.

Suggested change

name := fmt.Sprintf("eth-sendonly-rpc-%d", i)

if node.Name != nil && *node.Name != "" {

name = *node.Name

}

rpc := evmclient.NewRPCClient(lggr, empty, (*url.URL)(node.HTTPURL), name, int32(i), chainID,

commonclient.Secondary)

rpc := evmclient.NewRPCClient(lggr, empty, (*url.URL)(node.HTTPURL), *node.Name, int32(i), chainID,

commonclient.Secondary)

samsondav

Can we get a test demonstrating what this fixes?

jmank88 · 2024-01-18T19:45:39Z

Can we get a test demonstrating what this fixes?

We haven't had the ability to test prom metrics coherently in the past, since they are all mixed up in global state. However, we could add a txtar test and checks the /metrics endpoint, like for /health: https://github.com/smartcontractkit/chainlink/blob/develop/testdata/scripts/health/multi-chain.txtar

dhaidashenko · 2024-01-19T17:07:47Z

Can we get a test demonstrating what this fixes?

We haven't had the ability to test prom metrics coherently in the past, since they are all mixed up in global state. However, we could add a txtar test and checks the /metrics endpoint, like for /health: https://github.com/smartcontractkit/chainlink/blob/develop/testdata/scripts/health/multi-chain.txtar

Added the test, but it does not feel that this is the right approach. The script seems too complex for txtar test. Also it's quite limited in terms of available metrics as node does not perform any actions. Automated check against soaked node might be a better option.

jmank88 · 2024-01-19T17:27:54Z

Can we get a test demonstrating what this fixes?

We haven't had the ability to test prom metrics coherently in the past, since they are all mixed up in global state. However, we could add a txtar test and checks the /metrics endpoint, like for /health: develop/testdata/scripts/health/multi-chain.txtar

Added the test, but it does not feel that this is the right approach. The script seems too complex for txtar test. Also it's quite limited in terms of available metrics as node does not perform any actions. Automated check against soaked node might be a better option.

I was hoping it could be made simpler, but soak tests would be fine too 🤷

cl-sonarqube-production · 2024-01-19T17:35:17Z

SonarQube Quality Gate

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

100.0% Coverage
0.0% Duplication

prashantkumar1982

Thanks for fixing this!

dhaidashenko · 2024-01-22T12:11:14Z

Can we get a test demonstrating what this fixes?

We haven't had the ability to test prom metrics coherently in the past, since they are all mixed up in global state. However, we could add a txtar test and checks the /metrics endpoint, like for /health: develop/testdata/scripts/health/multi-chain.txtar

Added the test, but it does not feel that this is the right approach. The script seems too complex for txtar test. Also it's quite limited in terms of available metrics as node does not perform any actions. Automated check against soaked node might be a better option.

I was hoping it could be made simpler, but soak tests would be fine too 🤷

@jmank88 on second thought, implementation of a more generalized "framework" for metrics testing seems like a high-effort low-priority task. As the current approach gets the job done in this particular case, do you mind merging the fix as is?

jmank88 · 2024-01-22T12:42:14Z

I was hoping it could be made simpler, but soak tests would be fine too 🤷

@jmank88 on second thought, implementation of a more generalized "framework" for metrics testing seems like a high-effort low-priority task. As the current approach gets the job done in this particular case, do you mind merging the fix as is?

Did you try using the basic stderr/stdout commands? They accept grep patterns, so it seems like this could be checked with one line for each metric, like:

curl $NODEURL/metrics

stdout 'evm_pool_rpc_node_dials_total{evmChainID="68472",nodeName="BlueEVMPrimaryNode"}'
stdout ...

I don't fully understand the extra retry logic though. Was it flakey without?

dhaidashenko · 2024-01-22T12:49:42Z

I was hoping it could be made simpler, but soak tests would be fine too 🤷

@jmank88 on second thought, implementation of a more generalized "framework" for metrics testing seems like a high-effort low-priority task. As the current approach gets the job done in this particular case, do you mind merging the fix as is?

Did you try using the basic stderr/stdout commands? They accept grep patterns, so it seems like this could be checked with one line for each metric, like:
curl $NODEURL/metrics

stdout 'evm_pool_rpc_node_dials_total{evmChainID="68472",nodeName="BlueEVMPrimaryNode"}'
stdout ...
I don't fully understand the extra retry logic though. Was it flakey without?

I did not observe the flake. But Dial happens in a separate goroutine, so there is no guarantee that the metric is visible, when API is ready to handle requests.

Fix rpc name override

50a2e1a

dhaidashenko temporarily deployed to sdlc January 18, 2024 18:32 — with GitHub Actions Inactive

dhaidashenko marked this pull request as ready for review January 18, 2024 18:59

dhaidashenko requested a review from samsondav as a code owner January 18, 2024 18:59

jmank88 reviewed Jan 18, 2024

View reviewed changes

use configured name without additional checks

4ceb04c

dhaidashenko temporarily deployed to sdlc January 18, 2024 19:08 — with GitHub Actions Inactive

dhaidashenko requested a review from jmank88 January 18, 2024 19:35

samsondav reviewed Jan 18, 2024

View reviewed changes

multi-node metrics test

094c84d

dhaidashenko temporarily deployed to sdlc January 19, 2024 17:02 — with GitHub Actions Inactive

Fix typo

85b5ccd

dhaidashenko temporarily deployed to sdlc January 19, 2024 17:24 — with GitHub Actions Inactive

prashantkumar1982 approved these changes Jan 20, 2024

View reviewed changes

jmank88 added this pull request to the merge queue Jan 22, 2024

Merged via the queue into develop with commit 41f2497 Jan 22, 2024
82 checks passed

jmank88 deleted the fix/BCI-2605-rpc-name branch January 22, 2024 14:46

This was referenced Feb 22, 2024

release/2.9.0 -> master #12148

Closed

Chore/release 2.9.0 -> master #12190

Closed

snehaagni mentioned this pull request Mar 7, 2024

chore/release 2.9.1 to master #12346

Closed

snehaagni mentioned this pull request Mar 21, 2024

chore/release 2.9.1 to master take2 #12530

Closed

snehaagni mentioned this pull request Mar 22, 2024

chore/release 2.9.1 to master take3 #12551

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix RPC name override #11813

Fix RPC name override #11813

dhaidashenko commented Jan 18, 2024

github-actions bot commented Jan 18, 2024

jmank88 Jan 18, 2024

samsondav left a comment

jmank88 commented Jan 18, 2024

dhaidashenko commented Jan 19, 2024 •

edited

Loading

jmank88 commented Jan 19, 2024

cl-sonarqube-production bot commented Jan 19, 2024

prashantkumar1982 left a comment

dhaidashenko commented Jan 22, 2024

jmank88 commented Jan 22, 2024

dhaidashenko commented Jan 22, 2024

Fix RPC name override #11813

Fix RPC name override #11813

Conversation

dhaidashenko commented Jan 18, 2024

github-actions bot commented Jan 18, 2024

jmank88 Jan 18, 2024

Choose a reason for hiding this comment

samsondav left a comment

Choose a reason for hiding this comment

jmank88 commented Jan 18, 2024

dhaidashenko commented Jan 19, 2024 • edited Loading

jmank88 commented Jan 19, 2024

cl-sonarqube-production bot commented Jan 19, 2024

prashantkumar1982 left a comment

Choose a reason for hiding this comment

dhaidashenko commented Jan 22, 2024

jmank88 commented Jan 22, 2024

dhaidashenko commented Jan 22, 2024

dhaidashenko commented Jan 19, 2024 •

edited

Loading