
Reduce bandwidth over the VC<>BN API using dependent roots #4157

Closed
paulhauner opened this issue Apr 3, 2023 · 4 comments
Labels: optimization (Something to make Lighthouse run more efficiently.)

Comments

@paulhauner (Member)

Description

Presently the VC polls for attestation duties each slot. Our polling achieves:

  • On the first poll after the VC starts, we always learn the current- and next-epoch duties.
  • On the first poll of an epoch, we always learn the next-epoch duties.
  • Occasionally, during any poll, we might learn of a re-org that changes either the next- or current-epoch duties. Such re-orgs are very uncommon on mainnet.

Looking at the /eth/v1/validator/duties/attester/{epoch} endpoint, we see that its structure is:

Request Body: An array of the validator indices for which to obtain the duties.

[
  "1"
]

Note: we also include the epoch in the URL.

Response Body

{
  "dependent_root": "0xcf8e0d4e9587369b2301d0790347320302cc0943d5a1884560367e8208d920f2",
  "execution_optimistic": false,
  "data": [
    {
      "pubkey": "0x93247f2209abcacf57b75a51dafae777f9dd38bc7053d1af526f220a7489a6d3a2753e5f3e8b1cfe39b56f43611df74a",
      "validator_index": "1",
      "committee_index": "1",
      "committee_length": "1",
      "committees_at_slot": "1",
      "validator_committee_index": "1",
      "slot": "1"
    }
  ]
}
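
As a rough illustration, this response could be deserialized with types along the following lines. This is a sketch assuming serde; these are hypothetical types mirroring the JSON above, not Lighthouse's actual ones:

use serde::Deserialize;

// Hypothetical types mirroring the JSON above, not Lighthouse's real types.
#[derive(Deserialize)]
struct AttesterDutiesResponse {
    dependent_root: String,
    execution_optimistic: bool,
    data: Vec<AttesterDutyData>,
}

#[derive(Deserialize)]
struct AttesterDutyData {
    pubkey: String,
    validator_index: String,
    committee_index: String,
    committee_length: String,
    committees_at_slot: String,
    validator_committee_index: String,
    slot: String,
}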

In our current implementation, the request will contain all validators managed by the VC. I claim that, in the best case, we could send a request for only one validator and still know whether or not all the other validators need updating.

I claim this because a shuffling is uniquely identified by (epoch, dependent_root) (see "Background Info" below if this isn't clear to you). We already use this assumption in the VC:

local_pubkeys.contains(&duty.pubkey) && {
    // Only update the duties if either is true:
    //
    // - There were no known duties for this epoch.
    // - The dependent root has changed, signalling a re-org.
    attesters.get(&duty.pubkey).map_or(true, |duties| {
        duties
            .get(&epoch)
            .map_or(true, |(prior, _)| *prior != dependent_root)
    })
}

With the above statement we're filtering out any duties that already have the same (epoch, dependent_root) identifier. So, our current flow goes like this:

  • Download duties/attester for all validators.
  • Filter out any duties with a known (epoch, dependent_root).
  • Update the duties_service.attesters with any new duties (reference).

I propose that we should instead (see the sketch after this list):

  • Download duties/attester for INITIAL_DUTIES_QUERY_SIZE = 1 validators.
  • Find any validators which have conflicting (epoch, dependent_root) values.
  • If there are any conflicting validators, send a second duties/attester request for those validators.
  • Update the duties_service.attesters with any new duties which resulted from the first or second request.
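
A minimal sketch of that flow, assuming a simple per-validator cache of (epoch, dependent_root). All names here are illustrative placeholders, not Lighthouse's actual API:

use std::collections::HashMap;

type Epoch = u64;
type Hash256 = [u8; 32];
type ValidatorIndex = u64;

const INITIAL_DUTIES_QUERY_SIZE: usize = 1;

// Per-validator map of epoch -> dependent_root for already-known duties.
type KnownDuties = HashMap<ValidatorIndex, HashMap<Epoch, Hash256>>;

// Stand-in for the HTTP request to duties/attester; returns the dependent
// root reported by the BN.
fn fetch_duties(_epoch: Epoch, _validators: &[ValidatorIndex]) -> Hash256 {
    unimplemented!()
}

fn poll(epoch: Epoch, all_validators: &[ValidatorIndex], known: &KnownDuties) {
    // 1. Query duties for a small probe set of validators.
    let n = INITIAL_DUTIES_QUERY_SIZE.min(all_validators.len());
    let dependent_root = fetch_duties(epoch, &all_validators[..n]);

    // 2. Find validators whose cached (epoch, dependent_root) is missing or
    //    disagrees with the root the BN just reported.
    let conflicting: Vec<ValidatorIndex> = all_validators
        .iter()
        .copied()
        .filter(|v| {
            known
                .get(v)
                .and_then(|m| m.get(&epoch))
                .map_or(true, |prior| *prior != dependent_root)
        })
        .collect();

    // 3. Only if something changed, re-download duties for those validators
    //    and update duties_service.attesters with the results.
    if !conflicting.is_empty() {
        let _ = fetch_duties(epoch, &conflicting);
    }
}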

In the case where we don't expect the duties to change (i.e., it's not the first request after VC boot, it's not the first request of an epoch and there wasn't a re-org), we should reduce the bandwidth by a factor of the number of validators in that VC (e.g., if the VC has 100 validators then the request/response should be ~100x smaller).

Additional Details

There's some extra detail regarding the first request of INITIAL_DUTIES_QUERY_SIZE. I propose that we actually make this query of size max(INITIAL_DUTIES_QUERY_SIZE, num_uninitialized_validators), where num_uninitialized_validators is the count of validators for which we don't already know their duties for that epoch. This will be all validators when booting for the first time or when querying for the "next epoch". It avoids making the second request when we already know we need the duties for all validators.
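
In code, the first query's size might be computed roughly like this (continuing the hypothetical names from the sketch above):

// Validators with no known duties for this epoch yet.
let num_uninitialized_validators = all_validators
    .iter()
    .copied()
    .filter(|v| known.get(v).map_or(true, |m| !m.contains_key(&epoch)))
    .count();

// Make the first query big enough to cover a known-cold cache, so we
// avoid a guaranteed second round-trip on boot or at a new epoch.
let query_size = std::cmp::max(INITIAL_DUTIES_QUERY_SIZE, num_uninitialized_validators);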

Background Info

Whilst the term "dependent root" doesn't appear in the specification, it exists as a concept in get_beacon_committee. We use dependent_root to refer to the block root at the same slot that get_seed uses to load the randao_mix which is used as the input to compute_committee.

The argument is that any chain which has dependent_root in its history will always return the same result for get_beacon_committee given the same epoch.
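
This is also why (epoch, dependent_root) makes a safe cache key. A sketch of the idea, with illustrative types rather than Lighthouse's actual cache:

use std::collections::HashMap;

type Epoch = u64;
type Hash256 = [u8; 32];

// Committees for an epoch: committee index -> validator indices (illustrative).
type Committees = Vec<Vec<u64>>;

struct ShufflingCache {
    cache: HashMap<(Epoch, Hash256), Committees>,
}

impl ShufflingCache {
    // Any two chains that share `dependent_root` in their history compute
    // identical committees for `epoch`, so a hit here is valid regardless
    // of which fork the lookup came from.
    fn get(&self, epoch: Epoch, dependent_root: Hash256) -> Option<&Committees> {
        self.cache.get(&(epoch, dependent_root))
    }
}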

Here's a scrappy diagram that might help:

[Diagram: IMG_1651]

The term "dependent root" was introduced to the API some time after we'd implemented the concept in Lighthouse for keying our internal shuffling caches. Internally, we will sometimes refer to it as the "shuffling decision" root (example).

paulhauner added the optimization label on Apr 3, 2023
@michaelsproul (Member)

Something that Teku does which we could also consider is subscribing to the SSE event stream and using that to infer a change of dependent root. I guess the head event is sufficient, as it includes the previous and current dependent roots: https://ethereum.github.io/beacon-APIs/?urls.primaryName=dev#/Events/eventstream
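
For reference, the head event carries both dependent roots. Per the linked spec, its payload looks something like this (values illustrative):

{
  "slot": "10",
  "block": "0x9a2fefd2fdb57f74993c7780ea5b9030d2897b615b89f808011ca5aebed54eaf",
  "state": "0x600e852a08c1200654ddf11025f1ceacb3c2e74bdd5c630cde0838b2591b69f9",
  "epoch_transition": false,
  "previous_duty_dependent_root": "0x5e0043f107cb57913498fbf2f99ff55e730bf1e151f02f221e977c91a90a0e91",
  "current_duty_dependent_root": "0x5e0043f107cb57913498fbf2f99ff55e730bf1e151f02f221e977c91a90a0e91",
  "execution_optimistic": false
}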

This would be more of a major architectural change though, and may come with other complications (handling stream reconnects, etc.). Perhaps the reduced polling approach is more pragmatic.

jimmygchen self-assigned this on Apr 5, 2023
@jimmygchen (Member)

@paulhauner thanks a lot for the nice write-up! I'd like to look into this.

@jimmygchen (Member)

PR created here #4170

bors bot pushed a commit that referenced this issue May 15, 2023
## Issue Addressed

#4157 

## Proposed Changes

See description in #4157.

In diagram form:

![reduce-attestation-bandwidth](https://user-images.githubusercontent.com/742762/230277084-f97301c1-0c5d-4fb3-92f9-91f99e4dc7d4.png)


Co-authored-by: Jimmy Chen <jimmy@sigmaprime.io>
@jimmygchen (Member)

Completed in #4170
