Engine API: add getPayloadBodies method #146
Conversation
Would it make more sense for the interface here to be `getPayloadBodies(hash) -> body`, and lean into JSON-RPC batching instead of enshrining the batching? Otherwise, looks good to me.
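For concreteness, a rough sketch of the two shapes under discussion. The per-hash method name `engine_getPayloadBody` is hypothetical (only the batched `engine_getPayloadBodies` is actually proposed here), and the hashes are placeholders:

```python
import json

# Sketch comparing the two interface shapes discussed above. The per-hash
# method name `engine_getPayloadBody` is hypothetical; the proposal
# enshrines batching in a single `engine_getPayloadBodies` method instead.

hashes = ["0xaa", "0xbb", "0xcc"]  # placeholder block hashes

# Option A: JSON-RPC batching -- one request envelope per hash.
jsonrpc_batch = [
    {"jsonrpc": "2.0", "id": i, "method": "engine_getPayloadBody", "params": [h]}
    for i, h in enumerate(hashes)
]

# Option B: batching enshrined in the method -- one envelope, a list param.
enshrined = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "engine_getPayloadBodies",
    "params": [hashes],
}

# Either form is valid JSON-RPC 2.0 on the wire.
wire_a = json.dumps(jsonrpc_batch)
wire_b = json.dumps(enshrined)
```

Option A relies on every client library exposing JSON-RPC batch support; Option B puts the batching into the method's own semantics.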
Good question. I don't know how convenient the JSON-RPC batching interface is across different client libraries. Since this proposal is straightforward for EL clients to implement (because the logic already exists), I think we can accept the batching being enshrined.
Valuable input on that from Discord https://discord.com/channels/595666850260713488/692062809701482577/931309418879012945:
Batching enshrined in the method's semantics is preferable.
Is there any concern about JSON ser/deser overhead? If I'm following correctly, it's sending what was previously p2p traffic over JSON. I'd guess that the overhead will be significant and may become the bottleneck for sync.
Each […] So I think the ser/deser overhead is pretty minimal.
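A back-of-the-envelope sketch of that overhead, using random placeholder bytes rather than real transactions (the sizes and counts are assumptions, not measurements):

```python
import json
import os

# Back-of-the-envelope sketch of the JSON overhead: on the p2p layer
# transaction bodies travel as raw bytes, while JSON-RPC carries them as
# hex strings ("0x..."). Random placeholder bytes stand in for real
# transactions here.

txs = [os.urandom(200) for _ in range(100)]   # ~100 txs of 200 bytes each
raw_size = sum(len(tx) for tx in txs)         # p2p-style raw payload size

body = {"transactions": ["0x" + tx.hex() for tx in txs]}
json_size = len(json.dumps(body))

# Hex encoding roughly doubles the byte count, plus quoting and commas,
# so the wire size is ~2x the raw size: noticeable, but a constant
# factor rather than an asymptotic blow-up.
overhead = json_size / raw_size
```

The factor-of-two inflation from hex encoding is the dominant cost; the JSON framing itself (braces, quotes, commas) adds only a few percent on top.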
When EL clients are snap syncing and CL clients are optimistically synced to head, will the EL clients respond to `getPayloadBodies` requests?
Good question. Does a CL client have to serve blocks to remote peers while syncing?
If it is optimistically syncing, yes: it can gossip blocks regularly. Typically we would have those blocks in a hot cache, but we may be asked for old blocks.
## Proposed Changes

Reduce post-merge disk usage by not storing finalized execution payloads in Lighthouse's database.

:warning: **This is achieved in a backwards-incompatible way for networks that have already merged** :warning:. Kiln users and shadow fork enjoyers will be unable to downgrade after running the code from this PR. The upgrade migration may take several minutes to run, and can't be aborted after it begins.

The main changes are:

- New column in the database called `ExecPayload`, keyed by beacon block root.
- The `BeaconBlock` column now stores blinded blocks only.
- Lots of places that previously used full blocks now use blinded blocks, e.g. analytics APIs, block replay in the DB, etc.
- On finalization:
  - `prune_abandoned_forks` deletes non-canonical payloads whilst deleting non-canonical blocks.
  - `migrate_db` deletes finalized canonical payloads whilst deleting finalized states.
- Conversions between blinded and full blocks are implemented in a compositional way, duplicating some work from Sean's PR #3134.
- The execution layer has a new `get_payload_by_block_hash` method that reconstructs a payload using the EE's `eth_getBlockByHash` call.
  - I've tested manually that it works on Kiln, using Geth and Nethermind.
  - This isn't necessarily the most efficient method, and new engine APIs are being discussed to improve this: ethereum/execution-apis#146.
- We're depending on the `ethers` master branch, due to lots of recent changes. We're also using a workaround for gakonst/ethers-rs#1134.
- Payload reconstruction is used in the HTTP API via `BeaconChain::get_block`, which is now `async`. Due to the `async` fn, the `blocking_json` wrapper has been removed.
- Payload reconstruction is used in network RPC to serve blocks-by-{root,range} responses.
Here the `async` adjustment is messier, although I think I've managed to come up with a reasonable compromise: the handlers take the `SendOnDrop` by value so that they can drop it on _task completion_ (after the `fn` returns). Still, this is introducing disk reads onto core executor threads, which may have a negative performance impact (thoughts appreciated).

## Additional Info

- [x] For performance it would be great to remove the cloning of full blocks when converting them to blinded blocks to write to disk. I'm going to experiment with a `put_block` API that takes the block by value, breaks it into a blinded block and a payload, stores the blinded block, and then re-assembles the full block for the caller.
- [x] We should measure the latency of blocks-by-root and blocks-by-range responses.
- [x] We should add integration tests that stress the payload reconstruction (basic tests done; issue for more extensive tests: #3159).
- [x] We should (manually) test the schema v9 migration from several prior versions, particularly as blocks have changed on disk and some migrations rely on being able to load blocks.

Co-authored-by: Paul Hauner <paul@paulhauner.com>
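The reconstruction step described above (`get_payload_by_block_hash` built on `eth_getBlockByHash`) can be sketched roughly as follows. The `rpc` callable, the stub, and the dict-shaped result are illustrative assumptions, not Lighthouse's actual types:

```python
# Minimal sketch of payload reconstruction via `eth_getBlockByHash`, as
# the PR description outlines. The `rpc` callable and the field mapping
# are illustrative assumptions, not the actual Lighthouse code (which
# builds a typed ExecutionPayload, not a dict).

def get_payload_by_block_hash(rpc, block_hash):
    # The second param `True` requests full transaction objects rather
    # than just their hashes, since the payload needs the transactions.
    block = rpc("eth_getBlockByHash", [block_hash, True])
    if block is None:
        return None  # the EE doesn't know this block
    return {
        "block_hash": block["hash"],
        "parent_hash": block["parentHash"],
        "transactions": block["transactions"],
    }

# Stub EE standing in for a real JSON-RPC connection.
def stub_rpc(method, params):
    assert method == "eth_getBlockByHash"
    return {"hash": params[0], "parentHash": "0x00", "transactions": ["0xf86b"]}

payload = get_payload_by_block_hash(stub_rpc, "0xabc")
```

This per-block round trip is what makes the approach "not necessarily the most efficient": a dedicated batched engine API avoids one RPC call per block.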
We're seeing issues with ELs timing out under a flood of […]
Agree. We have some users reporting this problem.
Closing in favour of #218
Proposal
This PR adds the `engine_getPayloadBodies` method to the Engine API. For a list of `block_hash` values, the method returns `ExecutionPayloadBody` objects, which consist of the `transactions` list of the corresponding payloads. The interface of this method resembles the `GetBlockBodies` message in the ETH protocol -- this is the crux of this proposal.
Concerns
The main concern is that retrieving bodies by hash doesn't allow for utilizing the linearity of block data, as a request by block numbers would. Utilizing the linearity property opens up an opportunity for a more optimal way of accessing this information; this especially makes sense for the finalized part of the blockchain.
See #137 and the discussion thread for more details.
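As a sketch of the request/response shape this proposal describes, with one body returned per requested hash (hashes, transaction bytes, and counts are placeholders):

```python
# Illustrative request/response for the proposed method, following the
# shape described above: a list of block hashes in, one
# ExecutionPayloadBody (a `transactions` list) per requested payload out.
# Hashes and transaction bytes are placeholders, not real values.

request = {
    "jsonrpc": "2.0",
    "id": 7,
    "method": "engine_getPayloadBodies",
    "params": [["0x11", "0x22"]],
}

response = {
    "jsonrpc": "2.0",
    "id": 7,
    "result": [
        {"transactions": ["0xf86c", "0xf86d"]},  # body for 0x11
        {"transactions": []},                    # 0x22 was an empty block
    ],
}

# Note: a response keyed by unordered hashes can't exploit the linearity
# of a contiguous block range, which is the concern raised above.
```

A by-number/by-range variant of the same shape is what the superseding proposal (#218) explores to address the linearity concern.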
Implementations