stagedsync: fix bor heimdall mining flow #9149

taratorio · 2024-01-05T17:38:30Z

Currently the mining loop is broken for the polygon chain. This PR fixes this.

High level changes:

- Introduces new Bor<->Heimdall stage specifically for the needs of the mining flow
- Extracts out common logic from Bor<->Heimdall sync and mining stages into shared functions
- Removes mine flag for the Bor<->Heimdall sync stage
- Extends the current StartMining function to prefetch span zero if needed before the mining loop is started
- Fixes Bor to read span zero (instead of span 1) from heimdall when the span is not initially set in the local smart contract that the Spanner uses
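
To illustrate the "prefetch span zero if needed" step, here is a minimal sketch. It is not the code from this PR: the `Span`, `SpanFetcher` and `SpanStore` types and the `prefetchSpanZeroIfNeeded` helper are hypothetical stand-ins, and the block range in the fake client is only an illustrative value. The idea is that the fetch is skipped when span zero is already stored locally, so calling it before every start of the mining loop is cheap.

```go
package main

import "errors"

// Span is a simplified, hypothetical stand-in for a Heimdall span.
type Span struct {
	ID         uint64
	StartBlock uint64
	EndBlock   uint64
}

// SpanFetcher abstracts the Heimdall client (hypothetical interface).
type SpanFetcher interface {
	FetchSpan(id uint64) (*Span, error)
}

// SpanStore abstracts local span persistence (hypothetical interface).
type SpanStore interface {
	HasSpan(id uint64) bool
	PutSpan(s *Span) error
}

// ErrNilSpan is returned when the client yields no span and no error.
var ErrNilSpan = errors.New("heimdall returned nil span")

// prefetchSpanZeroIfNeeded stores span zero locally unless it is already
// present. It reports whether a fetch actually happened.
func prefetchSpanZeroIfNeeded(store SpanStore, heimdall SpanFetcher) (bool, error) {
	if store.HasSpan(0) {
		return false, nil // already available, nothing to do
	}
	span, err := heimdall.FetchSpan(0)
	if err != nil {
		return false, err
	}
	if span == nil {
		return false, ErrNilSpan
	}
	if err := store.PutSpan(span); err != nil {
		return false, err
	}
	return true, nil
}

// memStore is an in-memory SpanStore used only for this example.
type memStore map[uint64]*Span

func (m memStore) HasSpan(id uint64) bool { _, ok := m[id]; return ok }
func (m memStore) PutSpan(s *Span) error  { m[s.ID] = s; return nil }

// fakeHeimdall counts calls and returns a span with illustrative bounds.
type fakeHeimdall struct{ calls int }

func (f *fakeHeimdall) FetchSpan(id uint64) (*Span, error) {
	f.calls++
	return &Span{ID: id, StartBlock: 0, EndBlock: 255}, nil
}
```

With the in-memory fakes, a first call fetches and stores span zero, and a second call returns without contacting the client again.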

Test with devnet "state-sync" scenario:


manav2401 · 2024-01-09T10:17:01Z

eth/stagedsync/stage_mining_bor_heimdall.go

+
+	// Whitelist service is called to check if the bor chain is on the canonical chain according to milestones
+	whitelistService := whitelist.GetWhitelistingService()
+	if whitelistService != nil && !whitelistService.IsValidChain(headerNum, []*types.Header{header}) {


As discussed, we might not need this check at all in the mining stage, as the block passes through the sync stages anyway. But if the plan is to refactor that part and prevent the block from going through every sync stage when it's mined by the node itself, then yes, we can keep this.

@manav2401 blocks are broadcast to the p2p network when they are freshly out of the mining loop (in addition to being communicated to the sync loop). I think this check is here to prevent the miner from broadcasting a bad block. I've left it in to maintain the same logic as before my refactor.

Alright, yeah, makes sense. It's very rare, but it may happen if the node is out of sync and still tries to produce a block. Thanks!

yes, we can always revise this later once we do bigger refactors - will keep it in mind
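
For readers following the thread, the guard being discussed can be sketched roughly as below. This is an assumption-laden illustration, not the code from the PR: `Header`, `ChainValidator`, `checkBeforeBroadcast` and `maxHeight` are hypothetical stand-ins, and the interface only mirrors the call shape visible in the diff snippet (`IsValidChain(headerNum, []*types.Header{header})`). The point is that a freshly mined header is validated before it is handed to the p2p broadcast, and a missing service means the check is skipped.

```go
package main

import "errors"

// Header is a simplified, hypothetical stand-in for a block header.
type Header struct{ Number uint64 }

// ChainValidator mirrors the call shape seen in the diff snippet
// (hypothetical interface, not the real whitelist service type).
type ChainValidator interface {
	IsValidChain(currentNum uint64, chain []*Header) bool
}

// ErrNotCanonical signals that the mined header must not be broadcast.
var ErrNotCanonical = errors.New("mined header rejected by milestone whitelist")

// checkBeforeBroadcast returns nil when the header may be broadcast.
// A nil validator means no whitelist service is configured, so the
// check is skipped, matching the nil guard in the snippet.
func checkBeforeBroadcast(v ChainValidator, h *Header) error {
	if v == nil {
		return nil
	}
	if !v.IsValidChain(h.Number, []*Header{h}) {
		return ErrNotCanonical
	}
	return nil
}

// maxHeight is a toy validator for this example only: it rejects any
// chain containing a header above the given height.
type maxHeight uint64

func (m maxHeight) IsValidChain(_ uint64, chain []*Header) bool {
	for _, h := range chain {
		if h.Number > uint64(m) {
			return false
		}
	}
	return true
}
```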


This PR fixes two things that are more commonly visible in multi-client devnet setups for Polygon. Context: the span logic is a bit different for the first two spans. Bor/Erigon makes the first commit for a span at the start of a sprint (i.e. if the sprint length is 16, it calls `commitSpan` at block 16 in bor consensus for the first time). Span 0 is hard-coded in the genesis contracts, so it needs to commit the span with `id=1` on that block (see equivalent code in bor [here](https://github.com/maticnetwork/bor/blob/v1.2.3/consensus/bor/bor.go#L1150-L1152)). At that time, it needs to have the 1st span available. Bor fetches it on the go, while Erigon processes it in a separate stage and stores it in a snapshot. Hence, we need to fetch the 1st span as an exception in Erigon while we're still in span 0, but we also need to make sure that it doesn't block processing of any previous blocks. Based on #9149, the span ID used to fetch and commit the span was wrong, and span 1 needs to be loaded explicitly in the 1st sprint.
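
The first-sprint special case described above can be sketched as follows. This models only what the paragraph states (the first commit happens at the block equal to the sprint length, and the span committed there has `id=1`); `firstSprintCommit` is a hypothetical helper, and later span commits are deliberately not modeled.

```go
package main

// firstSprintCommit reports whether blockNum is the block at which the
// first span commit is due, and which span ID should be committed there.
// It covers only the first-sprint exception: span 0 comes from the
// genesis contracts, so the first commit is for span 1.
// Hypothetical helper for illustration; it does not model later commits.
func firstSprintCommit(blockNum, sprintLen uint64) (spanID uint64, due bool) {
	if sprintLen == 0 {
		return 0, false // no sprint configured, nothing to commit
	}
	if blockNum == sprintLen {
		return 1, true
	}
	return 0, false
}
```

With a sprint length of 16, block 16 yields span ID 1 with a commit due, while blocks 15 and 32 report that this particular exception does not apply.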

taratorio added 4 commits January 5, 2024 13:23

stagedsync: implement bor span for chain reader and fix loggers

b449fb3

Merge branch 'devel' of github.com:ledgerwatch/erigon into fix-chain-reader-span-and-loggers

c5ffa9a

stagedsync: fix bor heimdall mining flow

924077f

Merge branch 'devel' of github.com:ledgerwatch/erigon into fix-bor-mining-loop

c84de21

taratorio added the polygon label Jan 5, 2024

taratorio requested review from mh0lt and battlmonstr January 5, 2024 17:38

stagedsync: simplify FetchSpanZeroForMiningIfNeeded

df827be

taratorio requested a review from AlexeyAkhunov January 5, 2024 18:10

mh0lt approved these changes Jan 5, 2024


taratorio added 5 commits January 5, 2024 18:22

stagedsync: add comment

d29c78c

stagedsync: fix tests

9a4cab8

Merge branch 'devel' of github.com:ledgerwatch/erigon into fix-bor-mining-loop

c3b7c10

stagedsync: check bor header extra data only in sync-er stage

62c409b

Merge branch 'devel' of github.com:ledgerwatch/erigon into fix-bor-mining-loop

32baf10

manav2401 reviewed Jan 9, 2024


Merge branch 'devel' of github.com:ledgerwatch/erigon into fix-bor-mining-loop

55ffcf4

manav2401 approved these changes Jan 9, 2024


taratorio merged commit 74ec3a9 into devel Jan 9, 2024
7 checks passed

taratorio deleted the fix-bor-mining-loop branch January 9, 2024 11:37

taratorio mentioned this pull request Jan 9, 2024

eth/stagedsync: fixes for mining on devnet #8874

Closed

manav2401 mentioned this pull request Jan 25, 2024

eth, polygon/bor: fix fetch span logic for devnets #9312

Merged
