Store finalized block roots in database (3s startup) #3320

arnetheduck · 2022-01-25T22:31:11Z

When the chain has finalized a checkpoint, the history from that point
onwards becomes linear - this is exploited in .era files to allow
constant-time by-slot lookups.

In the database, we can do the same by storing finalized block roots in
a simple sparse table indexed by slot, bringing the two representations
closer to each other in terms of conceptual layout and performance.

Doing so has a number of interesting effects:

mainnet startup time is improved 3-5x (3s on my laptop)
the first startup might take slightly longer as the new index is
being built - ~10s on the same laptop
we no longer rely on the beacon block summaries to load the full dag -
this is a lot faster because we no longer have to look up each block by
parent root
a collateral benefit is that we no longer need to load the full
summaries table into memory - we get the RSS benefits of DAG loading: don't preload summaries in memory #3164 without
the CPU hit.

Other random stuff:

simplify forky block generics
fix withManyWrites multiple evaluation
fix validator key cache not being updated properly in chaindag
read-only mode
drop pre-altair summaries from kvstore
recreate missing summaries from altair+ blocks as well (in case
database has lost some to an involuntary restart)
print database startup timings in chaindag load log
avoid allocating superfluos state at startup
use a recursive sql query to load the summaries of the unfinalized
blocks

github-actions · 2022-01-25T23:51:37Z

Unit Test Results

    12 files ±0   806 suites +4 31m 12s ⏱️ - 7m 58s
1 652 tests +1 1 604 ✔️ +1   48 💤 ±0 0 ❌ ±0
9 665 runs +4 9 561 ✔️ +4 104 💤 ±0 0 ❌ ±0

Results for commit fba2cae. ± Comparison against base commit 29e2169.

♻️ This comment has been updated with latest results.

zah · 2022-01-26T10:46:04Z

beacon_chain/beacon_chain_db.nim

+      yield res
+    elif (let blck = db.getMergeBlock(res.root); blck.isSome()):
+      res.summary = blck.get().message.toBeaconBlockSummary()
+      yield res


Looks like you can place a single yield after the if/elif/else section here. Reducing the number of yeilds in an inline iterator is beneficial for reducing the overall code size.

zah · 2022-01-26T10:56:26Z

beacon_chain/consensus_object_pools/blockchain_dag.nim

@@ -433,19 +454,59 @@ proc init*(T: type ChainDAGRef, cfg: RuntimeConfig, db: BeaconChainDB,
    withBlck(genesisBlock): BlockRef.init(genesisBlockRoot, blck.message)

  var
-    headRef: BlockRef
+    backfillBlocks = newSeq[Eth2Digest](tailRef.slot.int)


Doesn't this have to be tailRef.slot.int - backfill.slot semantically?

no: the seq is there to answer by-slot requests from the rest and libp2p api and needs to cover the entire genesis->tail range (for which we have no blockref) - backfill.slot identifies the "valid" part of this seq.

in other words: after backfilling is done, this seq holds an in-memory mapping of slot->root which we then use to load blocks from the database with.

btw, this is the seq we talked about that potentially could be removed in future releases and replaced with a database query, in a future release - it's still a bit early for that though

Isn't the name misleading then? My first interpretation of backfillBlocks will be "the blocks that we backfilled"

Isn't finalizedBlocks the sequence that starts from genesis?

one, perhaps confusing inconsistency is that finalizedBlocks in the database covers all finalized blocks, while the in-memory structure is split in two, at the tail. I don't have good ideas for how to make that more clear though, right now

But why does backfillBlocks need to go to genesis? Shouldn't it cover only the weak subjectivity horizon?

maybe in the future? right now, the de-facto standard is to backfill to genesis because that's the only point from which you reliably can regenerate states (there's no standard way to distribute states, except the debug REST api) - also, it's needed for clients such as .. say .. nimbus, that don't support checkpoint syncing: a hard-fork is needed before genesis support is dropped.

Well, and this gets us to my original question. Isn't the size of backfillBlocks semantically tailRef.slot.int - backfill.slot? (this works OK now because backfill.slot is zero and it will work OK in the future).

still no - backfill.slot is the "first" slot for which we have a block in the database. on checkpoint sync start, it will have the value of tail.slot, then it moves towards genesis. we don't want to reallocate the seq every time a block is backfilled, so we allocate the full seq at once.

When the chain has finalized a checkpoint, the history from that point onwards becomes linear - this is exploited in `.era` files to allow constant-time by-slot lookups. In the database, we can do the same by storing finalized block roots in a simple sparse table indexed by slot, bringing the two representations closer to each other in terms of conceptual layout and performance. Doing so has a number of interesting effects: * mainnet startup time is improved 3-5x (3s on my laptop) * the _first_ startup might take slightly longer as the new index is being built - ~10s on the same laptop * we no longer rely on the beacon block summaries to load the full dag - this is a lot faster because we no longer have to look up each block by parent root * a collateral benefit is that we no longer need to load the full summaries table into memory - we get the RSS benefits of #3164 without the CPU hit. Other random stuff: * simplify forky block generics * fix withManyWrites multiple evaluation * fix validator key cache not being updated properly in chaindag read-only mode * drop pre-altair summaries from `kvstore` * recreate missing summaries from altair+ blocks as well (in case database has lost some to an involuntary restart) * print database startup timings in chaindag load log * avoid allocating superfluos state at startup * use a recursive sql query to load the summaries of the unfinalized blocks

* fix yields * dispose minIdStmt

zah reviewed Jan 26, 2022

View reviewed changes

arnetheduck force-pushed the fin-block-db branch from b77dce7 to 5df81cd Compare January 26, 2022 16:52

arnetheduck added 3 commits January 30, 2022 08:58

fixup

0b4ebf6

* fix yields * dispose minIdStmt

oops

fba2cae

arnetheduck force-pushed the fin-block-db branch from 105dd34 to fba2cae Compare January 30, 2022 07:58

zah merged commit d583e8e into unstable Jan 30, 2022

zah deleted the fin-block-db branch January 30, 2022 16:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store finalized block roots in database (3s startup) #3320

Store finalized block roots in database (3s startup) #3320

arnetheduck commented Jan 25, 2022

github-actions bot commented Jan 25, 2022 •

edited

Loading

zah Jan 26, 2022

arnetheduck Jan 26, 2022

zah Jan 26, 2022

arnetheduck Jan 26, 2022

arnetheduck Jan 26, 2022 •

edited

Loading

zah Jan 27, 2022

zah Jan 27, 2022

arnetheduck Jan 28, 2022

zah Jan 28, 2022

arnetheduck Jan 28, 2022

zah Jan 29, 2022

arnetheduck Jan 29, 2022

Store finalized block roots in database (3s startup) #3320

Store finalized block roots in database (3s startup) #3320

Conversation

arnetheduck commented Jan 25, 2022

github-actions bot commented Jan 25, 2022 • edited Loading

Unit Test Results

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

arnetheduck Jan 26, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented Jan 25, 2022 •

edited

Loading

arnetheduck Jan 26, 2022 •

edited

Loading