Remove head tracker in favour of fork choice #1785

michaelsproul · 2020-10-19T06:05:40Z

Description

The head tracker contains redundant information that is already present in the fork choice block DAG. This is undesirable as it means there isn't just "one source of truth", and the two data structures need to be kept in sync across concurrent executions, which is highly non-trivial (see #1771 for more)

Steps to resolve

Write code to derive the list of heads from fork choice
Remove the HeadTracker struct from the beacon chain
Port the migration code to use the head data from fork choice
Work out how to reconcile the pruning of fork choice with pruning of the database. Possible approaches:
- Mark blocks deleted from disk as deleted in fork choice, until they are cleaned up by fork choice's own pruning mechanism
- Mirror the pruning of the on-disk database in fork choice (i.e. prune the two structures together)

The text was updated successfully, but these errors were encountered:

## Issue Addressed Closes #2028 Replaces #2059 ## Proposed Changes If writing to the database fails while importing a block, revert fork choice to the last version stored on disk. This prevents fork choice from being ahead of the blocks on disk. Having fork choice ahead is particularly bad if it is later successfully written to disk, because it renders the database corrupt (see #2028). ## Additional Info * This mitigation might fail if the head+fork choice haven't been persisted yet, which can only happen at first startup (see #2067) * This relies on it being OK for the head tracker to be ahead of fork choice. I figure this is tolerable because blocks only get added to the head tracker after successfully being written on disk _and_ to fork choice, so even if fork choice reverts a little bit, when the pruning algorithm runs, those blocks will still be on disk and OK to prune. The pruning algorithm also doesn't rely on heads being unique, technically it's OK for multiple blocks from the same linear chain segment to be present in the head tracker. This begs the question of #1785 (i.e. things would be simpler with the head tracker out of the way). Alternatively, this PR could just revert the head tracker as well (I'll look into this tomorrow).

dapplion · 2024-01-18T13:29:33Z

Some notes for non-michael readers and myself

HeadTracker is used for two things currently:

As optimisation to query heads quickly to serve API requests. Premature optimization IMO, or could be integrated into the fork-choice and does not require to be persisted on disk.
To optimize pruning hot data, more below

How to prune hot data

Blocks and states are persisted in the hot DB by root. To figure out what to prune you need to build a block DAG and compute what branches are non-viable / abandoned. Some ways to do it:

"Brute-force": stream all items in the DB's hot block column, build graph, prune.
With a HeadTracker: helps with tip discovery, then iterate each head chain to figure out if it's viable or not. With the current RootsIterator requires to load a full BeaconState every 8192 slots.
With fork-choice prune output: fork-choice prune can return a Vec of pruned proto-nodes, from which you can build a partial graph of abandoned chains, and prune.

The pruning routine must be crash safe, options 2 and 3 risk "forgetting" data or causing inconsistencies like #4773 if not implemented properly.

Brute-force

The current implementation uses the RootsIterator which loads a full BeaconState every 8192 slots. With a state of size of 100MB (likely more represented in memory) that's 12 KB per slot. Blinded beacon blocks are ~50KB? so that's not too far off. I don't see any immediate issues with such approach besides the others being more optimal. Even in the worse case, pruning runs on a background thread so streaming some hundred MB from the DB in the case of long non-finality does not sound too bad. Thoughts?

Using fork-choice

Using the fork-choice for pruning without holding its lock for too long may require persisting some prune helper data to the database. The desired order of operations is:

Aquire fork-choice write lock
Prune fork-choice
Prune database
Release fork-choice write lock
Persist pruned fork-choice

If the lock is not hold for this long and the node crashes during 3 then you lose track of pruned branches that will never be pruned. Another approach is to:

Aquire fork-choice write lock
Prune fork-choice
Persist pruning summary (i.e. pruned heads)
Release fork-choice write lock

Then in a background thread, and in an atomic operation

Prune database
Clear pruning summary

Here you are sort of storing a HeadTracker but only for prune data that is decoupled from the fork-choice.

HeadTracker but eagerly persisted

We can achieve better atomicity in the HeadTracker by persisting each head to the DB individually in separate keys. Heads are inserted in the HeadTracker in the existing atomic transaction during block import. Heads are pruned during the atomic operation of hot DB pruning. No action is necessary on shutdown. The pruning routine can stream all keys in that column range to iterate the HeadTracker.

This option allows to keep the pruning logic the same but remove the potential inconsistencies on shutdown.

michaelsproul · 2024-01-19T00:39:54Z

I don't mind the brute-force approach, but it would only be viable with tree-states. On stable, all the blocks are stored in the hot DB; they never get migrated to the freezer (like states do). So most DBs have like 30GB of blocks in that column. On tree-states, they get migrated, compressed and indexed by slot, so the hot DB only has ~64-256 on average when finality is happening (the 256 because we sometimes delay migration to avoid I/O).

I also quite like the pruning-summary & per-block head tracker. Maybe the pruning summary is the simplest for now? Or we wait until tree-states and do the brute force. I like the simplicity of not worrying about lock ordering.

michaelsproul · 2024-01-22T22:33:24Z

More reasons to delete:

Starting block is never added to head_tracker #4631
Hold HeadTracker lock until persisting to disk #5084 (see section about remaining atomicity issues)

michaelsproul added the t Consensus & Verification label Oct 19, 2020

paulhauner added the A1 label Nov 9, 2020

michaelsproul mentioned this issue Dec 8, 2020

[Merged by Bors] - Revert fork choice if disk write fails #2068

Closed

michaelsproul added consensus An issue/PR that touches consensus code, such as state_processing or block verification. and removed t Consensus & Verification labels Nov 9, 2022

michaelsproul mentioned this issue Jan 16, 2024

[DO NOT MERGE] Head tracker bug repro #5073

Closed

dapplion mentioned this issue Jan 18, 2024

[WIP] Drop HeadTracker dapplion/lighthouse#8

Draft

michaelsproul mentioned this issue Jan 22, 2024

Lighthouse space usage - Increasing dramastically from 87GB to 430GB in 7 days #4773

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove head tracker in favour of fork choice #1785

Remove head tracker in favour of fork choice #1785

michaelsproul commented Oct 19, 2020 •

edited

Loading

dapplion commented Jan 18, 2024

michaelsproul commented Jan 19, 2024

michaelsproul commented Jan 22, 2024

Remove head tracker in favour of fork choice #1785

Remove head tracker in favour of fork choice #1785

Comments

michaelsproul commented Oct 19, 2020 • edited Loading

Description

Steps to resolve

dapplion commented Jan 18, 2024

How to prune hot data

Brute-force

Using fork-choice

HeadTracker but eagerly persisted

michaelsproul commented Jan 19, 2024

michaelsproul commented Jan 22, 2024

michaelsproul commented Oct 19, 2020 •

edited

Loading