trie/triedb/pathdb: improve dirty node flushing trigger #28426

rjl493456442 · 2023-10-27T11:13:39Z

This pull request fixes an edge case in state history management.

In the context of the path model, Geth maintains a list of state histories to facilitate the rollback of the persistent state when needed. To prevent uncontrolled expansion of state history, Geth also provides a mechanism to prune the oldest state histories.

Because Geth manages the state layer in a tree structure, whenever a new layer is piled on top, it will trigger the merging of bottom diff layer with disk layer. This merging operation also involves the construction of the corresponding state history of the bottom diff layer and the truncation of oldest state histories.

However, a potential issue can happen if an unclean shutdown occurs after persisting the state history but before flushing the cached states to disk. In this scenario, we've defined a recovery mechanism to truncate any excess state history above the disk layer during the next restart, ensuring that the state history always aligns with the disk layer.

Now, let's discuss the specific edge case we're addressing. When we flush a state history, this process implicitly triggers the removal of the oldest history objects (tail truncation). The concern is what happens if the new tail history object is even newer than the persistent state. In such a situation, the recovery mechanism fails after an unclean shutdown because all state histories are now newer than the persistent state, it's impossible to align the state history and disk layer anymore.

The fix for this edge case is to introduce a guarantee that the state history always cover the persistent state. To achieve this, we've added a new condition that enforces the flushing of the cached state if this guarantee is not met. Specifically, if the oldest history after tail truncation is higher than persistent state, forcibly flush the cached states before the tail truncation.

trie/triedb/pathdb/disklayer.go

karalabe · 2023-10-30T12:25:02Z

trie/triedb/pathdb/disklayer.go

 		return nil, err
 	}
+	// To remove outdated history objects from the end, we set the 'tail' parameter
+	// to 'oldest-1' due to the offset between the freezer index and the history ID.


An interesting implicit thing here is that nodebuffer.flush() will "surely" push the disk layer beyond oldest-1. This is kind of true, most of the time, since only the 128 diff layers re main after the flush and limit in theory is more like 90K.

Thus two questions/requests:

Would be nice to maybe mention this fact in the comments.

What happens if limit is configured to be 64? Perhaps we should forbid the limit being below the diff layer count (apart from 0 meaning infinite)?

Ah, no oldest = bottom.stateID() - limit + 1, so flushing everything will move the disk layer to bottom.stateID().

In that case oldest - 1 will be bottom.stateID() - limit + 1 - 1 == bottom.stateID() - limit. So anything above 0 limit should be ok.

karalabe · 2023-10-30T12:25:29Z

trie/triedb/pathdb/disklayer.go

+		if err != nil {
+			return nil, err
+		}
+		log.Debug("Prune state history", "number", pruned)


Also perhaps "number", use "stateid". We usually use number for block numbers and it's going to be confusing.

The number here prefers to the number of history get pruned.

log.Debug("Pruned state history", "items", pruned, "tailid", oldest) i will fix the log with this.

karalabe

LGTM

* trie/triedb/pathdb: improve dirty node flushing trigger * trie/triedb/pathdb: add tests * trie/triedb/pathdb: address comment

…reum#28426)" This reverts commit e47ad2f.

* trie/triedb/pathdb: improve dirty node flushing trigger * trie/triedb/pathdb: add tests * trie/triedb/pathdb: address comment

trie/triedb/pathdb: improve dirty node flushing trigger

665597c

rjl493456442 force-pushed the fix-state-freezer branch from ddb1dc8 to 665597c Compare October 27, 2023 11:16

trie/triedb/pathdb: add tests

f6a2209

rjl493456442 marked this pull request as ready for review October 27, 2023 12:03

rjl493456442 added this to the 1.13.5 milestone Oct 30, 2023

fynnss reviewed Oct 30, 2023

View reviewed changes

trie/triedb/pathdb/disklayer.go Show resolved Hide resolved

karalabe reviewed Oct 30, 2023

View reviewed changes

trie/triedb/pathdb: address comment

bb59ab4

karalabe approved these changes Oct 31, 2023

View reviewed changes

karalabe merged commit ea2e66a into ethereum:master Oct 31, 2023
2 checks passed

This was referenced Oct 31, 2023

eth, trie/triedb/pathdb: pbss patches bnb-chain/bsc#1956

Closed

eth, trie/triedb/pathdb: pbss patches bnb-chain/bsc#1955

Merged

cherry pick pbss patches from go-ethereum bnb-chain/bsc#1962

Merged

devopsbo3 added a commit to HorizenOfficial/go-ethereum that referenced this pull request Nov 10, 2023

Revert "trie/triedb/pathdb: improve dirty node flushing trigger (ethe…

0a25fa0

…reum#28426)" This reverts commit e47ad2f.

devopsbo3 added a commit to HorizenOfficial/go-ethereum that referenced this pull request Nov 10, 2023

Revert "trie/triedb/pathdb: improve dirty node flushing trigger (ethe…

99dad52

…reum#28426)" This reverts commit e47ad2f.

BrewTestBot mentioned this pull request Nov 14, 2023

ethereum 1.13.5 Homebrew/homebrew-core#154261

Merged

Francesco4203 mentioned this pull request Oct 29, 2024

trie/triedb/pathdb, core/rawdb: pbss fix release v1.13.5 (corner-cases in path scheme state management) axieinfinity/ronin#619

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trie/triedb/pathdb: improve dirty node flushing trigger #28426

trie/triedb/pathdb: improve dirty node flushing trigger #28426

rjl493456442 commented Oct 27, 2023 •

edited

Loading

karalabe Oct 30, 2023

karalabe Oct 30, 2023

karalabe Oct 30, 2023

karalabe Oct 30, 2023

rjl493456442 Oct 30, 2023

rjl493456442 Oct 30, 2023

karalabe left a comment

trie/triedb/pathdb: improve dirty node flushing trigger #28426

trie/triedb/pathdb: improve dirty node flushing trigger #28426

Conversation

rjl493456442 commented Oct 27, 2023 • edited Loading

karalabe Oct 30, 2023

Choose a reason for hiding this comment

karalabe Oct 30, 2023

Choose a reason for hiding this comment

karalabe Oct 30, 2023

Choose a reason for hiding this comment

karalabe Oct 30, 2023

Choose a reason for hiding this comment

rjl493456442 Oct 30, 2023

Choose a reason for hiding this comment

rjl493456442 Oct 30, 2023

Choose a reason for hiding this comment

karalabe left a comment

Choose a reason for hiding this comment

rjl493456442 commented Oct 27, 2023 •

edited

Loading