
[QUESTION] Curious case of state cache #4502

Closed
vgorkavenko opened this issue Jul 13, 2023 · 4 comments

@vgorkavenko

Description

We would like to clarify the behaviour of the state cache introduced in this PR.

Please take a look at this case:

```
     unfinalized              reorg event       finalized
     epoch 100000                  v           epoch 100000
----------|------------------------|----------------|-------------> time
          ^                                         ^
  state is requested                          state is requested
response cached by slot number        response is equal to cached (0x123...00)
 with state data 0x123...00          but should be 0x123...01 after finalization
```

Is this possible, even if we request the state by hash?
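
To make the concern concrete, here is a minimal sketch (hypothetical types and names, not Lighthouse's actual cache) of the failure mode we are asking about: a cache keyed by slot alone and populated before finalization keeps serving the pre-reorg state.

```rust
use std::collections::HashMap;

// Hypothetical stand-ins for illustration only.
type Slot = u64;
type StateRoot = [u8; 32];

struct SlotKeyedCache {
    by_slot: HashMap<Slot, StateRoot>,
}

impl SlotKeyedCache {
    fn new() -> Self {
        Self { by_slot: HashMap::new() }
    }

    /// Return the cached root for `slot`, loading and caching it on a miss.
    fn get_or_load(&mut self, slot: Slot, load: impl FnOnce() -> StateRoot) -> StateRoot {
        *self.by_slot.entry(slot).or_insert_with(load)
    }
}

fn main() {
    let mut cache = SlotKeyedCache::new();
    let slot: Slot = 3_200_000;

    // Before finalization: the canonical state at this slot is 0x123...00.
    let before = cache.get_or_load(slot, || [0x00; 32]);

    // After the reorg the canonical state at the same slot is 0x123...01,
    // but a cache keyed only by slot still serves the stale entry.
    let after = cache.get_or_load(slot, || [0x01; 32]);

    assert_eq!(before, after); // stale hit: the reorg is invisible to the cache
}
```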

@michaelsproul
Member

Shouldn't be possible. That cache only holds states from the freezer DB, which are finalized and can't be reorged. Did you see this in the wild?
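
To put that invariant in code (a minimal sketch with a hypothetical `split_slot` marking the hot/cold database boundary, not the actual implementation):

```rust
type Slot = u64;

/// Only states at or below the finalized split are ever admitted to the
/// freezer-backed cache, so a cached entry can never be reorged out.
fn is_cacheable(state_slot: Slot, split_slot: Slot) -> bool {
    state_slot <= split_slot
}

fn main() {
    let split_slot: Slot = 3_194_880; // example finalized boundary
    assert!(is_cacheable(3_190_000, split_slot)); // finalized: safe to cache
    assert!(!is_cacheable(3_200_000, split_slot)); // unfinalized: never cached
}
```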

@vgorkavenko
Author

But it depends on slots_per_restore_point, am I right?

We observed inconsistent finalized-state responses across multiple hosts (four responded with the same data, and only one got it wrong) and are trying to figure out the cause. As soon as we have the details, I'll share them.

@michaelsproul
Member

> But it depends on slots_per_restore_point, am I right?

Yeah, the layout of states on disk depends on slots-per-restore-point. The restore points determine how many blocks get replayed.
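
As a rough sketch of that relationship (hypothetical helper names, and assuming the long-standing default of 8192 slots per restore point), loading a state at an arbitrary slot means loading the nearest prior restore point and replaying up to the difference in slots:

```rust
type Slot = u64;

/// Slot of the most recent restore point at or before `slot`.
fn restore_point_slot(slot: Slot, slots_per_restore_point: Slot) -> Slot {
    (slot / slots_per_restore_point) * slots_per_restore_point
}

/// Upper bound on the number of blocks replayed on top of the restore
/// point to reach `slot` (skipped slots have no block to apply).
fn slots_to_replay(slot: Slot, slots_per_restore_point: Slot) -> Slot {
    slot - restore_point_slot(slot, slots_per_restore_point)
}

fn main() {
    let sprp: Slot = 8192; // default --slots-per-restore-point
    let slot: Slot = 3_194_900;
    assert_eq!(restore_point_slot(slot, sprp), 3_194_880);
    assert_eq!(slots_to_replay(slot, sprp), 20);
}
```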

> We observed inconsistent finalized-state responses across multiple hosts (four responded with the same data, and only one got it wrong)

Ah, that sounds like it might be this bug: #3011. We've looked long and hard for the root cause of that bug without finding anything, and to be honest we'll probably never find it. We are in the process of overhauling our database and replacing it with something better, and have an alpha of that here: https://github.com/sigp/lighthouse/releases/tag/v4.2.990-exp. Even though it's experimental, it currently has fewer known bugs than stable (which is still affected by #3011), so it might be worth adding to your infra.

@michaelsproul
Member

Closing as stale, and soon-to-be-resolved by tree-states
