Add freezer DB debugging tools #3511

Closed


michaelsproul
Member

Issue Addressed

Related to:

Proposed Changes

This PR adds some extra capabilities to lighthouse db inspect for examining freezer databases.

The main one for detecting corruption is the --output gaps option, which searches for gaps in the linear arrays of block roots, state roots, etc.:

./lighthouse db inspect --datadir /mnt/lighthouse/lighthouse/ --freezer --column bbr --output gaps
Aug 26 07:27:34.751 INFO Running database manager for mainnet network
Aug 26 07:27:39.232 INFO Hot-Cold DB initialized                 split_state: 0xd415ec08990df5be242cf0e2c7f32ee8b7419edb1c214c33eb71eaa1eaad99e7, split_slot: 4556160
No gaps found!
Num keys: 35586
Total: 145756192 bytes
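
To make the idea of a gap search concrete, here is a minimal sketch of the kind of check involved. It is not the code from this PR: the `Gap` struct, the `find_gaps` function, and the meaning given to `offset` (0-based position of the key that follows the gap) are all assumptions made for illustration, and the input keys are toy values.

```rust
/// A gap in a sequence of numeric keys that is expected to be contiguous.
/// (Hypothetical type, for illustration only.)
#[derive(Debug, PartialEq, Eq)]
pub struct Gap {
    /// Last key seen before the gap.
    pub prev_key: u64,
    /// First key seen after the gap.
    pub next_key: u64,
    /// 0-based position of `next_key` in iteration order (assumed meaning).
    pub offset: usize,
}

/// Walk the keys in iteration order and record every place where two
/// consecutive keys differ by more than one.
pub fn find_gaps(keys: &[u64]) -> Vec<Gap> {
    let mut gaps = Vec::new();
    for (i, pair) in keys.windows(2).enumerate() {
        let (prev_key, next_key) = (pair[0], pair[1]);
        // saturating_sub avoids overflow and treats out-of-order keys as "no gap".
        if next_key.saturating_sub(prev_key) > 1 {
            gaps.push(Gap {
                prev_key,
                next_key,
                offset: i + 1,
            });
        }
    }
    gaps
}
```

With the toy input `[0, 1, 2, 7, 8]` this reports a single gap between keys 2 and 7 at offset 3, while a contiguous input such as `[0, 1, 2, 3]` reports none.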

The 4 columns of interest are:

  • bbr: block roots. Should be gap-free whenever historic block sync has completed. Should contain 1 gap otherwise.
  • bsr: state roots. Should be gap-free whenever state reconstruction has completed. Should contain 1 gap otherwise.
  • bhr: historic roots. Same gap-status as state roots.
  • brm: randao mixes. Same gap-status as state roots.
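
Checking all four columns in one go could be scripted roughly as follows. This is a sketch under assumptions: `gap_check_args` and `run_all_gap_checks` are hypothetical helpers, the binary path and datadir are supplied by the caller, and only the flags shown in the command above are used. Lighthouse must be stopped while the checks run.

```rust
use std::process::Command;

/// The four freezer columns listed above.
pub const FREEZER_COLUMNS: [&str; 4] = ["bbr", "bsr", "bhr", "brm"];

/// Build the argument list for one gap check (hypothetical helper).
pub fn gap_check_args(datadir: &str, column: &str) -> Vec<String> {
    [
        "db", "inspect", "--datadir", datadir, "--freezer", "--column", column, "--output",
        "gaps",
    ]
    .iter()
    .map(|s| s.to_string())
    .collect()
}

/// Run the gap check once per column and return (column, captured stdout) pairs.
/// `lighthouse_bin` is the path to the binary, e.g. "./lighthouse".
pub fn run_all_gap_checks(
    lighthouse_bin: &str,
    datadir: &str,
) -> std::io::Result<Vec<(String, String)>> {
    let mut results = Vec::new();
    for column in FREEZER_COLUMNS {
        let output = Command::new(lighthouse_bin)
            .args(gap_check_args(datadir, column))
            .output()?;
        let stdout = String::from_utf8_lossy(&output.stdout).into_owned();
        results.push((column.to_string(), stdout));
    }
    Ok(results)
}
```

Keeping the argument construction separate from the process call makes the command line easy to inspect before anything touches the database.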

On my node with historic states, all 4 of these columns are gap-free, as they should be, indicating that my database (probably) isn't corrupt. Next week or in a follow-up PR I'll add some tools for verifying the actual data stored.

@michaelsproul michaelsproul added the work-in-progress PR is a work-in-progress label Aug 26, 2022
Member Author

michaelsproul commented Aug 26, 2022

It would be great if people who've been playing around with state reconstruction and noticing corruption issues could try this out. CC: @xrchz @JustinZal. I'd be very interested to see what the gap-checker thinks of your databases (you need to stop Lighthouse while running these checks).

If you do find irregular gaps, and I suspect you will, you can try looking at them with the --output values mode. You can fiddle with the --skip and --limit options to reduce the output to a manageable size (I wouldn't recommend running with --output values otherwise).

E.g. on my partly synced node I can do:

$ lighthouse --network mainnet --datadir /media/michael/silver/lh/v3 db inspect --freezer --column bbr --output gaps
Aug 26 07:39:28.665 INFO Running database manager for mainnet network
Aug 26 07:39:28.763 INFO Hot-Cold DB initialized                 split_state: 0xf3abdf015aaf26a68e131313e208955186b89632eb75078f2d3bb34c2756ee09, split_slot: 4556224
gap between keys 1 and 17733 (offset: 2)
Num keys: 17855
Total: 73130016 bytes

and then see the partly filled-in chunk with:

$ lighthouse --network mainnet --datadir /media/michael/silver/lh/v3 db inspect --freezer --column bbr --output values --skip 2 --limit 1 
Aug 26 07:40:13.053 INFO Running database manager for mainnet network
Aug 26 07:40:13.148 INFO Hot-Cold DB initialized                 split_state: 0xf3abdf015aaf26a68e131313e208955186b89632eb75078f2d3bb34c2756ee09, split_slot: 4556224

Num keys: 1
Total: 4096 bytes
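
For interpreting the byte counts, a small piece of arithmetic may help. The sketch below assumes each entry in the column is a 32-byte root; under that assumption the 4096-byte value above has room for 128 roots. `ROOT_SIZE_BYTES` and `roots_in_value` are hypothetical names used only for this illustration.

```rust
/// Size of one root in bytes (assumption: roots are 32-byte hashes).
pub const ROOT_SIZE_BYTES: usize = 32;

/// Number of whole roots that fit in a value of `value_len` bytes,
/// or `None` if the length is not a multiple of the root size.
pub fn roots_in_value(value_len: usize) -> Option<usize> {
    if value_len % ROOT_SIZE_BYTES == 0 {
        Some(value_len / ROOT_SIZE_BYTES)
    } else {
        None
    }
}
```

Here `roots_in_value(4096)` yields `Some(128)`. A length that is not a multiple of 32 returns `None`, which under the same assumption would be worth a closer look.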

@michaelsproul
Member Author

Rolling this into tree-states as well.
