
pruning=everything causes db corruption #10352

Closed · Tracked by #14 · Fixed by #11177
ValarDragon opened this issue Oct 13, 2021 · 6 comments

Comments

@ValarDragon
Contributor

Summary of Bug

It has been repeatedly reported that `pruning=everything` causes DB corruption for nodes across different Cosmos chains. The corruption surfaces as `failed to load latest version: failed to load store: wanted to load target 1488419 but only found up to 0`. Presumably this comes from restarting a node that, due to some issue, has not kept even the latest state.

Version

All versions on the v0.42.x line. I am unsure about v0.44.x chains, as I don't actively work on any v0.44.x release chains at the moment.

Steps to Reproduce

Run a node with `pruning=everything` and occasionally stop and restart it; the corruption will eventually occur.
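For reference, this is the setting in question as it appears in app.toml (a minimal excerpt; surrounding keys omitted):

```toml
# app.toml
# The strategy implicated in this issue: prune all historical state,
# keeping nothing beyond what the node needs for the current height.
pruning = "everything"
```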

Suggested fix

Perhaps `pruning=everything` should be equivalent to `keep-recent=1` or `keep-recent=2`? Or is there another solution here that would make it safer / less prone to DB corruption? FWIW, on a chainlayer pruned snapshot that got corrupted, the right amount of state data was still there. I did not inspect how the saved data differed from that of other nodes (in part because I don't know of a convenient framework for decoding LevelDB entries).


For Admin Use

  • Not duplicate issue
  • Appropriate labels applied
  • Appropriate contributors tagged
  • Contributor assigned/self-assigned
@alexanderbez
Contributor

Yeah, I've also received numerous reports of this... it smells like an off-by-one type of situation if I had to guess (I've implemented most of this logic). I would say this is pretty high priority.

@AmauryM any ideas who has bandwidth to look into this?

@amaury1093
Contributor

@alexanderbez do you think you have bandwidth to tackle this? If not, we can maybe find someone on the Regen team.

@alexanderbez
Contributor

Yeah you can assign it to me, but I don't know when I'll be able to get to it. It might take me a few weeks.

alexanderbez self-assigned this Oct 28, 2021
@alexanderbez
Contributor

In the meantime, I would recommend a custom setting where you keep only a handful of recent blocks, say 100, and prune every 100 blocks.
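In app.toml terms, that workaround would look roughly like this (a sketch assuming the standard pruning keys; adjust the values to taste):

```toml
# app.toml -- interim workaround until pruning=everything is fixed
pruning = "custom"

# keep the last 100 heights as a safety buffer
pruning-keep-recent = "100"

# don't additionally retain periodic snapshot heights
pruning-keep-every = "0"

# run the pruning pass every 100 blocks
pruning-interval = "100"
```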

@alexanderbez
Contributor

Proposal: have `pruning=everything` actually keep the last two blocks, always, as a buffer. A somewhat lazy approach, but I believe this should do the trick rather than spending countless hours debugging where the "off-by-one" error might be.
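In config terms, the proposal would make `pruning=everything` behave roughly like the following custom setting (a sketch of the intended semantics, not the actual implementation; the interval value is illustrative):

```toml
# what pruning = "everything" would effectively become:
pruning = "custom"
pruning-keep-recent = "2"  # always keep the last two heights as a buffer
pruning-keep-every = "0"
pruning-interval = "10"    # illustrative; pruning would still run periodically
```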

Thoughts @ValarDragon ?

@ValarDragon
Contributor Author

ValarDragon commented Feb 11, 2022

100% agreed, and we can just file an issue for figuring out what the actual problem was over the long term.
