Mitigate archival node data size growth #6119

bowenwang1996 · 2022-01-19T00:53:55Z

Context can be found [here](https://near.zulipchat.com/#narrow/stream/308695-nearinc.2Fprivate/topic/archival.20node.20data.20size). TLDR: archival node data usage grows at a very fast pace and since we require node operators to use SSDs to operate an archival node, it creates a considerable amount of financial burden for them, which is highly undesirable and unsustainable. Some ideas to mitigate the growth include the following:

Move most of the archival node data usage to HDD and only use SSD for the head of the chain so that an archival node can keep up with the network. This could be implemented through garbage collection where instead of removing data, we write archival nodes' data to a separate storage.
Do not store data that can be reconstructed for archival nodes. For example, partial encoded chunks can be reconstructed from full chunks and we do not need to store them for archival nodes (behind what is needed for block processing). This alone can potentially save us 1/3 of the storage needed. When a request comes in for old partial encoded chunks, we can compute them on the fly.
(Longer-term) Create data snapshots periodically and do not store full history on archival nodes. The snapshot can be stored in a decentralized storage and its metadata can be posted on-chain. We can develop a separate service that serves historical data by routing its query to the relevant snapshot (effective shard historical data by epoch)

bowenwang1996 · 2022-01-19T00:54:20Z

cc @mina86 @posvyatokum

Weirdly, RocksDB C interface does not have a way to just set the `enabled` flag of the bottommost compression options so I’ve decided to replicate that interface in Rust as well. As far as I understand, this unfortunately means that to set bottommost compression one needs to set the type and then also use one of the other methods for configuring the compression. Issue: rust-rocksdb#260 Issue: near/nearcore#6119

Add recompress-storage command which reads all the data from storage and saves it in a new location effectively recompressing all the SST files. This is meant as a temporary tool to speed up migration to zstd compression. Without it, especially on archival nodes, recompressing data may take a long time as most SST files can stay untouched for epochs. Issue: #6119

Issue: near#6119

Issue: #6119

Add --clean-partial-chunks and --clear-trie-changes options to clear out the two respective columns. Data in ColPartialChunks can be recomputed and data in ColTrieChanges is only used by non-archival nodes and can be deleted when running archival node. Issue: near#6119

Add --clean-partial-chunks and --clear-trie-changes options to clear out the two respective columns. Data in ColPartialChunks can be recomputed and data in ColTrieChanges is only used by non-archival nodes and can be deleted when running archival node. Issue: near#6119 Issue: near#6242 Issue: near#6250

Issue: near#6119

Issue: #6119

When recompressing database of an archival node, skip ColPartialChunks, ColInvalidChunks and ColTrieChanges columns which can be safely deleted. Data in the first one can be reconstructed from ColChunks, ColInvalidChunks is only needed at head and the last is never read by archival nodes. Mostly for testing, if someone wants to keep those columns, offer --keep-partial-chunks, --keep-invalid-chunks and --keep-trie-changes switches. They are always on when dealing with non-archival node. Issue: #6119 Issue: #6242 Issue: #6250

This is commit 2030c75 upstream. Add recompress-storage command which reads all the data from storage and saves it in a new location effectively recompressing all the SST files. This is meant as a temporary tool to speed up migration to zstd compression. Without it, especially on archival nodes, recompressing data may take a long time as most SST files can stay untouched for epochs. Issue: near#6119

This is commit a83db11 upstream. Issue: near#6119

…r#6478) This is commit 04f580a upstream. Issue: near#6119

This is commit da7a465 upstream. When recompressing database of an archival node, skip ColPartialChunks, ColInvalidChunks and ColTrieChanges columns which can be safely deleted. Data in the first one can be reconstructed from ColChunks, ColInvalidChunks is only needed at head and the last is never read by archival nodes. Mostly for testing, if someone wants to keep those columns, offer --keep-partial-chunks, --keep-invalid-chunks and --keep-trie-changes switches. They are always on when dealing with non-archival node. Issue: near#6119 Issue: near#6242 Issue: near#6250

The target is used by recompress_storage. Initially the command used no target but then target was added which caused the log lines to be ignored by default. Issue: near#6119

Store::iter strips reference count from reference counted columns and writing returned value to a new database leads to corrupted data since the count is no longer present. The correct way to deal with raw values saved in the database is to use Store::iter_without_rc_logic. Switch recompress_storage to do that. Issue: near#6119

Store::iter strips reference count from reference counted columns and writing returned value to a new database leads to corrupted data since the count is no longer present. The correct way to deal with raw values saved in the database is to use Store::iter_without_rc_logic. Switch recompress_storage to do that. Issue: #6119

The target is used by recompress_storage. Initially the command used no target but then target was added which caused the log lines to be ignored by default. Issue: #6119

Introduce a test which verifies that recompress_storage command does not corrupt the database and that nodes can continue working after it’s used. Issue: #6119

Introduce a test which verifies that recompress-storage command doesn’t corrupt the database and that nodes can continue working after it’s used. Issue: #6119

Issue: near/nearcore#6119

exalate-issue-sync · 2022-10-27T10:56:31Z

Michał Nazarewicz commented:

Last point is now tracked in https://pagodaplatform.atlassian.net/browse/ND-162

bowenwang1996 added C-enhancement Category: An issue proposing an enhancement or a PR with one. A-storage Area: storage and databases T-node Team: issues relevant to the node experience team labels Jan 19, 2022

bowenwang1996 assigned mina86 Jan 19, 2022

mina86 mentioned this issue Jan 21, 2022

Support configuring bottom-most compression level rust-rocksdb/rust-rocksdb#590

Merged

bowenwang1996 added the P-high Priority: High label Jan 24, 2022

janewang assigned posvyatokum Feb 7, 2022

bowenwang1996 unassigned posvyatokum Mar 7, 2022

mina86 mentioned this issue Mar 17, 2022

neard: add recompress-storage command #6447

Merged

mina86 added a commit to mina86/nearcore that referenced this issue Mar 21, 2022

neard: document ‘recompress-storage’ command more

6460603

Issue: near#6119

mina86 mentioned this issue Mar 21, 2022

neard: document ‘recompress-storage’ command more #6469

Merged

mina86 added a commit to mina86/nearcore that referenced this issue Mar 22, 2022

tools: add recompress.sh tool for gcing and recompressing database

b7700e0

Issue: near#6119

mina86 mentioned this issue Mar 22, 2022

tools: add recompress.sh tool for gcing and recompressing database #6472

Closed

near-bulldozer bot pushed a commit that referenced this issue Mar 22, 2022

neard: document ‘recompress-storage’ command more (#6469)

a83db11

Issue: #6119

mina86 mentioned this issue Mar 23, 2022

neard: recompress_storage: clean out unnecessary columns #6477

Merged

mina86 added a commit to mina86/nearcore that referenced this issue Mar 23, 2022

neard: use u64 counters in recompress-storage to avoid overflows

0a9223c

Issue: near#6119

mina86 mentioned this issue Mar 23, 2022

neard: use u64 counters in recompress-storage to avoid overflows #6478

Merged

near-bulldozer bot pushed a commit that referenced this issue Mar 23, 2022

neard: use u64 counters in recompress-storage to avoid overflows (#6478)

04f580a

Issue: #6119

mina86 added a commit to mina86/nearcore that referenced this issue Apr 7, 2022

neard: document ‘recompress-storage’ command more (near#6469)

d4a87e2

This is commit a83db11 upstream. Issue: near#6119

mina86 added a commit to mina86/nearcore that referenced this issue Apr 7, 2022

neard: use u64 counters in recompress-storage to avoid overflows (nea…

cb611db

…r#6478) This is commit 04f580a upstream. Issue: near#6119

mina86 mentioned this issue Apr 8, 2022

neard: enable target:"recompress" info logs by default #6557

Merged

mina86 mentioned this issue Apr 8, 2022

recompress_storage: fix handling of rc columns #6565

Merged

mina86 added a commit that referenced this issue Apr 13, 2022

pytest: add end-to-end recompress_storage sanity test

f128fad

Introduce a test which verifies that recompress_storage command does not corrupt the database and that nodes can continue working after it’s used. Issue: #6119

mina86 added a commit that referenced this issue Apr 13, 2022

pytest: add end-to-end recompress_storage sanity test

1f9fc93

Introduce a test which verifies that recompress_storage command does not corrupt the database and that nodes can continue working after it’s used. Issue: #6119

mina86 mentioned this issue Apr 13, 2022

pytest: add end-to-end recompress-storage sanity test #6601

Merged

mina86 mentioned this issue Apr 20, 2022

Add ‘Recompressing archival node storage’ page near/node-docs#21

Merged

mina86 added a commit to near/node-docs that referenced this issue Apr 20, 2022

Add ‘Recompressing archival node storage’ page (#21)

d80d192

Issue: near/nearcore#6119

exalate-issue-sync bot added T-nodeX and removed T-node Team: issues relevant to the node experience team labels Jun 28, 2022

matklad added T-node Team: issues relevant to the node experience team and removed T-nodeX labels Aug 4, 2022

mina86 assigned posvyatokum and unassigned mina86 Oct 27, 2022

exalate-issue-sync bot closed this as completed Oct 27, 2022

gmilescu added the Node Node team label Oct 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mitigate archival node data size growth #6119

Mitigate archival node data size growth #6119

bowenwang1996 commented Jan 19, 2022 •

edited by exalate-issue-sync bot

Loading

bowenwang1996 commented Jan 19, 2022

exalate-issue-sync bot commented Oct 27, 2022

Mitigate archival node data size growth #6119

Mitigate archival node data size growth #6119

Comments

bowenwang1996 commented Jan 19, 2022 • edited by exalate-issue-sync bot Loading

bowenwang1996 commented Jan 19, 2022

exalate-issue-sync bot commented Oct 27, 2022

bowenwang1996 commented Jan 19, 2022 •

edited by exalate-issue-sync bot

Loading