datastore: disable compaction (fixes 2x memory issue) #1535

teh-cmc · 2023-03-08T16:39:39Z

Disables bucket-level compaction aka. archiving until we support batching, thereby fixing the 2x memory issue. See below for rationale.

This doesn't impact garbage collection or performance in any noticeable way.

Before:

$ cargo r -p test_image_memory
Logged 100 2048x1024 RGBA images = 800 MiB
1.6 GiB RAM used

After:

$ cargo r -p test_image_memory
Logged 100 2048x1024 RGBA images = 800 MiB
831 MiB RAM used
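
For reference, the 800 MiB payload figure can be reproduced with a little arithmetic. This is only a sketch: it assumes one byte per channel (4 bytes per RGBA pixel), which is consistent with the logged total, and the helper names are made up for illustration.

```rust
/// Bytes in one mebibyte.
const MIB: u64 = 1024 * 1024;

/// Size in bytes of one uncompressed image, assuming one byte per channel.
fn image_bytes(width: u64, height: u64, channels: u64) -> u64 {
    width * height * channels
}

/// Total payload in MiB for `count` images of the given shape.
fn payload_mib(count: u64, width: u64, height: u64, channels: u64) -> u64 {
    count * image_bytes(width, height, channels) / MIB
}
```

Under that assumption each 2048x1024 RGBA image is 8 MiB, so `payload_mib(100, 2048, 1024, 4)` gives 800. The "before" reading of 1.6 GiB is roughly twice that payload, while the "after" reading of 831 MiB is about 4% above it.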

Closes #1397

The original long form rationale:

Alright so I've continued digging down the LogMsg-less rabbit hole following our discussion, until I hit a dead-end: we have a few places in the code where we actually expect to be able to fetch any message (both control and data) by MsgId so that we can check whether they have been purged or straight up display them in the UI.

Similar to the issues we've had with garbage collection, this is yet another instance that shows the disconnect between the viewer, which drives most of its logic using MsgIds, and the datastore, which has absolutely no clue what a MsgId is (because upon insertion, a message is reduced to its atomic primitives (components) and spread across many different storages (per entity + per timeline) and buckets (per space + per time): there simply is no such thing as a message anymore).
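
To make the disconnect concrete, here is a toy model of that decomposition. It is not the actual store layout: `ToyMsg`, `ToyStore`, and the `(entity, timeline, component)` key are all hypothetical simplifications, and buckets are left out entirely.

```rust
use std::collections::{BTreeMap, HashMap};

/// Hypothetical message: one id plus everything that was logged together.
struct ToyMsg {
    msg_id: u64,
    entity: String,
    /// (timeline name, time value) pairs.
    times: Vec<(String, i64)>,
    /// (component name, serialized value) pairs.
    components: Vec<(String, Vec<u8>)>,
}

/// Toy store: one time-sorted table per (entity, timeline, component).
#[derive(Default)]
struct ToyStore {
    tables: HashMap<(String, String, String), BTreeMap<i64, Vec<u8>>>,
}

impl ToyStore {
    /// Splits the message into its components and scatters them across tables.
    fn insert(&mut self, msg: ToyMsg) {
        // The id has nowhere to go: no table is keyed on it, so it is dropped.
        let _dropped_id = msg.msg_id;
        for (timeline, time) in &msg.times {
            for (component, value) in &msg.components {
                let key = (msg.entity.clone(), timeline.clone(), component.clone());
                self.tables.entry(key).or_default().insert(*time, value.clone());
            }
        }
    }

    /// Latest value at or before `time`, the kind of query such a layout serves well.
    fn latest_at(&self, entity: &str, timeline: &str, component: &str, time: i64) -> Option<&Vec<u8>> {
        let key = (entity.to_owned(), timeline.to_owned(), component.to_owned());
        self.tables.get(&key)?.range(..=time).next_back().map(|(_, value)| value)
    }
}
```

Once `insert` returns, the toy store can answer "what was this component at time T?" but has no way to answer "give me message 42", which is the shape of the problem described above.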

Contrary to the GC situation though, I don't see any hack that could help us work around the issue in this case (and there are already too many hacks being piled on tbh). The only solution I can think of would be to make MsgIds actual first-class citizens in the store (i.e. create dedicated bi-directional indices to map back and forth between data and MsgIds), which coincidentally would fix all the issues with GC too, since the underlying problem is the same: bridging the MsgId-driven viewer with the much lower-level datastore, where such a construct is completely ambiguous atm.
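
For illustration only, a bi-directional index of that kind could look roughly like the sketch below. Everything here is hypothetical (`MsgIndex`, the `u64` id, the `(table, row)` address); it shows the general shape of the idea, not a design for the actual store.

```rust
use std::collections::HashMap;

/// Hypothetical identifiers; neither is taken from the actual codebase.
type MsgId = u64;
/// Where one piece of data ended up: (table index, row index).
type RowAddr = (usize, usize);

/// Maps messages to the rows they produced, and rows back to their message.
#[derive(Default)]
struct MsgIndex {
    rows_of_msg: HashMap<MsgId, Vec<RowAddr>>,
    msg_of_row: HashMap<RowAddr, MsgId>,
}

impl MsgIndex {
    /// Records that `row` was written on behalf of `msg_id`.
    fn record(&mut self, msg_id: MsgId, row: RowAddr) {
        self.rows_of_msg.entry(msg_id).or_default().push(row);
        self.msg_of_row.insert(row, msg_id);
    }

    /// All rows written for a message (empty if unknown or purged).
    fn rows(&self, msg_id: MsgId) -> &[RowAddr] {
        self.rows_of_msg.get(&msg_id).map(Vec::as_slice).unwrap_or_default()
    }

    /// The message a row belongs to, if it is still indexed.
    fn msg(&self, row: RowAddr) -> Option<MsgId> {
        self.msg_of_row.get(&row).copied()
    }

    /// Removes a message from both directions and returns the rows to free.
    fn purge(&mut self, msg_id: MsgId) -> Vec<RowAddr> {
        let rows = self.rows_of_msg.remove(&msg_id).unwrap_or_default();
        for row in &rows {
            self.msg_of_row.remove(row);
        }
        rows
    }
}
```

Even in this toy form, every insertion and every purge has to keep two maps in sync with the data itself, which hints at why this is not a trivial change.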

Now, I don't think we should do that because A) that is definitely not trivial and B) it will all go to waste as soon as we implement batching, because at that point MsgIds will become mostly meaningless and we're gonna have to rethink how we do things in the viewer anyway.

So what I propose is this: I'll disable bucket compaction (so-called "archives") the ugly way for now (it can be done without impacting GC), which will fix the double-memory issue, and punt on all of this until we have batching implemented and working.

See also this internal slack thread.

jleibs

Should we consider handling this with a change to the default DataStoreConfig instead so that it could be enabled more easily for testing going forward?
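
A minimal sketch of what a config-driven switch could look like, assuming a boolean flag that defaults to off. The struct and field names below are hypothetical stand-ins; the actual fields of DataStoreConfig are not shown in this thread.

```rust
/// Hypothetical config; the field name is an assumption for illustration.
#[derive(Clone, Debug, PartialEq, Eq)]
struct ToyStoreConfig {
    /// When false, full buckets are kept as-is instead of being compacted ("archived").
    enable_compaction: bool,
}

impl Default for ToyStoreConfig {
    /// Compaction is off unless a caller opts in.
    fn default() -> Self {
        Self { enable_compaction: false }
    }
}

/// Decides whether a bucket that just filled up should be compacted.
fn should_compact(config: &ToyStoreConfig, bucket_is_full: bool) -> bool {
    config.enable_compaction && bucket_is_full
}
```

With this shape, a test can construct `ToyStoreConfig { enable_compaction: true }` to exercise the compaction path without touching the write code, while everything using `ToyStoreConfig::default()` keeps it disabled.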

jleibs · 2023-03-09T02:25:39Z

crates/re_arrow_store/src/store_write.rs

+            // TODO(cmc): Compaction is disabled until we implement batching.
+            // See https://github.com/rerun-io/rerun/pull/1535 for rationale.
+            //
+            // This has no noticeable impact on performance.


Can we back up this claim with data?

added benchmarks

disable compaction until we support batching

42de80c

teh-cmc added the 🪳 bug (Something isn't working) and ⛃ re_datastore (affects the datastore itself) labels Mar 8, 2023

teh-cmc mentioned this pull request Mar 8, 2023

datastore: serialize back into a stream of MsgBundles #1527

Closed

link

8c2ebe1

jleibs approved these changes Mar 9, 2023

teh-cmc added 2 commits March 9, 2023 23:10

Merge branch 'main' into cmc/disable_archives

621bcf8

make it configurable

90bb1de

teh-cmc merged commit baad9be into main Mar 9, 2023

teh-cmc deleted the cmc/disable_archives branch March 9, 2023 22:34

This was referenced Mar 9, 2023

Save .rrd from store instead of saving LogMsg:es. #1394

Closed

latest_at very slow (O(N)?) #1545

Closed

LogDb: dont split on index bucket size #1558

Merged

Fix garbage collection #1560

Merged

This was referenced Mar 27, 2023

Add a script that generates a changelog from recent PRs and their labels #1718

Merged

Release 0.4.0 - Outlines, web viewer and performance improvements #1722

Merged
