timely-util: implement an accounting container builder #27811
Conversation
```diff
-let local = Vec::with_capacity((capacity + Self::CHUNK - 1) / Self::CHUNK);
+let chunks = (capacity + Self::CHUNK - 1) / Self::CHUNK;
+let mut local = vec![];
+local.resize_with(chunks, || Array::with_capacity(Self::CHUNK));
```
@antiguru the code here was not actually reserving the size in the arrays, so I'm doing that here. I have updated the push method to look at the capacity to decide whether a new chunk is needed. This also allows us to reserve space from SizableContainer::reserve.
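For concreteness, a minimal sketch of that push logic, assuming a chunked container made of fixed-size chunks plus a logical length counter. The type, the chunk size, and the length-based bookkeeping are illustrative stand-ins, not the actual timely-util code:

```rust
/// Illustrative chunked stack: fixed-size chunks plus a logical length.
/// A sketch of the idea described above, not the timely-util implementation.
struct ChunkedStack<T> {
    local: Vec<Vec<T>>,
    length: usize,
}

impl<T> ChunkedStack<T> {
    /// Illustrative chunk size; the real value differs.
    const CHUNK: usize = 64 * 1024;

    /// Reserve enough chunks up front so that `capacity` items fit
    /// without further allocation.
    fn with_capacity(capacity: usize) -> Self {
        let chunks = (capacity + Self::CHUNK - 1) / Self::CHUNK;
        let mut local = Vec::with_capacity(chunks);
        local.resize_with(chunks, || Vec::with_capacity(Self::CHUNK));
        Self { local, length: 0 }
    }

    /// Allocate a new chunk only when the existing (possibly preallocated)
    /// capacity is exhausted, so reserved chunks actually get reused.
    fn push(&mut self, item: T) {
        if self.length == self.local.len() * Self::CHUNK {
            self.local.push(Vec::with_capacity(Self::CHUNK));
        }
        self.local[self.length / Self::CHUNK].push(item);
        self.length += 1;
    }
}
```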
It was somewhat intentional to not preallocate the arrays. The cost of getting an array right now vs later should be the same, and getting it later opens up the opportunity to recycle allocations more eagerly.
Hm, that's surprising! Isn't the purpose of a with_capacity API to ensure that I can insert X items without extra allocations? I can certainly revert this and make the reserve function a no-op (i.e. just add capacity to hold the number of chunks needed), but that seems like the wrong implementation.
Regarding cost, while there are no reallocations of the actual data here, there is potentially a temporal cost, because work that could be done now is deferred to the future, and now might be a more convenient time.
In general, with a with_capacity API all of these considerations can be handled at the calling site, by passing a capacity of 0 or the actual value depending on when the caller wants the allocations to happen.
I would say it's because with_capacity is an under-defined API. We have to support it since it's part of the interface, and this implementation knows more about itself than the caller does. At the same time, it might be the best we can offer.
I think it's very important that we do not preallocate: imagine we're forming a new largest batch in a spine, and constructing it takes a long time. We want to minimize the resource footprint during construction to avoid running out of resources. Additionally, differential chooses a capacity equal to the sum of the input sizes, because it has to assume that nothing will cancel. Often we end up with less than the sum, and we don't want to be stuck carrying unused memory with us, or needing to reallocate.
"We want to minimize the resource footprint during construction to avoid running out of resources"
"Additionally, differential chooses a capacity of the sum of input sizes"
I'm still a bit confused, because these two seem contradictory. If DD doesn't want preallocation to happen, why call with_capacity with a large value? It sounds like DD needs to use with_capacity(0), and everything will work out fine when with_capacity does what it says on the tin. What am I missing?
I reverted to the previous capacity behavior to unblock this PR, but something doesn't feel right about how DD is using this API.
```rust
}

fn preferred_capacity() -> usize {
    if ENABLE_CHUNKED_STACK.load(std::sync::atomic::Ordering::Relaxed) {
```
You or I should check whether the two branches return different values. I think they should not because changing the preferred capacity at run time is a case I didn't think about and can cause undesired effects.
They both end up in timely_container::buffer::preferred_capacity.
Sorry, I intended to make this a stronger statement: it should just call timely::container::buffer::preferred_capacity, to protect against someone changing the preferred capacity and things suddenly starting to crash or OOM. preferred_capacity should probably be a const fn.
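As a sketch of that stronger statement, with the upstream helper written as a hypothetical stand-in (the real function lives under timely's container/buffer module; its exact path and signature are not verified here):

```rust
// Hypothetical stand-in for timely's preferred-capacity helper; the real one
// lives under timely_container::buffer and may be generic and/or const.
const fn upstream_preferred_capacity() -> usize {
    1024
}

// Both code paths delegate to the same source of truth, so flipping the
// ENABLE_CHUNKED_STACK flag at run time cannot change the preferred capacity.
const fn preferred_capacity() -> usize {
    upstream_preferred_capacity()
}
```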
No worries, I will put up a new PR to fix this.
Motivation
This PR adds a container builder wrapper that keeps track of the heap-allocated bytes produced so far. This information can be used by operators to decide when it is a good time to yield.
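As a rough illustration of the idea (the trait shape, names, and the size estimate below are simplified stand-ins, not the actual timely ContainerBuilder API or this PR's implementation):

```rust
/// Simplified stand-in for a container builder: it accepts items and hands
/// back finished containers. The real interface is timely's ContainerBuilder.
trait SimpleBuilder {
    type Item;
    fn push(&mut self, item: Self::Item);
    /// Returns a finished batch of items, if one is ready.
    fn finish(&mut self) -> Option<Vec<Self::Item>>;
}

/// Wrapper that forwards to an inner builder while keeping a running total of
/// the heap bytes it has produced, so an operator can decide when to yield.
struct AccountedBuilder<B: SimpleBuilder> {
    inner: B,
    bytes_produced: usize,
}

impl<B: SimpleBuilder> AccountedBuilder<B> {
    fn new(inner: B) -> Self {
        Self { inner, bytes_produced: 0 }
    }

    fn push(&mut self, item: B::Item) {
        self.inner.push(item);
    }

    fn finish(&mut self) -> Option<Vec<B::Item>> {
        let batch = self.inner.finish()?;
        // Crude size estimate for illustration; a real implementation would
        // use a proper heap-size accounting mechanism.
        self.bytes_produced += batch.capacity() * std::mem::size_of::<B::Item>();
        Some(batch)
    }

    /// Heap bytes produced so far, e.g. to compare against a yield budget.
    fn bytes_produced(&self) -> usize {
        self.bytes_produced
    }
}
```

An operator using such a wrapper would check bytes_produced() against a budget after each finished container and yield once the budget is exceeded.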
Tips for reviewer
Checklist
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.