Migrate caching of compressed data from SpanManager to BlobReader #232

vkuzniet · 2022-12-13T20:31:12Z

Is your feature request related to a problem? Please describe.
Once #231 is complete, SpanManager should stop caching compressed spans and cache only uncompressed spans going forward. Compressed spans should be cached within BlobReader. Please refer to the design doc from #103 for more details on the concepts.
In short, we need to do the following:

BlobReader owns the cache of compressed spans (aka regions).
SpanManager keeps only the cache of uncompressed spans. If it needs a compressed span, it requests it from BlobReader.
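
To make the proposed split concrete, here is a minimal Go sketch of the two cache layers. It is not the actual soci-snapshotter code: `SpanID`, `CompressedSpanSource`, the `fetch` callback, and the helper `gzipBytes` are hypothetical names, and the sketch assumes each span is an independently gzip-compressed chunk, which is a simplification of how spans are really decompressed.

```go
package main

import (
	"bytes"
	"compress/gzip"
	"io"
	"sync"
)

// SpanID identifies a span within a blob (hypothetical type).
type SpanID int

// CompressedSpanSource is the only thing SpanManager needs from BlobReader.
type CompressedSpanSource interface {
	CompressedSpan(id SpanID) ([]byte, error)
}

// BlobReader owns the cache of compressed spans.
// fetch stands in for the network request (an assumption of this sketch).
type BlobReader struct {
	mu    sync.Mutex
	cache map[SpanID][]byte
	fetch func(id SpanID) ([]byte, error)
}

func NewBlobReader(fetch func(SpanID) ([]byte, error)) *BlobReader {
	return &BlobReader{cache: make(map[SpanID][]byte), fetch: fetch}
}

// CompressedSpan returns the cached compressed bytes, fetching on a miss.
// The lock is held across the fetch for simplicity, which serializes misses.
func (b *BlobReader) CompressedSpan(id SpanID) ([]byte, error) {
	b.mu.Lock()
	defer b.mu.Unlock()
	if data, ok := b.cache[id]; ok {
		return data, nil
	}
	data, err := b.fetch(id)
	if err != nil {
		return nil, err
	}
	b.cache[id] = data
	return data, nil
}

// SpanManager caches only uncompressed spans.
type SpanManager struct {
	mu     sync.Mutex
	cache  map[SpanID][]byte
	source CompressedSpanSource
}

func NewSpanManager(source CompressedSpanSource) *SpanManager {
	return &SpanManager{cache: make(map[SpanID][]byte), source: source}
}

// Span returns the uncompressed span, asking the source for the
// compressed bytes only when the uncompressed cache misses.
func (m *SpanManager) Span(id SpanID) ([]byte, error) {
	m.mu.Lock()
	defer m.mu.Unlock()
	if data, ok := m.cache[id]; ok {
		return data, nil
	}
	compressed, err := m.source.CompressedSpan(id)
	if err != nil {
		return nil, err
	}
	zr, err := gzip.NewReader(bytes.NewReader(compressed))
	if err != nil {
		return nil, err
	}
	defer zr.Close()
	data, err := io.ReadAll(zr)
	if err != nil {
		return nil, err
	}
	m.cache[id] = data
	return data, nil
}

// gzipBytes compresses p; it exists only to build demo input.
func gzipBytes(p []byte) []byte {
	var buf bytes.Buffer
	zw := gzip.NewWriter(&buf)
	_, _ = zw.Write(p) // writes to a bytes.Buffer do not fail
	_ = zw.Close()
	return buf.Bytes()
}
```

Because SpanManager depends only on the small `CompressedSpanSource` interface, the way BlobReader fetches and caches compressed data could change later without touching the decompression side.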

Describe the solution you'd like
The outcome for this issue is a design for the separation.

Describe alternatives you've considered
Keep everything as is.

Additional context
#103

Kern-- · 2022-12-15T19:16:51Z

This needs additional design work. I will update this comment with more specific information later.

Kern-- · 2023-03-29T17:19:47Z

While we're doing this work, we should be thinking about the ability to decouple the actual network requests from the span size. A span is really a unit of decompression, but we also use it as a unit of data to be fetched from the network. We can probably get some better performance if we optimize our network requests separately from the decompression. E.g. S3 likes 8 or 16MiB requests, but we might want 2 or 4MiB spans to reduce necessary compute.

We don't actually have to solve this problem in the initial separation, but we shouldn't design it in such a way that we can't do that later.
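
As a rough illustration of what decoupling could look like, the Go sketch below maps a span to a larger, aligned network request and then splits the fetched range back into spans. It is a sketch under assumptions, not a proposed design: `ByteRange`, `SpanRange`, `FetchRangeFor`, and `SplitIntoSpans` are hypothetical names, spans are assumed to be fixed-size ranges of the compressed blob, and the fetch size is assumed to be a multiple of the span size.

```go
package main

// MiB is one mebibyte in bytes.
const MiB = int64(1) << 20

// ByteRange is a half-open range [Start, End) of compressed blob bytes.
type ByteRange struct {
	Start, End int64
}

// SpanRange returns the byte range of span i, clamped to blobSize.
func SpanRange(i, spanSize, blobSize int64) ByteRange {
	start := i * spanSize
	end := start + spanSize
	if end > blobSize {
		end = blobSize
	}
	return ByteRange{Start: start, End: end}
}

// FetchRangeFor returns the fetch-aligned range that contains offset,
// clamped to blobSize. This is the unit of network I/O.
func FetchRangeFor(offset, fetchSize, blobSize int64) ByteRange {
	start := (offset / fetchSize) * fetchSize
	end := start + fetchSize
	if end > blobSize {
		end = blobSize
	}
	return ByteRange{Start: start, End: end}
}

// SplitIntoSpans cuts a fetched range into span-sized pieces, the unit of
// decompression. It assumes r.Start is span-aligned, which holds when
// fetchSize is a multiple of spanSize.
func SplitIntoSpans(r ByteRange, spanSize int64) []ByteRange {
	var spans []ByteRange
	for start := r.Start; start < r.End; start += spanSize {
		end := start + spanSize
		if end > r.End {
			end = r.End
		}
		spans = append(spans, ByteRange{Start: start, End: end})
	}
	return spans
}
```

With 4 MiB spans and 16 MiB fetches, a request for span 5 (bytes 20 to 24 MiB) would be served by fetching the 16 to 32 MiB range once, which also populates the compressed cache for the three neighboring spans.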

Kern-- · 2023-04-03T18:43:40Z

TODO: Merge with #534

vkuzniet added the feature New feature or request label Dec 13, 2022

vkuzniet added this to soci-snapshotter Dec 13, 2022

vkuzniet moved this to ❓ Ungroomed in soci-snapshotter Dec 13, 2022

Kern-- changed the title ~~Refactor the data path for soci snapshotter: part 2~~ Move caching of compressed data from SpanManager to BlobReader Dec 13, 2022

Kern-- changed the title ~~Move caching of compressed data from SpanManager to BlobReader~~ Migrate caching of compressed data from SpanManager to BlobReader Dec 13, 2022

Kern-- mentioned this issue Mar 29, 2023

[FEATURE] Decouple span size from download size #534

Open

Kern-- moved this from ❓ Ungroomed to 📋 Backlog in soci-snapshotter Apr 3, 2023
