
Optimize DiskV2 Deduplication #878

Merged: fkorotkov merged 14 commits into main on Aug 5, 2024
Conversation

fkorotkov (Contributor) commented:

I ran some Instruments CPU profiles while pulling ghcr.io/cirruslabs/macos-sonoma-base:latest on top of ghcr.io/cirruslabs/macos-sonoma-vanilla:latest and found a few bottlenecks. See commits for details.

In case we cloned `disk.img` from a local image, check whether the data at the offset already has the expected contents.

if isHoleAligned && chunk == zeroChunk {
if actualContentsOnDisk == chunk {
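
For context, this check compares the bytes already present in the APFS-cloned disk.img with the chunk about to be written and skips the write when they match, so the block stays shared with the source image. A minimal sketch of the idea (hypothetical helper name, not the PR's exact code):

import Foundation

// Sketch: before writing a chunk at `offset`, read the same range from the
// cloned disk.img; if the bytes already match, skip the write so the block
// keeps being deduplicated by the clone.
func writeChunkIfNeeded(_ chunk: Data, to disk: FileHandle, at offset: UInt64) throws {
  try disk.seek(toOffset: offset)
  let actualContentsOnDisk = try disk.read(upToCount: chunk.count)

  if actualContentsOnDisk == chunk {
    return
  }

  try disk.seek(toOffset: offset)
  try disk.write(contentsOf: chunk)
}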
fkorotkov (Contributor, Author) commented:

This is still a place for optimization. One thought was to not try to optimize holes at the sub-layer level and only deduplicate whole layers: hole-punch the entire uncompressed layer and then sparse-write into it.
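
A rough sketch of that alternative, assuming macOS's F_PUNCHHOLE fcntl(2) command; function names and structure are illustrative, not the PR's code:

import Foundation
import System

// Deallocate (hole-punch) the blocks backing [offset, offset + length) of the file.
func punchHole(fd: Int32, offset: off_t, length: off_t) throws {
  var args = fpunchhole_t(fp_flags: 0, reserved: 0, fp_offset: offset, fp_length: length)
  guard fcntl(fd, F_PUNCHHOLE, &args) == 0 else {
    throw Errno(rawValue: errno)
  }
}

// Hole-punch the entire uncompressed layer region first, then sparse-write
// only the chunks that actually contain non-zero data.
func writeLayerSparsely(_ layer: Data, fd: Int32, layerOffset: off_t,
                        chunkSize: Int, zeroChunk: Data) throws {
  try punchHole(fd: fd, offset: layerOffset, length: off_t(layer.count))

  var lower = layer.startIndex
  while lower < layer.endIndex {
    let upper = layer.index(lower, offsetBy: chunkSize, limitedBy: layer.endIndex) ?? layer.endIndex
    let chunk = layer.subdata(in: lower..<upper)

    if chunk != zeroChunk {
      try chunk.withUnsafeBytes { buf in
        let written = pwrite(fd, buf.baseAddress, buf.count,
                             layerOffset + off_t(lower - layer.startIndex))
        guard written == buf.count else {
          throw Errno(rawValue: errno)
        }
      }
    }

    lower = upper
  }
}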

if try pullResumed && Digest.hash(diskURL, offset: diskWritingOffset, size: uncompressedLayerSize) == uncompressedLayerContentDigest {
// Update the progress
progress.completedUnitCount += Int64(diskLayer.size)
if pullResumed {
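
Read together, the quoted hunk replaces the single combined condition with a nested check so that the hash is only computed on resumed pulls. Roughly, the shape of the change (a reconstruction, not the exact diff):

// Before: one combined condition.
if try pullResumed && Digest.hash(diskURL, offset: diskWritingOffset,
                                  size: uncompressedLayerSize) == uncompressedLayerContentDigest {
  // Update the progress
  progress.completedUnitCount += Int64(diskLayer.size)
}

// After: only reach the hash check when resuming a pull.
if pullResumed {
  if try Digest.hash(diskURL, offset: diskWritingOffset,
                     size: uncompressedLayerSize) == uncompressedLayerContentDigest {
    // Update the progress
    progress.completedUnitCount += Int64(diskLayer.size)
  }
}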
edigaryev (Collaborator) commented:

I'm not sure if we need this change, because currently short-circuit evaluation will kick in if pullResumed is false, and the hash won't be calculated.
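
For reference, Swift's && takes its right-hand operand as an @autoclosure and short-circuits even under try; a tiny standalone check (hypothetical names):

func expensiveHash() throws -> String {
  print("expensiveHash() evaluated")
  return "deadbeef"
}

func check(pullResumed: Bool, expected: String) throws {
  // When pullResumed is false, the right-hand operand is never evaluated,
  // so expensiveHash() prints nothing.
  if try pullResumed && expensiveHash() == expected {
    print("hash matches")
  }
}

try check(pullResumed: false, expected: "deadbeef") // prints nothing
try check(pullResumed: true, expected: "deadbeef")  // prints both lines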

fkorotkov (Contributor, Author) replied:

Not according to the profiling I did. Both parts of the expression are getting evaluated regardless.

edigaryev (Collaborator) replied:

Is it possible that pullResumed was set to true when the profiling was done?

fkorotkov (Contributor, Author) replied:

Unlikely, I was running pulls to completion between profiles.

edigaryev (Collaborator) commented on Aug 5, 2024:

I've just done some debugging and I think that we can safely revert this, because the compiler is compiling the original line of code correctly.

It's just that there's one more call to Digest.hash() visible in Instruments' "CPU Profiler", and it is located below:

if let localLayerCache = localLayerCache, let data = localLayerCache.find(diskLayer.digest), Digest.hash(data) == uncompressedLayerContentDigest {

fkorotkov (Contributor, Author) replied:

On Sequoia it's still the case. Maybe it has something to do with the `try`, I don't know.

extension Data {
  /*
   * Performant version of splitting a Data into chunks of a given size.
   * It appears that `Data.chunks` is not as performant as chunking the range of the data
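
The quoted hunk is truncated here; a helper of that shape, using the subdataChunks(ofCount:) name mentioned below (a sketch, not necessarily the PR's implementation), might look like:

import Foundation

extension Data {
  /// Splits the data into consecutive chunks of at most `count` bytes by
  /// walking the index range and copying each slice with subdata(in:).
  func subdataChunks(ofCount count: Int) -> [Data] {
    var chunks: [Data] = []
    chunks.reserveCapacity((self.count + count - 1) / count)

    var lower = startIndex
    while lower < endIndex {
      let upper = index(lower, offsetBy: count, limitedBy: endIndex) ?? endIndex
      chunks.append(subdata(in: lower..<upper))
      lower = upper
    }

    return chunks
  }
}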
edigaryev (Collaborator) commented on Aug 5, 2024:

How did you measure this? Did you use Instruments' "CPU Profiler"? Did you run `sudo purge` before the test?

I've replaced `tart list`'s run() function with the following contents:

import Foundation
import Algorithms // assuming chunks(ofCount:) below comes from the swift-algorithms package

let disk = try FileHandle(forReadingFrom: URL(fileURLWithPath: "/Users/edi/.tart/vms/macos/disk.img"))

let holeGranularityBytes = 64 * 1024

var count = 0

while true {
  guard let data = try disk.read(upToCount: 64 * 1024 * 1024) else {
    break
  }
  
  for chunk in data.chunks(ofCount: holeGranularityBytes) {
    count += chunk.count
  }
}

print(count)

Here's a profile when using chunks() which takes ~15 seconds and ~20 Mc to churn through macOS Sonoma's disk image:

[Screenshot: Instruments CPU Profiler results for chunks()]

Now, by changing chunks() to subdataChunks(), we do the same in ~27 seconds, using ~110 Mc:

[Screenshot: Instruments CPU Profiler results for subdataChunks()]

fkorotkov (Contributor, Author) replied:

I didn't purge, but this was not the biggest optimization anyway. Let's revert and profile it separately. See 773a403.

fkorotkov merged commit ff928ad into main on Aug 5, 2024
7 checks passed
fkorotkov deleted the fedor-optimize-diskv2-deduplication branch on August 5, 2024 at 16:24