Deduplicate layer IDs to handle cases where duplicate layers are in the layer tree #1172

komish · 2024-05-31T19:14:37Z

In certain edge cases, a container builder may produce multiple "empty" layers that may have the same digest value. When parsing those layers, we fail to find an RPM database (which is expected) and therefore carry the previous layer's package list forward.

We run into issues when layers with the same ID and no modified files (e.g. "empty" layers) are separated by layers WITH package changes, such that when we carry forward file lists from previous layers, we would be impacting an earlier entry's file list because we use a map where the key is the layer ID. Concrete example below[1].

This PR changes our mapping to glue together a layer's ID and its relative position in the image, which prevents overwriting previous layer values and fixes a bug in how we detect modified files layer-over-layer.

It also adds logging to map DiffID, LayerID, and the deduped/calculated value in trace logs to assist with debugging.

[1]

Assume we have 4 layers with these modified files.

-- base layer omitted --
0: sha256:1111 - [] 
1: sha256:2222 - ["/modified", "/path/to/rpmdb"]
2: sha256:1111 - []
3: sha256:3333 - ["/unrelated"]

We parse layer 0 and see it doesn't have any files (nor RPMDB), so we store a mapping of that layer's ID ("sha256:1111")to a list of packages that might be associated (which will be empty in this example because the previous, omitted layer is the base layer.

As soon as layer 1 (first layer to contain an RPMDB) is evaluated, that becomes our source of truth. We store the package lists associated with it and use those to inform the "do not modify" list. In our mapping, the package information is mapped to this layer's ID as the map key ("sha256:2222").

When layer 2 is parsed and found to be empty, we map its package list to the previous layer (1) package list. Because the layer ID of layer 2 matches that of layer 0, we've now inadvertently populated layer 0's package list. This is the problem.

Later, when we validate each of these layers, we detect that layer 0 has packages (the same as layer 1), and treat that as our source of truth. Layer 1, then, is considered to have modified all of the same files as layer 0 - which fails the check.

To solve this, I bind the layer ID with its index position (e.g. "00-sha256:1111"). It makes things a little bit heavier, but barring refactoring to remove the map AND array usage and/or change the way this works - this seemed simpler and effective enough.

acornett21 · 2024-05-31T19:43:39Z

I'm okay with this implementation, but also want to point out that we could also remove the empty layers before we build the maps/associations (ie before we iterate over the layers). Though, with that type of implementation Im not sure what the cutoff point for empty means size wise, since crane we are dealing with the uncompressed values and they are not zero at that point.

It would be nice to know others thoughts on various approaches as well.

komish · 2024-05-31T19:54:40Z

+1 @acornett21 I briefly tested skipping empty layers based on size - and just ended up at "how do I confidently determine that the layer is truly empty" aside from looking at content. The smallest value I saw returned from layer.Size() was 34. I didn't test much beyond that, but it's another option. Will wait for comment.

acornett21 · 2024-05-31T20:01:11Z

how do I confidently determine that the layer is truly empty

This is where I landed as well, I thought I was missing something as well with the size calculation. The one advantage of this I see, is it could speed up our processing by spiking some layers. But if don't have confidence on what the min size should be, they the saying cycles point is probably moot.

dcibot · 2024-05-31T20:59:42Z

from change #1172:

SUCCESS https://www.distributed-ci.io/jobs/50e4eb08-ae3a-444b-87f3-14bd0b2bb846/jobStates

bcrochet

/lgtm

jfrancin · 2024-06-04T20:30:05Z

I hope this will get reviewed soon - have a partner waiting on the new Preflight version with this fix...

komish · 2024-06-04T20:36:06Z

@jfrancin Yep, sorry for the delay - I've been trying to replicate the failing image with little success, and finally cracked it just a bit ago. Whipping up a test case, and we should be set.

acornett21 · 2024-06-04T20:48:49Z

@jfrancin we are well aware of the urgency, like mentioned we've been trying to understand how the partner built an image with multiple of the same layer (and empty ones at that), and come with with a repeatable test so we do not have an regressions. Figuring out how they got an image in this state, has been extremely difficult to replicate.

komish · 2024-06-04T22:01:08Z

@acornett21 @bcrochet A note -

This PR reconfigures HasModifiedFiles such that the packageDist value is extracted in a separate function from the package file map building process. This simplifies testing of the file mapping process by allowing us to build an ImageReference fragment that doesn't contain the extracted filesystem needed by the packageDist logic.

As a result, several return signatures have changed, dropping the returned packageDist.

Also note that the test written here uses a static fixture. The containerfile used to build it is included - but using a static fixture saves us storing the image in-repo (~110m). It's been documented as a TODO item to optimize at a later date.

internal/policy/container/has_modified_files.go

…he layer tree Signed-off-by: Jose R. Gonzalez <komish@flutes.dev>

coveralls · 2024-06-04T22:11:30Z

coverage: 84.782% (+3.2%) from 81.591%
when pulling 517a88b on komish:fix-hmf-dupe-layers
into 354d10d on redhat-openshift-ecosystem:main.

acornett21

Couple of comments, nothing blocking.

internal/policy/container/has_modified_files.go

internal/policy/container/has_modified_files_test.go

dcibot · 2024-06-04T22:52:01Z

from change #1172:

SUCCESS https://www.distributed-ci.io/jobs/d0c95efb-1909-41f6-acf4-85a38c9e056d/jobStates

bcrochet

/lgtm

openshift-ci · 2024-06-05T15:48:46Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: acornett21, bcrochet, komish

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [acornett21,bcrochet,komish]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci bot requested review from jomkz and tonytcampbell May 31, 2024 19:14

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 31, 2024

komish requested a review from bcrochet May 31, 2024 19:54

bcrochet approved these changes Jun 4, 2024

View reviewed changes

openshift-ci bot assigned bcrochet Jun 4, 2024

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jun 4, 2024

komish force-pushed the fix-hmf-dupe-layers branch from 2ef1122 to 8477aa7 Compare June 4, 2024 21:55

openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Jun 4, 2024

komish force-pushed the fix-hmf-dupe-layers branch from 8477aa7 to 542630e Compare June 4, 2024 21:57

acornett21 reviewed Jun 4, 2024

View reviewed changes

internal/policy/container/has_modified_files.go Show resolved Hide resolved

komish force-pushed the fix-hmf-dupe-layers branch from 542630e to 64272ca Compare June 4, 2024 22:04

deduplicate layer IDs to handle cases where duplicate layers are in t…

517a88b

…he layer tree Signed-off-by: Jose R. Gonzalez <komish@flutes.dev>

komish force-pushed the fix-hmf-dupe-layers branch from 64272ca to 517a88b Compare June 4, 2024 22:08

komish requested review from bcrochet and acornett21 June 4, 2024 22:10

acornett21 reviewed Jun 4, 2024

View reviewed changes

internal/policy/container/has_modified_files.go Show resolved Hide resolved

internal/policy/container/has_modified_files_test.go Show resolved Hide resolved

acornett21 approved these changes Jun 5, 2024

View reviewed changes

openshift-ci bot assigned acornett21 Jun 5, 2024

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jun 5, 2024

bcrochet approved these changes Jun 5, 2024

View reviewed changes

komish merged commit 3bdeab0 into redhat-openshift-ecosystem:main Jun 5, 2024
5 checks passed

komish deleted the fix-hmf-dupe-layers branch June 5, 2024 15:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deduplicate layer IDs to handle cases where duplicate layers are in the layer tree #1172

Deduplicate layer IDs to handle cases where duplicate layers are in the layer tree #1172

komish commented May 31, 2024 •

edited

Loading

acornett21 commented May 31, 2024

komish commented May 31, 2024 •

edited

Loading

acornett21 commented May 31, 2024

dcibot commented May 31, 2024

bcrochet left a comment

jfrancin commented Jun 4, 2024

komish commented Jun 4, 2024

acornett21 commented Jun 4, 2024

komish commented Jun 4, 2024

coveralls commented Jun 4, 2024

acornett21 left a comment

dcibot commented Jun 4, 2024

bcrochet left a comment

openshift-ci bot commented Jun 5, 2024

Deduplicate layer IDs to handle cases where duplicate layers are in the layer tree #1172

Deduplicate layer IDs to handle cases where duplicate layers are in the layer tree #1172

Conversation

komish commented May 31, 2024 • edited Loading

acornett21 commented May 31, 2024

komish commented May 31, 2024 • edited Loading

acornett21 commented May 31, 2024

dcibot commented May 31, 2024

bcrochet left a comment

Choose a reason for hiding this comment

jfrancin commented Jun 4, 2024

komish commented Jun 4, 2024

acornett21 commented Jun 4, 2024

komish commented Jun 4, 2024

coveralls commented Jun 4, 2024

acornett21 left a comment

Choose a reason for hiding this comment

dcibot commented Jun 4, 2024

bcrochet left a comment

Choose a reason for hiding this comment

openshift-ci bot commented Jun 5, 2024

komish commented May 31, 2024 •

edited

Loading

komish commented May 31, 2024 •

edited

Loading