Hash copied files #2055

bsideup · 2019-11-09T23:01:54Z

depends on #2051

rnorth · 2019-11-27T19:52:20Z

core/src/main/java/org/testcontainers/containers/GenericContainer.java

+        checksum.update(MountableFile.getUnixFileMode(file.toPath()));
+        if (file.isDirectory()) {
+            try (Stream<Path> stream = Files.walk(file.toPath())) {
+                stream.filter(it -> !Files.isDirectory(it)).forEach(path -> {


We have the file mode included in the checksum at line 506 which is cool...

But here, when we walk the contents, are we only walking sub-files (i.e. direct child files and child files of subdirectories)?

If that's the case then we'd not capture the file mode of subdirectories - is that the correct interpretation?

Nice catch! Fixed.
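To illustrate the concern above, here is a minimal sketch of a checksum that walks a tree and mixes in the mode of every entry, directories included. It is not the code merged in this PR: the class and method names are hypothetical, `CRC32` is an arbitrary choice of `Checksum`, and falling back to mode `0` where the `unix` attribute view is unavailable is an assumption made only to keep the example portable.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;
import java.util.zip.CRC32;
import java.util.zip.Checksum;

// Hypothetical helper, for illustration only.
class TreeChecksumSketch {

    // Reads the Unix mode bits; returns 0 where the "unix" view is not supported (assumption).
    static int unixMode(Path path) throws IOException {
        try {
            return (int) Files.getAttribute(path, "unix:mode");
        } catch (UnsupportedOperationException | IllegalArgumentException e) {
            return 0;
        }
    }

    // Checksums relative path, mode and (for regular files) content of every entry under root.
    static long checksumTree(Path root) throws IOException {
        Checksum checksum = new CRC32();
        try (Stream<Path> stream = Files.walk(root)) {
            // Sort so the result does not depend on directory listing order.
            stream.sorted().forEach(path -> {
                try {
                    byte[] name = root.relativize(path).toString().getBytes(StandardCharsets.UTF_8);
                    checksum.update(name, 0, name.length);
                    // Directories are not filtered out, so their mode is hashed as well.
                    byte[] mode = ByteBuffer.allocate(Integer.BYTES).putInt(unixMode(path)).array();
                    checksum.update(mode, 0, mode.length);
                    if (Files.isRegularFile(path)) {
                        byte[] content = Files.readAllBytes(path);
                        checksum.update(content, 0, content.length);
                    }
                } catch (IOException e) {
                    throw new UncheckedIOException(e);
                }
            });
        }
        return checksum.getValue();
    }
}
```

With this shape, changing the permissions of a nested directory changes the result even when no file content changes, which is the case a files-only filter would miss.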

rnorth · 2019-11-27T20:02:51Z

core/src/main/java/org/testcontainers/utility/MountableFile.java

        try {
-            return (int) Files.getAttribute(path, "unix:mode");
+            return (int) Files.readAttributes(path, "unix:mode").get("mode");


I was curious as to why we'd need to use readAttributes here - we're still only fetching the mode, so getAttribute perhaps doesn't really have a performance disadvantage.

However, now this makes me wonder if we should also be hashing basic file attributes like created/modified timestamps etc 😭

WDYT? I don't mind deferring this, TBH...
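For reference, a small sketch of the two lookups being compared in the diff above. The class and method names are hypothetical; both `Files.getAttribute` and `Files.readAttributes` are standard `java.nio.file` calls, and the `unix` attribute view is assumed to be available (it is not on Windows).

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Map;

// Hypothetical helper, for illustration only.
class UnixModeLookupSketch {

    // Single-attribute lookup, as on the removed line of the diff.
    static int viaGetAttribute(Path path) throws IOException {
        return (int) Files.getAttribute(path, "unix:mode");
    }

    // Bulk lookup restricted to one attribute, as on the added line of the diff.
    // The returned map is keyed by attribute name without the "unix:" prefix.
    static int viaReadAttributes(Path path) throws IOException {
        Map<String, Object> attributes = Files.readAttributes(path, "unix:mode");
        return (int) attributes.get("mode");
    }
}
```

Both methods should return the same value for the same path; the low nine bits (`mode & 0777`) are the permission bits.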

since we checksum content & file mode (the only bits that actually go into the tar archive), I guess we can defer the basic file attributes until someone reports that the hashing works incorrectly for them.

I will also check what Dockerfile builder is using for hashing COPY

Cool, sounds sensible 👍

https://stackoverflow.com/a/59073724/1826422

For the ADD and COPY instructions, the contents of the file(s) in the image are examined and a checksum is calculated for each file. The last-modified and last-accessed times of the file(s) are not considered in these checksums. During the cache lookup, the checksum is compared against the checksum in the existing images. If anything has changed in the file(s), such as the contents and metadata, then the cache is invalidated.

okay, looks pretty aligned with what we're doing
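A rough sketch of what "aligned" means here, assuming a checksum built only from file mode and content: touching the last-modified time leaves it unchanged, while editing the content does not. The class name is hypothetical, `Adler32` is just one possible `Checksum`, and the fallback to mode `0` without the `unix` view is an assumption of this example.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.Adler32;
import java.util.zip.Checksum;

// Hypothetical helper, for illustration only.
class ContentModeChecksumSketch {

    // Hashes mode bits and content; timestamps are deliberately not read.
    static long checksum(Path file) throws IOException {
        Checksum checksum = new Adler32();
        int mode;
        try {
            mode = (int) Files.getAttribute(file, "unix:mode");
        } catch (UnsupportedOperationException | IllegalArgumentException e) {
            mode = 0; // assumption: no Unix mode available on this file system
        }
        byte[] modeBytes = ByteBuffer.allocate(Integer.BYTES).putInt(mode).array();
        checksum.update(modeBytes, 0, modeBytes.length);
        byte[] content = Files.readAllBytes(file);
        checksum.update(content, 0, content.length);
        return checksum.getValue();
    }
}
```

Calling `Files.setLastModifiedTime` on the file between two `checksum` calls should not change the value, which mirrors the quoted behaviour for `ADD` and `COPY`.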

bsideup added 2 commits November 8, 2019 19:57

Fix session label when reuse is not supported but requested

c53c75f

Hash copied files

43212e6

bsideup added area/docker-compose type/feature area/bitbucket-pipelines area/reusable-containers labels Nov 9, 2019

bsideup added this to the next milestone Nov 9, 2019

bsideup requested a review from a team November 9, 2019 23:01

bsideup requested review from kiview and rnorth as code owners November 9, 2019 23:01

bsideup and others added 3 commits November 10, 2019 00:17

Merge branch 'master' into copied_files_hash

309e77d

restore import

8701694

Merge branch 'master' into copied_files_hash

82bc7ba

bsideup removed area/bitbucket-pipelines area/docker-compose labels Nov 10, 2019

bsideup commented Nov 10, 2019

rnorth reviewed Nov 10, 2019

rnorth reviewed Nov 10, 2019

Merge remote-tracking branch 'origin/master' into copied_files_hash

913f5e5

# Conflicts: # core/src/test/java/org/testcontainers/containers/ReusabilityUnitTests.java

bsideup requested a review from rnorth November 10, 2019 20:19

bsideup and others added 6 commits November 10, 2019 21:49

Merge branch 'master' into copied_files_hash

4db57cc

Merge branch 'master' into copied_files_hash

86749b9

# Conflicts: # core/src/test/java/org/testcontainers/containers/ReusabilityUnitTests.java

Merge remote-tracking branch 'origin/copied_files_hash' into copied_files_hash

3d2f188

add file mode to the checksum

bee32a4

Merge remote-tracking branch 'origin/master' into copied_files_hash

ec6bc8f

# Conflicts: # core/src/test/java/org/testcontainers/containers/ReusabilityUnitTests.java

Mock TestcontainersConfiguration

1967319

rnorth reviewed Nov 27, 2019

checksum folders too

e8f255f

rnorth approved these changes Nov 27, 2019

bsideup merged commit 3cb3379 into master Nov 27, 2019

delete-merged-branch bot deleted the copied_files_hash branch November 27, 2019 20:48
