sidecar file naming and format #520

pwinckles · 2020-10-26T17:08:17Z

The specification for the inventory sidecar is awkward to use because you often want to verify the integrity of an inventory file before deserializing it, but this is complicated by the fact that the name of the sidecar file is dependent on the digest algorithm that's defined within the inventory file.

On filesystem implementations, this is annoying but not a big deal. You can just list the files and examine their names to identify the sidecar. However, the problem is more annoying for object store implementations to resolve.
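As a rough illustration of the filesystem workaround described above, the sketch below lists the object root, picks out the file whose name starts with `inventory.json.`, and checks the inventory bytes against it without parsing any JSON. It assumes the OCFL 1.0 sidecar layout (digest, whitespace, then the file name) and that the suffix is a name `hashlib` accepts; the function names `find_sidecar` and `verify_inventory` are made up for this example.

```python
import hashlib
from pathlib import Path

SIDECAR_PREFIX = "inventory.json."


def find_sidecar(object_root: Path) -> Path:
    """Return the single inventory sidecar found by listing object_root."""
    candidates = [
        p for p in object_root.iterdir()
        if p.is_file() and p.name.startswith(SIDECAR_PREFIX)
    ]
    if len(candidates) != 1:
        raise ValueError(f"expected exactly one sidecar, found {len(candidates)}")
    return candidates[0]


def verify_inventory(object_root: Path) -> bool:
    """Check inventory.json against its sidecar without parsing the JSON."""
    sidecar = find_sidecar(object_root)
    # The algorithm is taken from the file name suffix, e.g. "sha512".
    algorithm = sidecar.name[len(SIDECAR_PREFIX):]
    # Assumed sidecar contents: "<hex digest> inventory.json"
    expected = sidecar.read_text(encoding="utf-8").split()[0].lower()
    inventory_bytes = (object_root / "inventory.json").read_bytes()
    return hashlib.new(algorithm, inventory_bytes).hexdigest() == expected
```

On an object store the equivalent of `iterdir()` is a prefix listing request, which is the extra round trip the issue is about.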

It seems to me that the sidecar file specification was based on the BagIt manifest specification, but it does not seem like a good fit.

With BagIt you can have multiple manifest files, using different algorithms, but with OCFL there may only be one and it must use the algorithm defined in the inventory.
With BagIt the manifest lists the digests for many files, but with OCFL the sidecar only ever contains the digest of the inventory file.

The ship may have sailed on this one, but, to me, it makes more sense if the sidecar MUST be named inventory.json.sidecar (or whatever better name you can come up with), and have contents like ALGORITHM\tDIGEST, where ALGORITHM MUST be the same as the algorithm that's defined in the inventory.

This would allow the sidecar to be easily located without needing to deserialize the inventory or root around looking for it.
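To make the proposal concrete, here is a minimal sketch of how a client might read a fixed-name sidecar in the suggested `ALGORITHM\tDIGEST` form. This format is only the suggestion made in this issue, not part of OCFL 1.0, and the helper names `parse_sidecar` and `inventory_matches` are hypothetical.

```python
import hashlib

# Fixed name proposed in this issue; it is NOT defined by OCFL 1.0.
SIDECAR_NAME = "inventory.json.sidecar"


def parse_sidecar(text: str) -> tuple[str, str]:
    """Split 'ALGORITHM<TAB>DIGEST' into its two fields."""
    algorithm, sep, digest = text.strip().partition("\t")
    if not sep or not algorithm or not digest:
        raise ValueError("expected ALGORITHM<TAB>DIGEST")
    return algorithm, digest.lower()


def inventory_matches(inventory_bytes: bytes, sidecar_text: str) -> bool:
    """Verify the raw inventory bytes before any JSON deserialization."""
    algorithm, expected = parse_sidecar(sidecar_text)
    return hashlib.new(algorithm, inventory_bytes).hexdigest() == expected
```

Because the name is fixed, a client could fetch `SIDECAR_NAME` directly and learn the algorithm from its contents, with no listing and no inventory parsing.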

[Edit] Reflecting on it more, I see that the format does align with how checksums are usually stored on Unix systems. It's easy for a person to use manually. It's just more complicated to use programmatically.

awoods · 2020-11-10T18:19:42Z

With an interest in retaining compliance with the 1.0 specification, one approach could be to create an object or storage root extension that defines the digestAlgorithm used. This would facilitate direct access to any given inventory digest file.
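A sketch of how such an extension might be consumed follows. No such extension is defined in this thread, so the config shape (a JSON document with a `digestAlgorithm` field) and the function name `sidecar_key` are assumptions made purely for illustration.

```python
import json


def sidecar_key(extension_config: str, object_root: str) -> str:
    """Build the sidecar path or key from a declared digest algorithm.

    extension_config is the JSON text of a hypothetical extension config
    that carries a "digestAlgorithm" field.
    """
    config = json.loads(extension_config)
    algorithm = config["digestAlgorithm"]
    # The inventory.json.<algorithm> naming follows the OCFL 1.0 sidecar rule.
    return f"{object_root.rstrip('/')}/inventory.json.{algorithm}"
```

With the algorithm known up front, an object store client could request the sidecar by key in a single call rather than listing the object root first.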

zimeon · 2020-11-11T16:15:09Z

It would certainly seem reasonable to have a storage root level statement (i.e. an extension) that says "every object will use sha512 digests/sidecars" or at least "the latest version of every object will use sha512 digests/sidecars" -- this would essentially turn any occurrence of something else into a local error.

bcail · 2020-11-11T17:27:45Z

Having a storage root or object extension that defines the digestAlgorithm sounds fine (actually I might lean toward a storage root extension - not sure it makes much sense to have an object extension that you load to find out the algorithm that you can find in other places in the object).

awoods · 2020-11-11T17:44:12Z

I am marking this as a 2.0 issue, with the 1.0 recommendation of defining an extension that details which algorithm is used in order to directly know the name of the inventory file.

rosy1280 · 2023-09-22T20:49:13Z

@pwinckles is this something that you still need addressed, or are you happy with the way things are?

pwinckles · 2023-09-22T21:08:40Z

@rosy1280 you can close

zimeon added the OCFL Object label Nov 3, 2020

awoods added this to the 2.0 milestone Nov 11, 2020

zimeon added the Deferred to V2 label Jan 21, 2022

zimeon removed the Deferred to V2 label Sep 22, 2023

rosy1280 removed this from the 2.0 milestone Sep 22, 2023

rosy1280 closed this as completed Sep 22, 2023
