Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add eStargz specification to OCI v1 (support lazy pulling) #877

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

ktock
Copy link

@ktock ktock commented Nov 9, 2021

Resolves #815

This PR adds support of lazy pulling to OCI Image Spec v1 by standardizing eStargz image format as an optional extension.
No need to introduce a new layer media type, because eStargz is fully compatible with application/vnd.oci.image.layer.v1.tar+gz.

eStargz has been adopted by tools in the community, including containerd(stargz-snapshotter), Podman, k3s, BuildKit, Kaniko, ko, buildpacks.io, Harbor(acceleration-service). Please refer to containerd/stargz-snapshotter#258 for the complete list.

The following is the summary of this PR:

  • Moves eStargz specification document (estargz.md) from stargz-snapshotter repo to this repo.
  • Adds the link to estargz.md in the existing documents.
  • Adds the annotation org.opencontainers.image.toc.digest defined in estargz.md to specs-go.

@ktock
Copy link
Author

ktock commented Nov 9, 2021

estargz.md Outdated Show resolved Hide resolved
AkihiroSuda
AkihiroSuda previously approved these changes Nov 10, 2021
Copy link
Member

@AkihiroSuda AkihiroSuda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM (IANAM)

@cyphar PTAL

annotations.md Outdated
@@ -49,6 +49,7 @@ This specification defines the following annotation keys, intended for but not l
* This SHOULD be the immediate image sharing zero-indexed layers with the image, such as from a Dockerfile `FROM` statement.
* This SHOULD NOT reference any other images used to generate the contents of the image (e.g., multi-stage Dockerfile builds).
* If the `image.base.name` annotation is specified, the `image.base.digest` annotation SHOULD be the digest of the manifest referenced by the `image.ref.name` annotation.
* **org.opencontainers.image.toc.digest** Digest of the TOC JSON of [eStargz](estargz.md) layer.
Copy link

@jiangliu jiangliu Dec 13, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The concept of TOC digest may be helpful for other usage cases, so is it ok to relax the support not bound to estargz? For example, it may help to enable e2e integrity verification for Nydus(https://github.com/dragonflyoss/image-service).

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are there other formats which use TOC?

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The nydus project extracts all filesystem metadata to a dedicated blob in binary mode, which solves the same role as TOC for estargz. So hope we could reuse the same annotation too.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the suggestion. But if the "filesystem metadata" format is different from the TOC, shouldn't we have a separated key? Or, in following-up PRs, we can define "filesystem metadata" you mention then relax the scope of this key.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the annotation is specific to eStargz, should the key be org.opencontainers.image.estargz.digest?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@sudo-bmitch Thank you for the suggestion. Modified the key to org.opencontainers.image.estargz.toc.digest. I left toc here for futural extensibility (e.g. we may want other toc-related annotations in the future like estargz.toc.size or something)

@stevvooe
Copy link
Contributor

LGTM on the idea of adding this, but I haven't given this a detailed read.

I think we need to have the maintainers read this one in detail. We may also need a description of how to fetch these in the distribution-spec.

@ktock
Copy link
Author

ktock commented Jan 21, 2022

Thank you for the comments.

distribution-spec

Lazy pulling depends on HTTP range request which is already described in the distribution-spec so we don't need changes on the distribution-spec but can add notes that the range request is used by lazy pulling.

Lillithlilly
Lillithlilly previously approved these changes Apr 26, 2022
@AkihiroSuda
Copy link
Member

Needs rebase to make CI green

@ktock ktock dismissed stale reviews from Lillithlilly and AkihiroSuda via c7535c2 June 9, 2022 10:41
@ktock
Copy link
Author

ktock commented Jun 9, 2022

Rebased.

Signed-off-by: Kohei Tokunaga <ktokunaga.mail@gmail.com>
Copy link
Member

@tianon tianon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While this is really interesting, I'm a (soft) NACK on including it in the image-spec itself -- I think it's probably more productive to maintain this outside the spec, especially as there are now multiple (somewhat incompatible) implementations of similar ideas.

@ktock
Copy link
Author

ktock commented Apr 26, 2024

Thank you for your comments.

there are now multiple (somewhat incompatible) implementations of similar ideas.

Yes, there are several lazy pulling implementations in the community. So it would be great if OCI spec officially describes the format for lazy pulling so that runtimes and builders can adopt it and maintain the implementation while keeping interoperability among them.

@kzys
Copy link

kzys commented Jan 11, 2025

I can agree with @tianon. There are SOCI, Nydus and probably some more. The image spec doesn't feel like the place to collect and explain these alternative formats.

@sudo-bmitch
Copy link
Contributor

I'm leaning towards a middle ground, that the byte layout of the blob would ideally be an external spec. However, there are some pieces like that descriptor annotation that points to the TOC digest for validation, along with the layer media type, that make sense to include in the image-spec.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add eStargz specification to OCI v1 (support lazy pulling)
8 participants