cloud agents #12

cgwalters · 2018-07-12T20:26:03Z

Today, Container Linux uses an OEM partition for various cloud agents (e.g. GCE). For Fedora Atomic, we never created such a thing and mostly limped along with the (very limited) support that cloud-init has for different sites. The only exception here is that for RHEL Atomic Host we did make a VMWare agent container.

The architecture for Fedora CoreOS calls for us to close to CL here (Ignition + coreos-metadata) but that doesn't answer the larger cloud agent problem.

A known major issue with the CL approach is that there is no update mechanism for the OEM partition.

We have a few options, and we can consider different strategies per cloud.

Layering it on as a package just for that cloud
Layering but not updating it (i.e. we don't engage the rpm-md machinery)
separate ostree streams per cloud
Rkt/atomic system containers style
Statically linked binary in /opt

bgilbert · 2018-07-12T21:21:37Z

The CL model is to ship a different install image per platform, with common root and /usr partitions but a platform-specific OEM partition containing the agent (if any). Then, for updates, we ship a /usr partition and kernel. That means we can have a single update payload per CPU architecture per release, but conversely we can't update the OEM partition. That causes practical problems (sometimes we have to fix OEM bugs by putting ad-hoc code in coreos-postinst to modify the OEM partition) as well as problems of principle (we can't actually update everything we ship). So I'd say universal automatic updating is a necessity for FCOS, but it'd be good to avoid having separate update streams for each platform.

One other thing we've found: cloud agents are often not very necessary and are sometimes not great code. In many cases they bundle lots of additional functionality which ranges from potentially useful to irrelevant to actively harmful -- such as the ability to manage OS functionality that doesn't even exist on CL, or the ability for the platform to run arbitrary code on the machine. We can't eliminate agents entirely, since some platforms require their agent to report a successful boot before they'll allow the user to interact with the machine. In many cases, though, we should be able to implement a minimal cross-platform agent ourselves, e.g. as part of coreos-metadata.

In early internal discussions, there was a rough consensus around the following:

Short term: Do not build any functionality equivalent to the OEM partition, do not try to ship substantively different images for different platforms, and do not try to install agents via layers or containers. Ship any and all agents as part of the OS, launch them conditionally depending on the current platform, and live with the extra storage overhead.

Long term: Replace the platform agents, where possible, with our own minimal implementations. Ship those as part of the OS.

Thoughts?

cgwalters · 2018-07-12T21:25:03Z

Ah right. I'd forgotten about that discussion. Yes, baking them all in and doing conditional launching is also a pretty simple way to do things.

ajeddeloh · 2018-07-12T21:27:04Z

I am strongly in favor of avoiding shipping agents whenever possible. My dream is that instead of trying to expose all the little odds and ends of clouds (e.g. oslogin on gce) we try to make it as similar across clouds as possible. Running FCOS on gce should be the same as on aws and bare metal. Not only is this easier to manage from a development point of view, it makes FCOS more consistent across cluods. You have "the FCOS way" of adding users, not "The FCOS way, or the gce way, or the aws way, etc".

cgwalters · 2018-07-12T21:31:45Z

You have "the FCOS way" of adding users, not "The FCOS way, or the gce way, or the aws way, etc".

The clouds are going to dislike us for that, but I think I agree. In the end...for the clouds having "nicer/integrated" ways for users to manage guest OSes is sort of a previous battleground anyways, now it's all about services.

ajeddeloh · 2018-07-12T21:38:15Z

Yeah I agree they won't like that, but I think we that's a battle worth fighting with the clouds. We can be explicit that "If you special cloud bits, FCOS is not for you".

eparis · 2018-07-12T21:55:56Z

Or even better, FCOS is for you. Ship your agent as a container and it will work!

dustymabe · 2018-07-26T14:15:52Z

Short term: Do not build any functionality equivalent to the OEM partition, do not try to ship substantively different images for different platforms, and do not try to install agents via layers or containers. Ship any and all agents as part of the OS, launch them conditionally depending on the current platform, and live with the extra storage overhead.

Long term: Replace the platform agents, where possible, with our own minimal implementations. Ship those as part of the OS.

👍 👍

dustymabe · 2018-07-26T18:55:08Z

considering this to be decided then. will close

dustymabe · 2018-10-25T17:19:59Z

FYI this made it into the design doc in #40

bgilbert added the kind/design label Jul 12, 2018

dustymabe closed this as completed Jul 26, 2018

dustymabe added the status/decided label Aug 3, 2018

ajeddeloh mentioned this issue Aug 31, 2018

How to ship cloud specific bits #41

Closed

This was referenced Oct 31, 2018

no cloud agents: virtualbox #73

Closed

no cloud agents: qemu #74

Open

anuraagrijal3138 mentioned this issue Jun 30, 2020

PXE booting fails to install in disk #558

Closed

yuvalk mentioned this issue Jun 1, 2023

New Package Request: NetworkManager-libreswan #1504

Closed

dustymabe mentioned this issue Dec 16, 2023

Platform Request: Microsoft Hyper-V #1411

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cloud agents #12

cloud agents #12

cgwalters commented Jul 12, 2018 •

edited

Loading

bgilbert commented Jul 12, 2018

cgwalters commented Jul 12, 2018

ajeddeloh commented Jul 12, 2018

cgwalters commented Jul 12, 2018

ajeddeloh commented Jul 12, 2018

eparis commented Jul 12, 2018

dustymabe commented Jul 26, 2018

dustymabe commented Jul 26, 2018

dustymabe commented Oct 25, 2018

cloud agents #12

cloud agents #12

Comments

cgwalters commented Jul 12, 2018 • edited Loading

bgilbert commented Jul 12, 2018

cgwalters commented Jul 12, 2018

ajeddeloh commented Jul 12, 2018

cgwalters commented Jul 12, 2018

ajeddeloh commented Jul 12, 2018

eparis commented Jul 12, 2018

dustymabe commented Jul 26, 2018

dustymabe commented Jul 26, 2018

dustymabe commented Oct 25, 2018

cgwalters commented Jul 12, 2018 •

edited

Loading