Machine rebooted though we configured reboot-strategy: off #161

zeisss · 2014-10-05T10:47:29Z

Hi

this morning one of our nodes rebooted and updated from 402 to 457 with a reboot.

We have disabled reboots in the cloud-init though:

#cloud-config

coreos:
  update:
    reboot-strategy: off

  <more config>

/etc/coreos/update.conf says the same:

# cat /etc/coreos/update.conf
GROUP=alpha
REBOOT_STRATEGY=off

The text was updated successfully, but these errors were encountered:

sorccu · 2014-10-05T12:12:34Z

The reboot might have been caused by something else. Once an update has been applied, any reboot will activate it.

zeisss · 2014-10-05T13:27:39Z

Sounds reasonable, as reading the update-engines log doesn't clearly state that it rebooted the machine.
Sadly I have already destroyed the cluster so I cannot check any other logs.

Can we savely disable update-engine.service to be reboot secure?

crawford · 2014-10-05T15:31:16Z

Can you post the update-engine logs?

Masking update-engine.service will disable automatic updates. It obviously won't protect the machine from rebooting due to other mechanisms. Why do you want to disable updates?

zeisss · 2014-10-05T16:07:03Z

The logs I could get from journalctl can be found at https://gist.github.com/ZeissS/c4f7f624c8fd2366b174 - the reboot occured at 8:41.

It obviously won't protect the machine from rebooting due to other mechanisms.

Disabling rebooting is not the goal - changing the software is. Neither etcd nor fleet is currently that stable yet so we prefer pinning coreos to a specific release.

Why do you want to disable updates?

Well, coreos just destroyed our cluster with that reboot ^^ Sounds like reason enough for me to (not yet) trust it (we ran into etcd-io/etcd#815 (comment)).

sorccu · 2014-10-05T16:17:44Z

Well, I can say from personal experience that using alpha is a good way to occasionally destroy your cluster anyway. Instead of disabling updates, why not try sticking to stable (now that it's even newer than what you started with)?

zeisss · 2014-10-05T16:23:02Z

We definitely do this for the non-testing clusters ;) We still prefer for now to stick to a specific release.

Since disable update-engine is the way to go to prevent updates happening even on reboot and there does not seem to any reason to believe update-engine is the source of the reboot I am closing this now.

Thanks guys!

zeisss closed this as completed Oct 5, 2014

sj2208 mentioned this issue May 25, 2016

STF Production setup coreOS restart. openstf/stf#335

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Machine rebooted though we configured reboot-strategy: off #161

Machine rebooted though we configured reboot-strategy: off #161

zeisss commented Oct 5, 2014

sorccu commented Oct 5, 2014

zeisss commented Oct 5, 2014

crawford commented Oct 5, 2014

zeisss commented Oct 5, 2014

sorccu commented Oct 5, 2014

zeisss commented Oct 5, 2014

Machine rebooted though we configured reboot-strategy: off #161

Machine rebooted though we configured reboot-strategy: off #161

Comments

zeisss commented Oct 5, 2014

sorccu commented Oct 5, 2014

zeisss commented Oct 5, 2014

crawford commented Oct 5, 2014

zeisss commented Oct 5, 2014

sorccu commented Oct 5, 2014

zeisss commented Oct 5, 2014