Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Backport of connect: rewrite envoy bootstrap on every restart into release/1.5.x #19810

Merged

Conversation

hc-github-team-nomad-core
Copy link
Contributor

Backport

This PR is auto-generated from #19787 to be assessed for backporting due to the inclusion of the label backport/1.5.x.

The below text is copied from the body of the original PR.


Fixes #19781

Do not mark the envoy bootstrap hook as done after successfully running once. Since the bootstrap file is written to /secrets, which is a tmpfs on supported platforms, it is not persisted across reboots. This causes the task and allocation to fail on reboot (see #19781).

This fixes it by always rewriting the envoy bootstrap file every time the Nomad agent starts. This does mean we may write a new bootstrap file to an already running Envoy task, but in my testing that doesn't have any impact.

Alternative 1: Use a regular file

An alternative approach would be to write the bootstrap file somewhere other than the tmpfs, but this is unsafe as when Consul ACLs are enabled the file will contain a secret token:
https://developer.hashicorp.com/consul/commands/connect/envoy#bootstrap

Alternative 2: Detect if file is already written

An alternative approach would be to detect if the bootstrap file exists, and only write it if it doesn't.

This is just a more complicated form of the current fix. I think in general in the absence of other factors task hooks should be idempotent and therefore able to rerun on any agent startup. This simplifies the code and our ability to reason about task restarts vs agent restarts vs node reboots by making them all take the same code path.


Overview of commits

@hc-github-team-nomad-core hc-github-team-nomad-core force-pushed the backport/b-neverdone/lately-guiding-walrus branch from 8bf347b to 3dbcf99 Compare January 24, 2024 19:26
@vercel vercel bot temporarily deployed to Preview – nomad January 24, 2024 19:28 Inactive
@vercel vercel bot temporarily deployed to Preview – nomad-storybook-and-ui January 24, 2024 19:36 Inactive
@schmichael schmichael merged commit 627e7b4 into release/1.5.x Jan 24, 2024
22 of 26 checks passed
@schmichael schmichael deleted the backport/b-neverdone/lately-guiding-walrus branch January 24, 2024 19:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants