
Consul enable tag override master #5774

Conversation

bogdanalbei

At the moment, Consul's EnableTagOverride flag is not being set by Nomad, so the default (false) is always used. In certain cases it is useful for services outside Nomad to update the tags of Consul services; however, for that to happen, EnableTagOverride has to be set to true.
This change enables setting EnableTagOverride at the service level.
An older request describing this can be found in #2057
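For context (not part of this PR's diff), here is a minimal sketch of what the flag looks like on Consul's side when a service is registered through the `github.com/hashicorp/consul/api` Go client. With `EnableTagOverride: false` (the value Nomad currently always ends up with), external tag changes in the catalog are reverted by the agent's anti-entropy sync:

```go
package main

import (
	"log"

	"github.com/hashicorp/consul/api"
)

func main() {
	// Create a Consul client pointing at the default local agent address.
	client, err := api.NewClient(api.DefaultConfig())
	if err != nil {
		log.Fatal(err)
	}

	// EnableTagOverride lets processes outside the registering agent update
	// this service's tags in the catalog; it defaults to false when unset,
	// which is what this PR makes configurable from Nomad's service stanza.
	reg := &api.AgentServiceRegistration{
		Name:              "web",
		Port:              8080,
		Tags:              []string{"v1"},
		EnableTagOverride: true,
	}
	if err := client.Agent().ServiceRegister(reg); err != nil {
		log.Fatal(err)
	}
}
```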

preetapan and others added 30 commits April 22, 2019 15:29
In Nomad 0.9, we made volume driver handling the same for `""` and `"local"` volumes. Prior to Nomad 0.9, however, these had slightly different behaviour for relative paths and named volumes.

Prior to 0.9, the empty string would expand relative paths within the task dir, and `"local"` volumes that are not absolute paths would be treated as Docker named volumes.

This commit reverts to the previous behaviour as follows:

| Nomad Version | Driver  | Volume Spec      | Behaviour                 |
|---------------|---------|------------------|---------------------------|
| all           | ""      | testing:/testing | allocdir/testing          |
| 0.8.7         | "local" | testing:/testing | "testing" as named volume |
| 0.9.0         | "local" | testing:/testing | allocdir/testing          |
| 0.9.1         | "local" | testing:/testing | "testing" as named volume |
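A rough sketch of the restored behaviour for a `source:destination` volume spec such as `testing:/testing` (illustrative only; the function and its names are hypothetical, not the actual driver code):

```go
package volumes

import "path/filepath"

// resolveVolumeSource sketches the 0.8.7 behaviour restored by this commit
// for the host/source half of a "source:destination" Docker volume spec.
func resolveVolumeSource(volumeDriver, source, taskDir string) string {
	// Absolute host paths are bind-mounted as-is for every driver.
	if filepath.IsAbs(source) {
		return source
	}
	if volumeDriver == "" {
		// Empty driver: expand relative paths inside the alloc/task dir,
		// e.g. "testing" -> "<allocdir>/testing".
		return filepath.Join(taskDir, source)
	}
	// "local" (or any other named driver): pass "testing" through so Docker
	// treats it as a named volume.
	return source
}
```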
Here we retain the 0.8.7 behavior of waiting for driver fingerprints before registering a node, with some timeout. This is needed for system jobs, as system job scheduling for a node occurs at node registration, and the race might mean that a system job does not get placed on the node because of missing drivers.

The timeout isn't strictly necessary, but we raise it to 1 minute as that is closer to blocking indefinitely than 1 second is. We need to keep the value high enough to capture as many drivers/devices as possible, but low enough that it doesn't risk blocking too long due to a misbehaving plugin.

Fixes hashicorp#5579
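An illustrative sketch of that gating pattern (the helper name and channel are hypothetical, not the actual client code):

```go
package client

import "time"

// waitForFirstFingerprint blocks node registration on the initial
// driver/device fingerprint, bounded by a timeout.
func waitForFirstFingerprint(fingerprinted <-chan struct{}, timeout time.Duration) {
	select {
	case <-fingerprinted:
		// Every driver reported at least once, so the node registers with a
		// complete driver set and system jobs can be placed immediately.
	case <-time.After(timeout):
		// Raised from ~1s to 1m: long enough to capture most drivers and
		// devices, short enough that a misbehaving plugin cannot block
		// registration for too long.
	}
}
```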
I noticed that `watchNodeUpdates()` calls `retryRegisterNode()` almost immediately after `registerAndHeartbeat()`, well after 5 seconds.

This call is unnecessary and made debugging a bit harder. So here, we ensure that we only re-register the node for new node events, not for the initial registration.
Noticed that the `detected drivers` log line was misleading: when a driver doesn't fingerprint before the timeout, its health status is the empty string `""`, which we would mark as detected.

Now we log all drivers along with their state to ease driver fingerprint debugging.
Currently, when logmon fails to reattach, we will retry reattachment to
the same pid until the task restart specification is exhausted.

Because we cannot clear hook state during error conditions, it is not
possible for us to signal to a future restart that it _shouldn't_
attempt to reattach to the plugin.

Here we revert to explicitly detecting reattachment separately from a launch of a new logmon, so we can recover from scenarios where a logmon plugin has failed.

This is a net improvement over the current hard failure situation, as it
means in the most common case (the pid has gone away), we can recover.

Other reattachment failure modes where the plugin may still be running
could potentially cause a duplicate process, or a subsequent failure to launch
a new plugin.

If there was a duplicate process, it could potentially cause duplicate
logging. This is better than a production workload outage.

If there was a subsequent failure to launch a new plugin, it would fail in the same way (retry until restarts are exhausted) as the current failure mode.
Co-Authored-By: notnoop <mahmood@notnoop.com>
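As a rough sketch of that recovery pattern using the go-plugin client API (the helper and its signature are hypothetical, not the hook code actually changed here):

```go
package logmon

import "github.com/hashicorp/go-plugin"

// reattachOrLaunch attempts to reattach to an existing logmon plugin process
// and falls back to launching a fresh one if that fails. newConfig is assumed
// to return a ClientConfig with Cmd set for a fresh launch; go-plugin rejects
// configs with both Cmd and Reattach set, so Cmd is cleared when reattaching.
func reattachOrLaunch(newConfig func() *plugin.ClientConfig, reattach *plugin.ReattachConfig) (*plugin.Client, error) {
	if reattach != nil {
		cfg := newConfig()
		cfg.Cmd = nil
		cfg.Reattach = reattach
		client := plugin.NewClient(cfg)
		if _, err := client.Client(); err == nil {
			return client, nil // reattached to the still-running plugin
		}
		client.Kill()
		// Most common failure: the old pid is gone. Fall through and launch
		// a new plugin instead of retrying reattachment until the task's
		// restarts are exhausted.
	}
	client := plugin.NewClient(newConfig())
	if _, err := client.Client(); err != nil {
		return nil, err
	}
	return client, nil
}
```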
Fixes hashicorp#5593

Executor seems to die unexpectedly after the nomad agent dies or is restarted. The crash seems to occur at the first log message after the nomad agent dies.

To ease debugging, we forward executor log messages to executor.log as well as to Stderr. `go-plugin` sets up plugins with Stderr pointing to a pipe being read by the plugin client, the nomad agent in our case [1]. When the nomad agent dies, the pipe is closed, and any subsequent executor logs fail with ErrClosedPipe and a SIGPIPE signal. SIGPIPE results in the executor process dying.

I considered adding a handler to ignore SIGPIPE, but the hc-log library currently panics when a logging write operation fails [2].

Thus we opt to revert to the v0.8 behavior of exclusively writing logs to executor.log while we investigate alternative options.

[1] https://github.com/hashicorp/nomad/blob/v0.9.0/vendor/github.com/hashicorp/go-plugin/client.go#L528-L535
[2] https://github.com/hashicorp/nomad/blob/v0.9.0/vendor/github.com/hashicorp/go-hclog/int.go#L320-L323
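A minimal sketch of that v0.8-style setup using go-hclog (the file name matches the description above; the helper, directory parameter, and signature are assumptions):

```go
package executor

import (
	"os"
	"path/filepath"

	hclog "github.com/hashicorp/go-hclog"
)

// newFileLogger writes executor logs exclusively to executor.log instead of
// Stderr, so a closed go-plugin stderr pipe can no longer SIGPIPE the
// executor on its next log write.
func newFileLogger(dir string) (hclog.Logger, error) {
	f, err := os.OpenFile(filepath.Join(dir, "executor.log"),
		os.O_CREATE|os.O_APPEND|os.O_WRONLY, 0644)
	if err != nil {
		return nil, err
	}
	return hclog.New(&hclog.LoggerOptions{
		Name:   "executor",
		Level:  hclog.Debug,
		Output: f, // file only; do not also write to the stderr pipe
	}), nil
}
```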
driver/docker: collect tty container logs
…-errs-2

logmon: recover from shutting down call locally
…omad into consul_enable_tag_override_master

# Conflicts:
#	CHANGELOG.md
#	client/client.go
#	command/agent/bindata_assetfs.go
#	drivers/docker/docklog/docker_logger.go
#	drivers/docker/driver.go
#	version/version.go
@hashicorp-cla

hashicorp-cla commented Jun 4, 2019

CLA assistant check

Thank you for your submission! We require that all contributors sign our Contributor License Agreement ("CLA") before we can accept the contribution. Read and sign the agreement

Learn more about why HashiCorp requires a CLA and what the CLA includes


5 out of 6 committers have signed the CLA.

  • preetapan
  • notnoop
  • endocrimes
  • schmichael
  • cgbaker
  • Nomad Release bot

Nomad Release bot seems not to be a GitHub user.
You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.

Have you signed the CLA already but the status is still pending? Recheck it.

@bogdanalbei
Author

Created a different PR with a cleaner commit history: #5775

@bogdanalbei bogdanalbei closed this Jun 4, 2019
@github-actions

github-actions bot commented Feb 9, 2023

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Feb 9, 2023