scheduler: Revert requireCanary logic #8867

Merged
merged 2 commits into from
Sep 15, 2020


notnoop
Contributor

@notnoop notnoop commented Sep 10, 2020

Revert the requireCanary check introduced in https://github.com/hashicorp/nomad/pull/8691/files#diff-1801138ac4d10f2064ba6f2e434ac9b4L430-R431 .

The change was intended to fix a case where a canary alloc may fail to
be rescheduled if all the other allocs fail as well (e.g. if all allocs
happen to be placed on a node that died).

However, it introduced an unintended side effect: when every alloc of a successfully deployed service is migrated or lost at once, the scheduler unexpectedly places canary allocs. A service with a single alloc hits this whenever that alloc is to be migrated. Services with multiple allocs are affected when all of them are migrated or lost together (e.g. they all run on a single client that is drained).
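To illustrate the side effect, here is a minimal Go sketch. It is a simplified model, not the actual reconciler code: `groupState`, `canaryBudgetOpen`, `requireCanaryReverted`, and `requireCanaryBroadened` are hypothetical names, and the broadened condition is an assumption reconstructed from the description above rather than copied from the diff.

```go
package main

import "fmt"

// groupState is a hypothetical, simplified summary of one task group as the
// scheduler might see it. It is not a Nomad type.
type groupState struct {
	Destructive     int  // allocs that need a destructive update (job spec changed)
	Untainted       int  // allocs that stay in place on healthy nodes
	MigrateOrLost   int  // allocs on draining or dead nodes
	Canaries        int  // canaries already placed
	DesiredCanaries int  // canary count from the update strategy (0 means none)
	Promoted        bool // whether canaries were already promoted
}

// canaryBudgetOpen reports whether the strategy still allows more canaries.
func canaryBudgetOpen(s groupState) bool {
	return s.DesiredCanaries > 0 && s.Canaries < s.DesiredCanaries && !s.Promoted
}

// requireCanaryReverted models the behavior after this revert (assumption):
// only destructive updates trigger canaries.
func requireCanaryReverted(s groupState) bool {
	return s.Destructive != 0 && canaryBudgetOpen(s)
}

// requireCanaryBroadened models the reverted check (assumption): canaries are
// also triggered when no untainted alloc remains and some are migrated or lost.
func requireCanaryBroadened(s groupState) bool {
	trigger := s.Destructive != 0 || (s.Untainted == 0 && s.MigrateOrLost != 0)
	return trigger && canaryBudgetOpen(s)
}

func main() {
	// A service with one alloc, no job change, whose client is being drained.
	drained := groupState{MigrateOrLost: 1, DesiredCanaries: 1}

	fmt.Println(requireCanaryBroadened(drained)) // true: a canary is placed unexpectedly
	fmt.Println(requireCanaryReverted(drained))  // false: the alloc is simply migrated
}
```

In this model a group that still has untainted allocs (for example `Untainted: 2, MigrateOrLost: 1`) does not trigger canaries under either variant, which matches the observation that multi-alloc services are affected only when all allocs move at once.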

I added a test case that reproduces the failure. It fails in https://app.circleci.com/pipelines/github/hashicorp/nomad/11698/workflows/e90b6ba6-aad3-484d-8042-b1dce5239e50/jobs/99179 but the final build is green.

I'm reverting the change for now and will investigate further.

Fixes #8866 .

Mahmood Ali added 2 commits September 10, 2020 14:59
…hicorp/nomad/pull/8691/files#diff-1801138ac4d10f2064ba6f2e434ac9b4L430-R431 .

The change was intended to fix a case where a canary alloc may fail to
be rescheduled if all the other allocs fail as well (e.g. if all allocs
happen to be placed on a node that died).  However, it introduced some
unintended side-effects.

Reverting the change for now and will investigate further.
@notnoop notnoop self-assigned this Sep 10, 2020
@notnoop notnoop merged commit 49a4618 into master Sep 15, 2020
@notnoop notnoop deleted the b-canary-substitution branch September 15, 2020 17:58
@notnoop notnoop added this to the 0.12.5 milestone Sep 15, 2020
notnoop pushed a commit that referenced this pull request Sep 16, 2020
scheduler: Revert requireCanary logic
teutat3s pushed a commit to teutat3s/nomad that referenced this pull request Oct 27, 2020
teutat3s pushed a commit to teutat3s/nomad that referenced this pull request Dec 9, 2020
teutat3s pushed a commit to teutat3s/nomad that referenced this pull request Jan 16, 2021
teutat3s pushed a commit to teutat3s/nomad that referenced this pull request Jan 17, 2021
@github-actions

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Dec 18, 2022
Development

Successfully merging this pull request may close these issues.

Nomad 0.12.4 stop allocation issue