Backport of eval delete: move batching of deletes into RPC handler and state into release/1.4.x #15247

hc-github-team-nomad-core · 2022-11-14T19:08:40Z

Backport

This PR is auto-generated from #15117 to be assessed for backporting due to the inclusion of the label backport/1.4.x.

The below text is copied from the body of the original PR.

During unusual outage recovery scenarios on large clusters, a backlog of millions of evaluatons can appear. In these cases, the eval delete command can put excessive load on the cluster by listing large sets of evals to extract the IDs and then sending larges batches of IDs. Although the command's batch size was carefully tuned, we still need to be JSON deserialize, reserialize to messagepack, send the log entries through raft, and get the FSM applied.

To improve performance of this recovery case, move the batching process into the RPC handler and the state store. The design here is a little weird, so let's look at the failed options first:

A naive solution here would be to just send the filter as the raft request and let the FSM apply delete the whole set in a single operation. Benchmarking with 1M evals on a 3 node cluster demonstrated this can block the FSM apply for several minutes, which puts the cluster at risk if there's a leadership failover (the barrier write can't be made while this apply is in-flight).
A less naive but still bad solution would be to have the RPC handler filter and paginate, and then hand a list of IDs to the existing raft log entry. Benchmarks showed this blocked the FSM apply for 20-30s at a time and took roughly an hour to complete.

Instead, we're filtering and paginating in the RPC handler to find a page token, and then passing both the filter and page token in the raft log. The FSM apply recreates the paginator using the filter and page token to get roughly the same page of evaluations, which it then deletes. The pagination process is fairly cheap (only about 5% of the total FSM apply time), so counter-intuitively this rework of the pagination ends up being much faster. A benchmark of 1M evaluations showed this blocked the FSM apply for 20-30ms at a time (typical for normal operations) and completes in less than 4 minutes.

Note that, as with the existing design, this delete is not consistent: a new evaluation inserted "behind" the cursor of the pagination will fail to be deleted.

github-actions · 2023-03-16T02:14:46Z

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

tgross added 7 commits November 7, 2022 14:01

backport of commit 285d39f

1b69398

backport of commit 58e426b

3f8c6eb

backport of commit d71c26e

fc1ef93

backport of commit a93dbf4

255f1dc

backport of commit 12cb7ca

d6c2d35

backport of commit aa7239a

47f6c8a

backport of commit b71440a

f268825

hc-github-team-nomad-core force-pushed the backport/eval-safe-delete-filter/constantly-ace-mantis branch from 1cb8aae to f268825 Compare November 14, 2022 19:08

hc-github-team-nomad-core merged commit a13a2a5 into release/1.4.x Nov 14, 2022

hc-github-team-nomad-core deleted the backport/eval-safe-delete-filter/constantly-ace-mantis branch November 14, 2022 19:08

vercel bot deployed to Preview – nomad November 14, 2022 19:13 View deployment

vercel bot deployed to Preview – nomad-storybook-and-ui November 14, 2022 19:20 View deployment

github-actions bot locked as resolved and limited conversation to collaborators Mar 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backport of eval delete: move batching of deletes into RPC handler and state into release/1.4.x #15247

Backport of eval delete: move batching of deletes into RPC handler and state into release/1.4.x #15247

hc-github-team-nomad-core commented Nov 14, 2022

github-actions bot commented Mar 16, 2023

Backport of eval delete: move batching of deletes into RPC handler and state into release/1.4.x #15247

Backport of eval delete: move batching of deletes into RPC handler and state into release/1.4.x #15247

Conversation

hc-github-team-nomad-core commented Nov 14, 2022

Backport

github-actions bot commented Mar 16, 2023