Normalize plan before sending to increase the plan apply throughput #5407

Closed · wants to merge 7 commits

Conversation

arshjohar (Contributor) commented:

This PR adds normalization of the plan so that only the diff for stopped and preempted allocs is committed to the raft log, enabling better throughput. It also starts using omitempty on some of the structs during msgpack serialization to omit empty fields.
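
To illustrate the omitempty part, here is a minimal, self-contained sketch using the hashicorp/go-msgpack codec package; the cut-down Alloc type and its fields are assumptions, not the real structs.Allocation definition:

package main

import (
    "fmt"

    "github.com/hashicorp/go-msgpack/codec"
)

// Alloc is a cut-down stand-in for structs.Allocation (fields assumed).
type Alloc struct {
    // The codec library reads struct-level options from a blank "_struct"
    // field; ",omitempty" tells the encoder to skip zero-valued fields.
    _struct bool `codec:",omitempty"`

    ID            string
    NodeID        string
    JobID         string
    DesiredStatus string
}

func main() {
    var h codec.MsgpackHandle

    full := Alloc{ID: "a1", NodeID: "n1", JobID: "j1", DesiredStatus: "stop"}
    diff := Alloc{ID: "a1", DesiredStatus: "stop"} // diff-only payload

    var fullBuf, diffBuf []byte
    if err := codec.NewEncoderBytes(&fullBuf, &h).Encode(full); err != nil {
        panic(err)
    }
    if err := codec.NewEncoderBytes(&diffBuf, &h).Encode(diff); err != nil {
        panic(err)
    }

    // The diff encoding omits the empty fields entirely, so a raft log
    // entry for a stopped or preempted alloc carries only what changed.
    fmt.Printf("full: %d bytes, diff: %d bytes\n", len(fullBuf), len(diffBuf))
}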

jrasell and others added 7 commits March 4, 2019 12:01
Currently, when operators need to log onto a machine where an alloc
is running, they must perform an alloc/job status call and then a
second call to discover the node name from the node list.

This updates both the job status and alloc status output to include
the node name, making operator use easier.

Closes #2359
Closes #1180
    },
    Deployment:        plan.Deployment,
    DeploymentUpdates: plan.DeploymentUpdates,
    EvalID:            plan.EvalID,
    NodePreemptions:   preemptedAllocs,
}

if h.optimizePlan {
arshjohar (Contributor, Author) commented:
I didn't find any usages of the SubmitPlan method of Harness, but changed the code to support the newer format of the struct.
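
For context on what the harness now has to model, here is a sketch of the normalization idea; the helper name and the exact set of retained fields are assumptions. The FSM can rehydrate the rest of the alloc from its own state store by ID, so the raft entry only needs the fields that changed.

// Allocation is a cut-down stand-in for structs.Allocation.
type Allocation struct {
    ID                    string
    NodeID                string
    JobID                 string
    DesiredDescription    string
    PreemptedByAllocation string
}

// normalizePreemptedAlloc (name assumed) strips a preempted alloc down
// to the fields the FSM needs to apply the change; everything else is
// already in the state store under the same ID.
func normalizePreemptedAlloc(a *Allocation) *Allocation {
    return &Allocation{
        ID:                    a.ID,
        PreemptedByAllocation: a.PreemptedByAllocation,
    }
}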

@schmichael (Member) left a comment:

Not quite done. Looks good so far and made me think of another optimization that might have a tiny impact: #5452

Will finish up ASAP. Great work @arshjohar!

Resolved review threads:
- nomad/util.go
- nomad/plan_apply.go
- nomad/plan_apply_test.go (2)
- nomad/plan_normalization_test.go
- nomad/state/state_store.go
- nomad/state/state_store_test.go (2)
- nomad/structs/structs.go (2)
@schmichael (Member) left a second comment:

I didn't spot any logic errors. I think it's worth considering new types for the Alloc fragments once more. The stopped/preempted allocs use so few fields that I can't imagine many methods are reused from Alloc. Using independent types gives us extra protection against nil-pointer panics on the servers caused by a developer assuming an Allocation is fully hydrated when it's not.

Otherwise, please comment as many funcs/methods as possible in the form:

// FuncName something something something.
func FuncName() {}

Resolved review thread: nomad/structs/structs.go
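
A minimal sketch of the independent-type suggestion (the AllocationDiff name and its fields are assumptions): with a distinct type, a partially hydrated value can no longer be passed where a full *Allocation is expected, so the mistake surfaces at compile time rather than as a nil-pointer panic on a server.

// AllocationDiff (name assumed) carries only what a stopped or
// preempted alloc writes to the raft log.
type AllocationDiff struct {
    ID                    string
    DesiredDescription    string
    PreemptedByAllocation string
}

// Rehydrate applies the diff to the fully hydrated copy the server
// already holds in its state store, returning a complete Allocation.
// Allocation here is the cut-down stand-in from the earlier sketch.
func (d *AllocationDiff) Rehydrate(existing *Allocation) *Allocation {
    out := *existing
    out.DesiredDescription = d.DesiredDescription
    out.PreemptedByAllocation = d.PreemptedByAllocation
    return &out
}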
@DingoEatingFuzz (Contributor) commented:

@arshjohar, the 0.9.1-dev branch has been merged into master. Please reopen this PR against master.

    return fmt.Errorf("alloc lookup failed: %v", err)
}
if alloc == nil {
    continue
A Contributor commented:

This should return an error that bubbles up so that the plan apply fails and the worker is forced to do an index refresh. Otherwise, if there's a race between a forced garbage collection and the scheduler making an update to the alloc, the alloc could be gone from the state store before it gets here, and this would silently return true even though the update didn't actually make it.
cc @dadgar to double-check the above ^

Another Contributor replied:

I think it is safer to return an error here, as it indicates that the scheduler made an update based on stale information. I don't think it is likely to ever be hit, though: the alloc not being in the state store means it has been GC'd, which is only possible if the user force-GC'd between the scheduler snapshot and the plan apply.
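
A sketch of the fix being discussed, following the shape of the fragment above (snap.AllocByID is the state-snapshot lookup; the error messages and surrounding signature are assumptions):

alloc, err := snap.AllocByID(ws, id)
if err != nil {
    return fmt.Errorf("alloc lookup failed: %v", err)
}
if alloc == nil {
    // Failing the plan apply forces the worker to refresh its state
    // index instead of silently applying an update to a GC'd alloc.
    return fmt.Errorf("alloc %q not found in state store", id)
}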

@github-actions (bot) commented:

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Feb 12, 2023