Evaluation broker #282
Conversation
No significant problems. One potential (but unlikely) race condition and some comments about logging.
👍👍👍
```
@@ -15,6 +15,12 @@ func decodeFile(file string, p *sdk.ScalingPolicy) error {
        return err
    }

    // Assume file policies are cluster policies unless specified.
    // TODO: revisit this assumption.
```
This is as good as anything... Until we get our hands dirty with this, I'm not sure which policy type we'd be more likely to put in a file source (and, therefore, which one should be favored with the "default" type).
```
@@ -199,6 +201,12 @@ func (s *Source) canonicalizePolicy(p *sdk.ScalingPolicy) {
        return
    }

    // Assume a policy coming from Nomad without a type is a horizontal policy.
    // TODO: review this assumption.
```
This one makes sense... Moving forward with the next Nomad release, all policies should have a type. So a Nomad policy without a type is from a previous version of Nomad, and the only policy type in previous versions of Nomad was "horizontal".
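For reference, a minimal sketch of the defaulting that both of the diff comments above describe. The helper name is made up and the string values simply mirror the comments; the real code presumably uses constants from the sdk package rather than literals.

```go
package policy

import "github.com/hashicorp/nomad-autoscaler/sdk"

// setDefaultPolicyType is a reader's sketch, not the PR's actual code:
// untyped policies loaded from files default to "cluster", while untyped
// policies read from the Nomad API default to "horizontal" (the only type
// older Nomad versions could produce).
func setDefaultPolicyType(p *sdk.ScalingPolicy, fromFile bool) {
	if p.Type != "" {
		// An explicit type always wins; only fill in the blanks.
		return
	}
	if fromFile {
		p.Type = "cluster"
		return
	}
	p.Type = "horizontal"
}
```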
```
    default:
    }

    eval, token, err := w.broker.Dequeue(ctx, w.queue)
```
This is a blocking call, correct?
Yup, it will block until there's work in the queue or return nil if `ctx` is closed.
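A rough sketch of how a worker loop might sit around that blocking call. Only Dequeue's shape comes from the diff; the broker interface, the Ack/Nack signatures, and the stand-in types below are assumptions for illustration.

```go
package policyeval

import (
	"context"
	"log"
)

// evaluation is a stand-in for the autoscaler's policy evaluation type; the
// real worker uses a type from the project's sdk package.
type evaluation struct {
	ID string
}

// evalBroker lists just the broker methods this sketch needs. The Ack/Nack
// signatures are assumptions; only Dequeue's shape comes from the diff above.
type evalBroker interface {
	Dequeue(ctx context.Context, queue string) (*evaluation, string, error)
	Ack(evalID, token string) error
	Nack(evalID, token string) error
}

// worker is a simplified stand-in for the PR's eval worker.
type worker struct {
	broker evalBroker
	queue  string
}

// run loops over the blocking Dequeue call shown in the diff: Dequeue parks
// until an eval is available, or returns a nil eval once ctx is closed.
func (w *worker) run(ctx context.Context) {
	for {
		eval, token, err := w.broker.Dequeue(ctx, w.queue)
		if err != nil {
			log.Printf("failed to dequeue evaluation: %v", err)
			continue
		}

		// A nil eval means ctx was cancelled while Dequeue was blocking.
		if eval == nil {
			return
		}

		if err := w.handleEval(ctx, eval); err != nil {
			// NACK so the broker can hand the eval to another worker.
			w.broker.Nack(eval.ID, token)
			continue
		}
		w.broker.Ack(eval.ID, token)
	}
}

// handleEval would run the actual policy evaluation; elided in this sketch.
func (w *worker) handleEval(ctx context.Context, eval *evaluation) error {
	return nil
}
```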
```
@@ -70,6 +73,10 @@ func (a *Agent) Run() error {
    policyEvalCh := a.setupPolicyManager()
    go a.policyManager.Run(ctx, policyEvalCh)

    // Launch eval broker and workers.
    a.evalBroker = policyeval.NewBroker(a.logger.ResetNamed("policy_eval"), 5*time.Minute, 3)
    a.initWorkers(ctx)
```
May want to make this dynamic later, for autoscaling the number of brokers or simply SIGHUP'ing the config.
Yeah, I was trying to decide between a global config or a per-policy value.
I think per-policy would be better since it can be configured as needed, but global is appealing because you can just set it to your worst-case value.
So probably both? 😄
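A tiny sketch of the "probably both" idea, using the ACK deadline from the PR description as the example; none of these knobs exist in this PR, and the names are hypothetical.

```go
package policyeval

import "time"

// ackDeadlineFor is a purely hypothetical resolver: use a per-policy ACK
// deadline when one is configured, otherwise fall back to a global
// agent-level default. The same pattern could apply to worker counts.
func ackDeadlineFor(globalDefault time.Duration, perPolicy map[string]time.Duration, policyID string) time.Duration {
	if d, ok := perPolicy[policyID]; ok && d > 0 {
		return d
	}
	return globalDefault
}
```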
Minor comments alongside those of @cgbaker, but it's looking very nice! 🥳
changelog: add entry for #282.
This PR introduces the evaluation broker, which is responsible for storing, deduping and controlling the distribution and flow of policy evaluation requests to workers.
The eval requests are stored in a set of heaps, sorted by priority and create time (FIFO).
Evals are picked from the broker by workers, which must ACK the evaluation if it completes successfully or NACK it otherwise. If an ACK doesn't arrive within the deadline (5 minutes for now, but it should probably be configurable per policy), the eval is considered NACK'd.
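To illustrate the ordering described above, here is a sketch of how one of those heaps might compare entries. The types are a reader's stand-in rather than the PR's internals, and only the sort.Interface half of a container/heap implementation is shown (Push/Pop omitted).

```go
package policyeval

import "time"

// pendingEval is a stand-in for an entry in one of the broker's heaps.
type pendingEval struct {
	ID         string
	Priority   int
	CreateTime time.Time
}

// evalHeap orders pending evals as the description above explains: higher
// priority first, and FIFO (older CreateTime first) within the same priority.
type evalHeap []*pendingEval

func (h evalHeap) Len() int      { return len(h) }
func (h evalHeap) Swap(i, j int) { h[i], h[j] = h[j], h[i] }

func (h evalHeap) Less(i, j int) bool {
	if h[i].Priority != h[j].Priority {
		return h[i].Priority > h[j].Priority // higher priority dequeues first
	}
	return h[i].CreateTime.Before(h[j].CreateTime) // FIFO within a priority
}
```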
A new agent configuration block was also added to control the number of workers for each policy type:
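The excerpt cuts off before the block itself; it presumably looks something like the following, though the block and attribute names here are guesses rather than text copied from the PR.

```hcl
# Hypothetical agent configuration: one worker-count entry per policy type.
policy_eval {
  workers = {
    cluster    = 2
    horizontal = 10
  }
}
```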