Serialize Agent tasks #755

bcwaldon · 2014-08-07T18:34:52Z

bcwaldon · 2014-08-07T18:35:33Z

@jonboulle What do you think of this approach? There may still be some rough edges here.

jonboulle · 2014-08-07T18:41:51Z

agent/task.go

+		return nil, errors.New("task already in flight")
+	}
+
+	if t.Job == nil {


this needs to be on line 60

Yep, moved things around last-minute.

Will add more testing of this.
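For readers following the hunk above, here is a minimal sketch of the kind of guard being discussed. It assumes the point of the review comment is that the nil-Job check should run before the job is used. The `job`, `task`, and `taskManager` types and the `Start`/`Finish` methods are hypothetical stand-ins rather than the code in agent/task.go, and keying the in-flight set by job name is also an assumption.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// job and task are hypothetical stand-ins for the agent's own types.
type job struct{ Name string }

type task struct {
	Type string
	Job  *job
}

// taskManager tracks which jobs currently have a task in flight.
// Keying by job name is an assumption made for this sketch.
type taskManager struct {
	mu       sync.Mutex
	inFlight map[string]bool
}

func newTaskManager() *taskManager {
	return &taskManager{inFlight: make(map[string]bool)}
}

// Start validates t before touching t.Job, then claims the job's slot.
func (tm *taskManager) Start(t *task) error {
	if t == nil || t.Job == nil {
		return errors.New("task has no job")
	}
	tm.mu.Lock()
	defer tm.mu.Unlock()
	if tm.inFlight[t.Job.Name] {
		return errors.New("task already in flight")
	}
	tm.inFlight[t.Job.Name] = true
	return nil
}

// Finish releases the slot claimed by a previously accepted task.
func (tm *taskManager) Finish(t *task) {
	tm.mu.Lock()
	defer tm.mu.Unlock()
	delete(tm.inFlight, t.Job.Name)
}

func main() {
	tm := newTaskManager()
	load := &task{Type: "LoadJob", Job: &job{Name: "foo.service"}}
	fmt.Println(tm.Start(load))                    // accepted: slot was free
	fmt.Println(tm.Start(load))                    // rejected: already in flight
	fmt.Println(tm.Start(&task{Type: "StartJob"})) // rejected: nil Job
	tm.Finish(load)
	fmt.Println(tm.Start(load)) // accepted again after Finish
}
```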

jonboulle · 2014-08-07T18:53:01Z

Seems OK. At first the lack of error handling around a lot of the edges scared me, but then on reading the existing code I realised stuff isn't handled anyway, soo

jonboulle · 2014-08-07T18:53:42Z

I mean, to be clear, the model is "operations can fail, failures are unhandled, the next reconciliation will clean up"

bcwaldon · 2014-08-07T21:19:16Z

You are correct, the repetitive reconciliation will pound units into submission over time, so we don't need to handle task failures as intelligently right now.
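The model described here ("operations can fail, failures are unhandled, the next reconciliation will clean up") can be illustrated with a small sketch. The `reconcile` and `flakyStarter` functions below are hypothetical and much simpler than the agent's reconciler; they only show why a failed operation needs no retry logic of its own when reconciliation repeats.

```go
package main

import (
	"errors"
	"fmt"
)

// reconcile starts every desired unit that is not yet running. A failed start
// is reported but not retried here; the next pass simply tries again.
func reconcile(desired []string, running map[string]bool, start func(string) error) []error {
	var errs []error
	for _, name := range desired {
		if running[name] {
			continue
		}
		if err := start(name); err != nil {
			errs = append(errs, fmt.Errorf("start %s: %v", name, err))
			continue
		}
		running[name] = true
	}
	return errs
}

// flakyStarter returns a start function whose first call for each unit fails,
// simulating a transient error.
func flakyStarter() func(string) error {
	attempts := make(map[string]int)
	return func(name string) error {
		attempts[name]++
		if attempts[name] == 1 {
			return errors.New("transient failure")
		}
		return nil
	}
}

func main() {
	desired := []string{"foo.service", "bar.service"}
	running := make(map[string]bool)
	start := flakyStarter()
	for pass := 1; pass <= 2; pass++ {
		errs := reconcile(desired, running, start)
		fmt.Printf("pass %d: %d failures, %d running\n", pass, len(errs), len(running))
	}
}
```

In this sketch both starts fail on the first pass and succeed on the second, so the state converges without any per-task error handling.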

bcwaldon · 2014-08-07T21:27:40Z

@jonboulle hit everything

jonboulle · 2014-08-07T23:05:16Z

agent/agent.go

+	a.um.Stop(jobName)
+
+	a.uGen.Unsubscribe(jobName)
+	a.registry.RemoveUnitState(jobName)


Actually, this can be removed entirely now, right? UnitStateGenerator should clean it up

Ooooh, yes it can.

jonboulle · 2014-08-07T23:11:26Z

Looks good.

bcwaldon · 2014-08-08T16:17:00Z

@jonboulle I'd like to consider one addition to this. In master today, a LoadJob and StartJob will be executed back-to-back. With the task manager, only one task can be in flight, so the StartJob will be rejected. We have to wait for a subsequent reconciliation to get that StartJob to run, which essentially introduces a 10s lag in starting jobs. What if tasks passed to taskManager.Do were not simply dropped while another task was in flight, but were queued? The queue would be of size 1, and any call to Do would replace the currently-queued task. This would give us the ability to have a LoadJob in flight, with the StartJob on deck, ready to go whenever the LoadJob finishes (successfully). Eh?

jonboulle · 2014-08-08T20:39:18Z

"The queue would be of size 1, and any call to Do would replace the currently-queued task."

I'm not sure this is nuanced enough - we can't silently drop tasks. What if we have, say, LoadJob, StartJob, UnloadJob, LoadJob, StartJob in quick succession, with the original LoadJob being in flight for an extended period?

bcwaldon · 2014-08-08T20:42:59Z

@jonboulle In your example, the final StartJob would be queued at the end. The intermediate tasks weren't going to be fulfilled anyway, given the nature of the reconciler.

It is clear that this is an optimization that could hurt us, especially if we don't take into account the hash of the referenced unit (it could change over time). How do you feel about the proposal if we don't replace the queued task? Still a queue size of 1 to speed up the fleetctl start foo.service use-case, but we don't try to be any more helpful for now.

jonboulle · 2014-08-08T21:14:04Z

Yes, I think that's a reasonable compromise for now.
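The compromise agreed on above (one task in flight, one queued, and no replacement of the queued task) might look roughly like the following sketch. Everything in it is hypothetical: the `taskManager` type, its `Do` and `work` methods, `errQueueFull`, and the `demo` helper are illustrations of the idea, not the code that was merged. Dropping the queued task when the in-flight one fails follows the "(successfully)" condition in the proposal.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

var errQueueFull = errors.New("task rejected: one in flight and one queued")

type task struct {
	name string
	run  func() error
}

// taskManager runs one task at a time and holds at most one more on deck.
type taskManager struct {
	mu   sync.Mutex
	busy bool
	next *task
	wg   sync.WaitGroup
}

// Do starts t if idle, queues it if the single slot is free, else rejects it.
// A queued task is never replaced by a later call.
func (tm *taskManager) Do(t task) error {
	tm.mu.Lock()
	defer tm.mu.Unlock()
	switch {
	case !tm.busy:
		tm.busy = true
		tm.wg.Add(1)
		go tm.work(t)
	case tm.next == nil:
		tm.next = &t
	default:
		return errQueueFull
	}
	return nil
}

// work runs t, then the queued task (if any) only when t succeeded.
func (tm *taskManager) work(t task) {
	defer tm.wg.Done()
	for {
		err := t.run()
		tm.mu.Lock()
		if err != nil {
			tm.next = nil // drop the dependent task on failure
		}
		if tm.next == nil {
			tm.busy = false
			tm.mu.Unlock()
			return
		}
		t, tm.next = *tm.next, nil
		tm.mu.Unlock()
	}
}

// demo submits three tasks while the first is still blocked in flight.
func demo() (results []error, ran []string) {
	tm := &taskManager{}
	release := make(chan struct{})
	record := func(name string, wait bool) task {
		return task{name: name, run: func() error {
			if wait {
				<-release // hold the task in flight until released
			}
			ran = append(ran, name)
			return nil
		}}
	}
	results = append(results, tm.Do(record("LoadJob", true)))
	results = append(results, tm.Do(record("StartJob", false)))
	results = append(results, tm.Do(record("UnloadJob", false)))
	close(release)
	tm.wg.Wait()
	return results, ran
}

func main() {
	results, ran := demo()
	fmt.Println(results)
	fmt.Println(ran)
}
```

In `demo`, the LoadJob is accepted and blocks, the StartJob takes the queued slot, and the UnloadJob is rejected; a later reconciliation would be responsible for resubmitting it.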

bcwaldon · 2014-08-12T17:29:33Z

rebased on master and squashed the squishables

jonboulle · 2014-08-12T22:12:58Z

agent/reconcile.go

+	}
+
+	go func() {
+		for res := range reschan {


What's the motivation for even using reschan? Seems like it'd be simpler to just log errors in the taskmanager?

I want to log the result in the context of the AgentReconciler. Maybe it's silly, but I don't want the task manager logging anything.
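The separation argued for here, where the task manager only reports results and the caller decides how to log them, can be sketched as follows. The names `taskResult`, `runTasks`, `logResults`, `okTask`, and `failTask` are hypothetical, and `logResults` is called synchronously in `main` for deterministic output, whereas the hunk above drains the channel inside a `go func()`.

```go
package main

import (
	"errors"
	"fmt"
)

type task struct {
	name string
	run  func() error
}

type taskResult struct {
	task string
	err  error
}

// okTask and failTask build trivial tasks for the demonstration.
func okTask(name string) task {
	return task{name: name, run: func() error { return nil }}
}

func failTask(name, msg string) task {
	return task{name: name, run: func() error { return errors.New(msg) }}
}

// runTasks executes tasks in order and sends each outcome on the returned
// channel. It logs nothing itself and closes the channel when it is done.
func runTasks(tasks []task) <-chan taskResult {
	reschan := make(chan taskResult)
	go func() {
		defer close(reschan)
		for _, t := range tasks {
			reschan <- taskResult{task: t.name, err: t.run()}
		}
	}()
	return reschan
}

// logResults is the caller-side loop: it drains reschan and logs each result
// through logf, so the log lines carry the caller's context.
func logResults(reschan <-chan taskResult, logf func(format string, args ...interface{})) {
	for res := range reschan {
		if res.err != nil {
			logf("failed task %s: %v", res.task, res.err)
			continue
		}
		logf("finished task %s", res.task)
	}
}

func main() {
	tasks := []task{okTask("LoadJob"), failTask("StartJob", "unit not found")}
	logResults(runTasks(tasks), func(format string, args ...interface{}) {
		fmt.Printf("AgentReconciler: "+format+"\n", args...)
	})
}
```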

jonboulle · 2014-08-20T01:24:15Z

justgoforit

bcwaldon force-pushed the serialize-tasks branch 3 times, most recently from 2caf619 to 28b9dbc (August 20, 2014 01:21)

bcwaldon force-pushed the serialize-tasks branch from 28b9dbc to 28a6a2e (August 20, 2014 01:38)

bcwaldon added 4 commits August 19, 2014 18:40

agent: simplify desired state construction

bb93c50

agent: move task-related code to task.go

edec798

agent: serialize tasks through taskManager

2915ab2

agent: model dependent tasks using taskChains

ee666f1

bcwaldon force-pushed the serialize-tasks branch from 28a6a2e to ee666f1 (August 20, 2014 01:48)

bcwaldon added a commit that referenced this pull request Aug 20, 2014

Merge pull request #755 from bcwaldon/serialize-tasks

61e9334

Serialize Agent tasks

bcwaldon merged commit 61e9334 into coreos:master Aug 20, 2014

bcwaldon deleted the serialize-tasks branch August 20, 2014 01:48

bcwaldon mentioned this pull request Aug 20, 2014

Serialize systemd jobs properly #646

Closed
