[STRMCMP-558] Event improvements #44

mwylde · 2019-07-11T00:58:45Z

This PR makes two improvements to the events that the operator emits for FlinkApplication resources.

First, it ensures that all deploy-related events are unique to a particular deploy (generally by including the hash of the resource in the message). This ensures that if two deploys occur in rapid succession, the second one will still produce events (if more than 10 events with the same message are emitted within a 10 minute period, later events are dropped). This prevents a situation where a developer does a deploy, encounters an error that produces several events, tries the deploy again but this time doesn't seen any events because we have already exhausted our limit.

The second change is to fix the reason field of the events we emit. When I originally added events I misunderstood what the reason field was. Re-reading the docs, I now understand it to be basically a machine-readable code for the event, with the message as its human-readable counterpart:

'reason' is the reason this event is generated. 'reason' should be short and unique; it should be in UpperCamelCase format (starting with a capital letter). "reason" will be used to automate handling of events, so imagine people writing switch statements to handle them. You want to make that easy.

Using unique reasons for each event also prevents distinct events from being improperly combined.

pkg/controller/flink/flink.go

anandswaminathan · 2019-07-11T18:58:19Z

pkg/controller/flink/flink.go

-			fmt.Sprintf("Failed to create job managers: %v", err))
+		f.LogEvent(ctx, application, corev1.EventTypeWarning, "CreateClusterFailed",
+			fmt.Sprintf("Failed to create job managers for deploy %s: %v",
+				HashForApplication(application), err))


Nit: hash := HashForApplication(app), and use hash.

anandswaminathan · 2019-07-11T18:58:47Z

pkg/controller/flink/flink.go

 		return err
 	}

 	if newlyCreatedJm || newlyCreatedTm {
-		f.LogEvent(ctx, application, corev1.EventTypeNormal, "Flink cluster created")
+		f.LogEvent(ctx, application, corev1.EventTypeNormal, "CreatingCluster",


Nit, Move reason to constant variable ?

anandswaminathan · 2019-07-11T18:58:55Z

pkg/controller/flink/flink.go

@@ -388,7 +391,8 @@ func (f *Controller) DeleteOldResourcesForApp(ctx context.Context, app *v1alpha1
 	}

 	for k := range deletedHashes {
-		f.LogEvent(ctx, app, corev1.EventTypeNormal, fmt.Sprintf("Deleted old cluster with hash %s", k))
+		f.LogEvent(ctx, app, corev1.EventTypeNormal, "ToreDownCluster",


Same. Nit, Move reason to constant variable ?

I don't really see any reason to. It's only used here and is descriptive of this event.

Nothing major. Just a nit, so that they can be reused later.

anandswaminathan · 2019-07-11T20:56:16Z

+1

Micah Wylde added 2 commits July 10, 2019 15:59

Make events unique across deploys

8793965

Use unique reason for different events

2ca155b

mwylde requested review from anandswaminathan, glaksh100 and kumare3 as code owners July 11, 2019 00:58

glaksh100 reviewed Jul 11, 2019

View reviewed changes

pkg/controller/flink/flink.go Show resolved Hide resolved

anandswaminathan reviewed Jul 11, 2019

View reviewed changes

mwylde merged commit 1a62f03 into master Jul 11, 2019

mwylde deleted the micah_unique_events branch July 11, 2019 21:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[STRMCMP-558] Event improvements #44

[STRMCMP-558] Event improvements #44

mwylde commented Jul 11, 2019

anandswaminathan Jul 11, 2019

anandswaminathan Jul 11, 2019

anandswaminathan Jul 11, 2019

mwylde Jul 11, 2019

anandswaminathan Jul 11, 2019

anandswaminathan commented Jul 11, 2019

[STRMCMP-558] Event improvements #44

[STRMCMP-558] Event improvements #44

Conversation

mwylde commented Jul 11, 2019

anandswaminathan Jul 11, 2019

Choose a reason for hiding this comment

anandswaminathan Jul 11, 2019

Choose a reason for hiding this comment

anandswaminathan Jul 11, 2019

Choose a reason for hiding this comment

mwylde Jul 11, 2019

Choose a reason for hiding this comment

anandswaminathan Jul 11, 2019

Choose a reason for hiding this comment

anandswaminathan commented Jul 11, 2019