
Whisper importer aggregate conversion #712

Merged: 40 commits merged into master from importer_agg_conversion_plan on Aug 31, 2017

Conversation

@replay (Contributor) commented Aug 23, 2017

This adds functionality to the whisper-importer-reader which allows it to convert input data to the required schema on the fly while importing.
It expects a schema file as a parameter, in the same format that MT uses. It then reads the whisper files and generates the required retentions, using the highest resolution available as input. In cases where a generated retention has a higher resolution than what's available in the whisper files, it "fakes" the higher resolution by "fake-undoing" the aggregation mechanism.

To demonstrate this fake-increasing of the resolution I've used some data that we have stored in MT (ops cluster) and that is also stored as whisper files with differing schemas (ziggurat). I then imported those whisper files into another MT which uses a different schema, which makes it possible to compare the data via two different datasources.

Fixes #710
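
To make the "fake-undoing" concrete, here is a minimal sketch of the idea for sum-aggregated archives; the names are illustrative, not the actual importer code:

```go
package main

import "fmt"

type point struct {
	ts  uint32
	val float64
}

// fakeIncResolutionSum spreads each sum-aggregated point over the
// higher-resolution intervals it covers, dividing the value so that the
// generated points sum back up to the input point.
func fakeIncResolutionSum(in []point, inRes, outRes uint32) []point {
	var out []point
	factor := float64(inRes) / float64(outRes)
	for _, p := range in {
		// the input point covers (p.ts-inRes, p.ts]; emit one point
		// per outRes step inside that range
		for ts := p.ts - inRes + outRes; ts <= p.ts; ts += outRes {
			out = append(out, point{ts: ts, val: p.val / factor})
		}
	}
	return out
}

func main() {
	// a minutely statsd count of 120 becomes six 10s points of value 20
	fmt.Println(fakeIncResolutionSum([]point{{ts: 60, val: 120}}, 60, 10))
}
```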

@replay changed the title from "Importer agg conversion plan" to "Whisper importer aggregate conversion" on Aug 23, 2017
@Dieterbe (Contributor) left a comment

> It expects a schema file as a parameter, in the same format that MT uses.

practically, this will be the file that will be used by MT (HM), right? or is it the file that describes the whisper files?

does this follow the description of #710 exactly, but also with merging of different archives, I guess?

> If there is only an archive with higher resolution and at least the same TTL, it can use the defined consolidateBy function to decrease the resolution.

what happens if there's a whisper archive in 10s resolution but we want 15s?

> If there is only an archive with lower resolution that satisfies the TTL, it will have to repeat each input point factor-times.

this works for gauges, rates, counters etc. but is problematic for statsd counts. e.g. a count can have a minutely value of 120, which means "we counted 120 occurrences over this minutely interval". if the points have to be secondly, their values should average to 2, not be 120 every second.

"cnt": a.cnt,
"lst": a.lst,
"avg": a.sum / a.cnt,
}
Contributor

constructing a new map just to return some values seems rather gross and inefficient.
the simplest alternative, I think, is to just make those members public, and maybe add a comment on the type that the members shouldn't be changed by other packages (rather obvious, but still).
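
A sketch of the suggested refactor (illustrative names, not the PR's actual types):

```go
package agg

// before: a private struct whose values are exposed by allocating a
// fresh map on every call
type aggregator struct {
	cnt, lst, sum float64
}

func (a *aggregator) values() map[string]float64 {
	return map[string]float64{
		"cnt": a.cnt,
		"lst": a.lst,
		"avg": a.sum / a.cnt,
	}
}

// after: exported fields, no per-call allocation.
// Cnt, Lst and Sum must not be modified by other packages.
type Aggregator struct {
	Cnt, Lst, Sum float64
}

func (a *Aggregator) Avg() float64 { return a.Sum / a.Cnt }
```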

Contributor Author

Fixed in the latest commit

if _, ok := readArchives[archiveIdx]; !ok && len(readArchives) > 0 {
continue
}
if !nameFilter.Match([]byte(name)) {
Contributor

why is this filter used so late? seems like we can do the filtering before we even open/read the file, and before we even put the filename on the channel.

Contributor Author

Changed that in the latest commit

Contributor

why is it still after opening the file and after putting it on the channel?

Contributor Author

moved it to before pushing the name into the chan

@replay (Contributor Author) commented Aug 25, 2017

@Dieterbe The format is exactly the same as the MT schema definition. I did that intentionally to make it as easy to use as possible: just give it the MT config file.

It will correctly convert from 10s to 15s too, as tested here: https://github.com/raintank/metrictank/pull/712/files#diff-7218fd8d8872a6d3f498aa9f1f42be63R303

Depending on the aggregation mechanism it decides how to increase the resolution of the data. If the aggregation mechanism was sum then it will divide by the corresponding factor, so 120 would result in 2: https://github.com/raintank/metrictank/pull/712/files#diff-c3e64180d5dd147d8ea6691375b144c9R131

@replay (Contributor Author) commented Aug 28, 2017

@Dieterbe the logic follows the description on #710. it is important to note, though, that the input archive is selected separately for each generated datapoint, so it is possible that one generated rollup has had multiple inputs, for example in a case like this:

we want:
SecondsPerPoint: 60
TTL: 30d

we have
0: 
SecondsPerPoint: 60
TTL: 7d
1:
SecondsPerPoint: 3600
TTL: 30d

^^ In such a case the importer would "mix" the two inputs into one
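
A sketch of that per-point archive selection (illustrative names; archives are assumed to be ordered from highest to lowest resolution):

```go
package importer

type archiveInfo struct {
	secondsPerPoint uint32
	ttl             uint32
}

// archiveForTs returns the index of the highest-resolution archive whose
// TTL still covers the given timestamp; output points older than the
// first archive's TTL fall through to a lower-resolution archive.
func archiveForTs(ts, now uint32, archives []archiveInfo) int {
	for i, a := range archives {
		if now-ts <= a.ttl {
			return i
		}
	}
	return len(archives) - 1
}
```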

currentBoundary = boundary
agg.Add(inPoint.Value)
}
}
Contributor

should we make sure to flush any unflushed points? can you add a comment as to why (not)?

@replay (Contributor Author) Aug 28, 2017

I'll mention that in a comment too:
Generally we only want to write aggregated points that have been "finished", as in we have received a point with a ts which is >= the highest ts that would be factored into this aggregated point. In any other case the result would likely be wrong, because we can't predict the future.

@replay force-pushed the importer_agg_conversion_plan branch from be33c98 to 3c4104e (August 28, 2017 12:56)
@replay force-pushed the importer_agg_conversion_plan branch from 3c4104e to a7d7d37 (August 28, 2017 12:57)
return out
}

func decResolution(points []whisper.Point, methods []string, inRes, outRes uint32) map[string][]whisper.Point {
Contributor

can you document these functions? it's easy to confuse "decrease resolution" with "decrease interval", whereas it's the opposite: we're increasing the interval here.

sidenote:
https://www.google.be/search?q=resolution&oq=resolution&aqs=chrome..69i57j0l5.1000j0j7&sourceid=chrome&ie=UTF-8 has 1 applicable definition for resolution:

the smallest interval measurable by a telescope or other scientific instrument; the resolving power.
the degree of detail visible in a photographic or television image.

these definitions also seem inconsistent to me, as they use resolution for both "interval" and "degree of detail", but the lower the interval, the higher the degree of detail.

and we also speak of "high resolution screens" to talk about smaller intervals/pixels, so either way of phrasing seems correct; it's just worth commenting what exactly the functions do.

Contributor Author

actually I think "decrease interval" is kind of ambiguous, because strictly speaking it increases the resolution, as you say, but I think many people might misunderstand it as decreasing the resolution.

Contributor

since resolution is not well defined and is ambiguous, I don't understand what you're saying.
but I have no trouble believing that people can misunderstand interval too - even though it's more clearly defined than resolution - so that's why I'm saying to just document the functions better.

continue
}

rangeEnd := inPoint.Timestamp - (inPoint.Timestamp % outRes)
@Dieterbe (Contributor) Aug 28, 2017

correct me if i'm wrong:

  1. points in whisper are quantized (e.g. for a res of 30s, timestamps in whisper are always divisible by 30 without remainder)
  2. thus, all points divide by inRes without remainder
  3. outRes < inRes
  4. all points divide by outRes without remainder except when outRes doesn't fit evenly in inRes (e.g. outRes is 10 and inRes 15)
  5. rangeEnd always equals inPoint.Timestamp except when outRes doesn't fit evenly in inRes (e.g. outRes is 10 and inRes 15)

if this is correct, worth pointing out i think
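
A quick illustration of points 4 and 5 with inRes=15, outRes=10 (hypothetical values, just to show when rangeEnd diverges from the timestamp):

```go
package main

import "fmt"

func main() {
	inRes, outRes := uint32(15), uint32(10)
	// whisper timestamps are quantized to inRes, so step by inRes
	for ts := inRes; ts <= 60; ts += inRes {
		rangeEnd := ts - ts%outRes
		fmt.Printf("ts=%2d rangeEnd=%2d equal=%v\n", ts, rangeEnd, rangeEnd == ts)
	}
	// output:
	// ts=15 rangeEnd=10 equal=false
	// ts=30 rangeEnd=30 equal=true
	// ts=45 rangeEnd=40 equal=false
	// ts=60 rangeEnd=60 equal=true
}
```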

@replay (Contributor Author) Aug 28, 2017

I'm not sure if I'm misunderstanding what you mean, but don't your point 4) and point 5) conflict with each other?
If inRes is 15 and outRes is 10 then some points will not divide by outRes without remainder (which is fine).

Contributor

i updated point 4; it had a mistake in there.
i don't have any particular point other than that I think 5 is an interesting observation that is probably worth documenting (if i'm correct).

Contributor Author

added comments

Contributor

your comment says "outRes is > inRes", that's wrong, no?

Contributor Author

lol right, total confusion

outData := incResolution(inData, method, inRes, outRes, rawRes)

if len(expectedResult) != len(outData) {
t.Fatalf("Generated data is not as expected:\n%+v\n%+v", outData, expectedResult)
@Dieterbe (Contributor) Aug 28, 2017

when this dumps both structures, it's not clear which one was expected and which one we got.
maybe better something like

t.Fatalf("testIncResolution.\nExpected:\n%+v\nGot:\n%+v\n", expectedResult, outData)

Contributor

ditto for the other ones btw

Contributor Author

Fixed

{0, 0},
{10, 10},
{0, 0},
{0, 0},
Contributor

what exactly is a gap here? as in null data? is this how whisper returns that? a 0 ts and 0 value?

Contributor Author

yes, because whisper preallocates:

$> whisper-create.py whisperfile.wsp 1:6
$> whisper-update.py ./whisperfile.wsp $(date +%s):1
$> whisper-dump.py whisperfile.wsp 
Meta data:
  aggregation method: average
  max retention: 6
  xFilesFactor: 0.5

Archive 0 info:
  offset: 28
  seconds per point: 1
  points: 6
  retention: 6
  size: 72

Archive 0 data:
0: 1504002715,          1
1: 0,          0
2: 0,          0
3: 0,          0
4: 0,          0
5: 0,          0

testDecResolution(t, inData, expectedResult, []string{"avg"}, 10, 30)
}

func testDecResolution(t *testing.T, inData []whisper.Point, expectedResult map[string][]whisper.Point, methods []string, inRes, outRes uint32) {
Contributor

why is this function placed right in the middle of all its callers? maybe move it to the top, right under testIncResolution

Contributor Author

I moved it right above its call sites

func TestRowKeyAgg1(t *testing.T) {
res := getRowKey(1, "aaa", "sum", 60)
if res != "aaa_sum_60" {
t.Fatalf("row key for aggregation 0 should equal the id")
Contributor

incorrect. maybe use expected - got format here also.
in fact i would merge these 2 tests into one, and do it table-based (sketched below).
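
A possible table-based version (hedged sketch: it assumes getRowKey's signature from the snippet above, and that aggregation 0 returns the bare id, as the existing failure message suggests):

```go
func TestGetRowKey(t *testing.T) {
	cases := []struct {
		aggNum   int
		id       string
		method   string
		interval uint32
		expected string
	}{
		{0, "aaa", "sum", 60, "aaa"},        // aggregation 0: row key equals the id
		{1, "aaa", "sum", 60, "aaa_sum_60"}, // rollup: id_method_interval
	}
	for _, c := range cases {
		if got := getRowKey(c.aggNum, c.id, c.method, c.interval); got != c.expected {
			t.Fatalf("row key: expected %q got %q", c.expected, got)
		}
	}
}
```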

}

func TestEncodedChunksFromPointsWithoutUnfinished(t *testing.T) {
points := generatePoints(25200, 10, 10, 0, 8640, func(i float64) float64 { return i + 1 })
@Dieterbe (Contributor) Aug 29, 2017

can you comment wrt the (in)significance of these input numbers

for iter.Next() {
ts, val := iter.Values()
if points[i].Timestamp != ts || points[i].Value != val {
t.Fatalf("Unexpected value at index %d: %d:%f instead of %d:%f", i, ts, val, points[i].Timestamp, points[i].Value)
Contributor

let's use expected: %d:%f got: %d:%f syntax here also for consistency.

}
i++
}
}
Contributor

this needs a check to confirm that we actually iterated over all points (if iter.Next() didn't return any points, we wouldn't notice now)
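
For example (a sketch that would go right after the loop shown above, reusing its i counter and points slice):

```go
if i != len(points) {
	t.Fatalf("expected to iterate over %d points, got %d", len(points), i)
}
```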

@replay force-pushed the importer_agg_conversion_plan branch from dd45d6f to 148aec8 (August 29, 2017 13:17)
@replay (Contributor Author) commented Aug 29, 2017

@Dieterbe I think every one of your comments has been dealt with now

log(fmt.Sprintf("Processing file %q", file))
met, err := getMetric(w, file)
name := getMetricName(file)
log.Info(fmt.Sprintf("Processing file %s (%s)", file, name))
Contributor

can use log.Infof (also elsewhere where you use log.Info(fmt.Sprintf(...)))
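
i.e., assuming the logger in use exposes the usual Printf-style variant:

```go
// instead of log.Info(fmt.Sprintf("Processing file %s (%s)", file, name))
log.Infof("Processing file %s (%s)", file, name)
```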

rawRes := c.archives[0].SecondsPerPoint

adjustedPoints := make(map[string]map[uint32]float64)
if retIdx > 0 && c.method == "avg" || c.method == "sum" {
Contributor

i think we should take the desired aggregations into account.
if whisper has sum, and we only desire sum, then why generate and store cnt?
in fact, this goes for everything here: for example, if whisper had max, but what's desired for MT is min, then why convert and store max into MT?

Contributor Author

So far my approach was to import whatever data we can get out of the whisper files. If min is desired but the whisper file contains max, then we simply do not have the data required to satisfy what's desired; but we do have max, so let's import that.

@replay (Contributor Author) Aug 29, 2017

If the whisper file contains a summed aggregation and we can get the cnt for free which then allows us to also display avg, then why not take it?

Contributor

perhaps it's best in this case to issue warnings. after all, if whisper had a rollup that is now no longer enabled, then maybe we should warn the user that they should probably enable that rollup in MT as well. otherwise they might get surprised if at first things seem to work (e.g. data imported from whisper is readable) but later they find out that HM/MT is not configured to store an aggregation they wanted.

Contributor Author

Do I understand you right that we should issue a warning if the method has been set to sum but we additionally generate the cnt? Ok, I can do that.
Actually, at the time of the import we don't know what's enabled/disabled in the metrictank config unless we ask the user for the storage-aggregations.conf too. I assumed that most average users just have all aggregation methods enabled, so I figured I'd just fill as many of them as I can based on the data from whisper.

@Dieterbe (Contributor) Aug 30, 2017

yes, we would need to look at both files. I think for HM, where aggregation methods are supposed to be carefully crafted to balance a great experience vs storage cost vs $$ cost, it would make sense to warn if there's data in a whisper archive that is not set up for HM. at the very least, it's an incentive to make sure the storage-schemas for HM is as desired, without forgetting a needed aggregation. maybe @woodsaj or @nopzor1200 should chip in here.

@replay (Contributor Author) Aug 30, 2017

I don't think that's necessary. If a user has whisper files that contain max aggregations and then configures their MT to only keep min, they'll see the imported data when they choose consolidateBy(max) and the new data when they consolidateBy(min), which makes sense and is probably a very rare case anyway. Asking the user to also provide their storage-aggregations.conf just to be able to warn them if they potentially might have misconfigured something is on average probably a worse user experience, not a better one, because the vast majority is just going to have all aggregations on.

Contributor

> Asking the user to also provide their storage-aggregations.conf just to be able to warn them

we are the ones constructing this file, at least for the HM use case.

Contributor Author

Yeah, so then we'll need to make the user download two files and point the tool at both of them, etc. I'd prefer to keep the process as simple and smooth as possible, so it doesn't seem justified to have to deal with an extra file just to display a warning.

@Dieterbe (Contributor)

i'm not entirely sold on the "just import everything even if it's not a configured aggregation" approach, but we can always revise this later if needed.

@Dieterbe Dieterbe merged commit ee4db35 into master Aug 31, 2017
@Dieterbe Dieterbe deleted the importer_agg_conversion_plan branch September 18, 2018 09:00