Use counter time in win perf counters #4267

vlastahajek · 2018-06-11T21:15:26Z

Attempts to fix #4250 by adding the option to use the timestamp from performance counters.

Signed CLA.
Associated README.md updated.
Has appropriate unit tests.

matthenning · 2018-06-13T08:13:03Z

@danielnelson I'm not sure this can be classified as an enhancement.
If it does fix #4250, it is a bugfix. I hope this makes it into 1.7.1.

danielnelson

Can you also rebase or merge master to bring in a build fix?

danielnelson · 2018-06-18T20:34:49Z

plugins/inputs/win_perf_counters/win_perf_counters.go

 	}

 	return nil
 }

+//returns true if err is an error we count with
+func isKnowError(err error) bool {
+	if phderr, ok := err.(*PdhError); ok && (phderr.ErrorCode == PDH_INVALID_DATA || phderr.ErrorCode == PDH_CALC_NEGATIVE_VALUE || phderr.ErrorCode == PDH_CSTATUS_INVALID_DATA) {


Could you line wrap this around 78 chars?

danielnelson · 2018-06-18T20:36:34Z

plugins/inputs/win_perf_counters/win_perf_counters.go

 	}

 	return nil
 }

+//returns true if err is an error we count with
+func isKnowError(err error) bool {


Can you rename this function and redo the comment? I think based on your comment above maybe you could name it isCounterDataError. You can probably remove the comment above if this name is more precise.
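
A minimal sketch of the two review suggestions combined: the isCounterDataError name and lines kept under roughly 78 characters. The PdhError type and the numeric constant values below are stand-ins so the snippet compiles on its own; the plugin has its own definitions, and the merged code may be structured differently.

```go
package main

import "fmt"

// Placeholder values for illustration only. They are NOT the real PDH
// status codes; the plugin defines those itself.
const (
	PDH_INVALID_DATA uint32 = iota + 1
	PDH_CALC_NEGATIVE_VALUE
	PDH_CSTATUS_INVALID_DATA
)

// PdhError is a stand-in for the plugin's error type. Only the ErrorCode
// field, which appears in the diff above, is modeled here.
type PdhError struct {
	ErrorCode uint32
}

func (e *PdhError) Error() string {
	return fmt.Sprintf("PDH error 0x%X", e.ErrorCode)
}

// isCounterDataError reports whether err is a PdhError that signals
// missing or not-yet-computable counter data.
func isCounterDataError(err error) bool {
	pdhErr, ok := err.(*PdhError)
	if !ok {
		return false
	}
	switch pdhErr.ErrorCode {
	case PDH_INVALID_DATA,
		PDH_CALC_NEGATIVE_VALUE,
		PDH_CSTATUS_INVALID_DATA:
		return true
	}
	return false
}
```

A switch keeps each code on its own line, so adding another status later does not push the condition past the line limit again.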

danielnelson · 2018-06-19T18:39:08Z

plugins/inputs/win_perf_counters/win_perf_counters_integration_test.go

@@ -98,6 +103,10 @@ func TestWinPerformanceQueryImpl(t *testing.T) {
 	require.NoError(t, err)

 	arr, err := query.GetFormattedCounterArrayDouble(hCounter)
+	if phderr, ok := err.(*PdhError); ok && phderr.ErrorCode != PDH_INVALID_DATA && phderr.ErrorCode != PDH_CALC_NEGATIVE_VALUE {
+		time.Sleep(time.Second)


This is a red flag for me because it indicates the test may be timing dependent. Why do we need to sleep?

vlastahajek · 2018-06-20

The invalid data error happens randomly, but can be seen several times in an hour. It generally means that some counter provided an invalid value, or that a value could not be computed from the samples gathered during the given period. Giving it more time (which happens automatically on the next gather cycle in a normal Telegraf run) solves the problem.

When using wildcard expansion, each counter is queried for its value separately, so the error is filtered in my code at that level.
When using PdhGetFormattedCounterArray, multiple counters are queried at once, and it seems that if any counter returns an invalid data status, the function fails completely.

Historically the code didn't bother with errors; they were ignored entirely. Maybe that is the way to go.

To me it makes sense to report unexpected errors.
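
The per-counter filtering described above could look roughly like the sketch below. Everything in it is hypothetical: gatherEach, the read callback, and the simplified PdhError with a placeholder code are illustration-only names, not the plugin's actual API. It assumes each counter is read separately, skips counters that report invalid data, and collects every other error for reporting.

```go
package main

import "fmt"

// Placeholder value for illustration; not the real PDH status code.
const PDH_INVALID_DATA uint32 = 1

// PdhError is a simplified stand-in for the plugin's error type.
type PdhError struct{ ErrorCode uint32 }

func (e *PdhError) Error() string {
	return fmt.Sprintf("PDH error 0x%X", e.ErrorCode)
}

// isCounterDataError reports whether err is the transient
// "invalid data" condition (reduced to one code in this sketch).
func isCounterDataError(err error) bool {
	pdhErr, ok := err.(*PdhError)
	return ok && pdhErr.ErrorCode == PDH_INVALID_DATA
}

// gatherEach reads every counter path on its own. A counter data error
// drops only the affected counter; any other error is returned so the
// caller can report it.
func gatherEach(
	paths []string,
	read func(path string) (float64, error),
) (map[string]float64, []error) {
	values := make(map[string]float64, len(paths))
	var unexpected []error
	for _, p := range paths {
		v, err := read(p)
		switch {
		case err == nil:
			values[p] = v
		case isCounterDataError(err):
			// Transient: the next gather cycle gets another chance.
		default:
			unexpected = append(unexpected,
				fmt.Errorf("reading %q: %w", p, err))
		}
	}
	return values, unexpected
}
```

By contrast, a single array query that fails as a whole gives the caller no per-counter values to keep, which is why the filtering has to happen at this level.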

danielnelson · 2018-06-22

If this can happen in the actual code, then we should probably ignore it, or ignore it and log. In the tests, though, this is more problematic: we don't want to miss failures, but we also don't want slow tests or intermittent failures.

The best fix would be to mock the calls to the library. In the meantime, mark this test to be skipped if -short is set during testing, which I just added in ee6e4b0.
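
As a rough illustration of the mocking idea, the sketch below hides the library call behind a small interface and injects the wait step, so a test can simulate an invalid data error followed by a good read without sleeping. All names here (counterQuery, fakeQuery, readWithRetry, errInvalidData) are hypothetical and much simpler than the plugin's real query interface.

```go
package main

import "errors"

// errInvalidData is a stand-in for the PDH invalid data status.
var errInvalidData = errors.New("invalid counter data (stand-in)")

// counterQuery is a hypothetical, narrowed-down view of the query API.
type counterQuery interface {
	ReadArray() ([]float64, error)
}

// fakeQuery fails its first `failures` calls, then returns values.
type fakeQuery struct {
	failures int
	calls    int
	values   []float64
}

func (f *fakeQuery) ReadArray() ([]float64, error) {
	f.calls++
	if f.calls <= f.failures {
		return nil, errInvalidData
	}
	return f.values, nil
}

// readWithRetry retries only on errInvalidData, calling wait between
// attempts. Any other error is returned immediately.
func readWithRetry(
	q counterQuery, attempts int, wait func(),
) ([]float64, error) {
	if attempts < 1 {
		return nil, errors.New("attempts must be at least 1")
	}
	var lastErr error
	for i := 0; i < attempts; i++ {
		if i > 0 {
			wait()
		}
		vals, err := q.ReadArray()
		if err == nil {
			return vals, nil
		}
		if !errors.Is(err, errInvalidData) {
			return nil, err
		}
		lastErr = err
	}
	return nil, lastErr
}
```

A test would pass a no-op wait function together with a fakeQuery, while an integration run against the real library could pass a function that sleeps for a second. The test then exercises the retry path deterministically instead of depending on timing.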

vlastahajek · 2018-06-22

The test is already skipped in short mode.

Do you mean logging such an error as a warning, rather than as an error as it is now?

Regarding commit ee6e4b0: are all tests (not only those in short mode) run by some check as part of pull request checks?

danielnelson · 2018-06-30

Okay, I think we are good then. Timing-based tests are problematic, but since this one is skipped in short mode it is okay. CI only ever runs go test -short for unit tests, which means tests like this are almost always skipped unless a developer runs them by hand.

vlastahajek added 2 commits June 11, 2018 22:54

Trying to fix influxdata#4250 - added possibility to use timestamp fr…

f518141

…om perf counters

Merge branch 'master' into vh-fix#4250

1afbfb2

danielnelson added the feat and area/windows labels Jun 12, 2018

danielnelson added this to the 1.8.0 milestone Jun 12, 2018

Fixed wrong merge

6d13e8b

vlastahajek added 7 commits June 13, 2018 12:46

Add missing error handling to second loop

acd6efe

Added test for gather errors

708a06a

Stabilizing test

ef35f0d

Making test using timestamp from counters

524a5c0

Small code beautification

074e9dd

Merge branch 'master' into vh-fix#4250

0e3bf11

Improved error reporting

4f40a2c

vlastahajek mentioned this pull request Jun 18, 2018

Intermittent unittest failure in win_perf_counters #4301

Closed

danielnelson reviewed Jun 18, 2018

- better name for isKnowError function

9c3f34a

- refactoring common measurement creation code to a function

danielnelson reviewed Jun 19, 2018

danielnelson changed the title ~~Attempt to fix #4250~~ Use counter time in win perf counters Jun 30, 2018

danielnelson merged commit ed2bc11 into influxdata:master Jun 30, 2018

rgitzel pushed a commit to rgitzel/telegraf that referenced this pull request Oct 17, 2018

Allow use of counter time in win perf counters (influxdata#4267)

bef32c6

otherpirate pushed a commit to otherpirate/telegraf that referenced this pull request Mar 15, 2019

Allow use of counter time in win perf counters (influxdata#4267)

c67d4e0

This pull request was closed.

danielnelson left a comment

danielnelson Jun 18, 2018

danielnelson Jun 18, 2018

danielnelson Jun 19, 2018

vlastahajek Jun 20, 2018

danielnelson Jun 22, 2018

vlastahajek Jun 22, 2018

danielnelson Jun 30, 2018
