-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1.7.0-rc1 PerfMon % Processor Time discrepancy #4250
Comments
Thank you for the help testing. I'm not sure what would cause the difference here, would it be possible to upload an image of your perfmon output against the raw data in influxdb using the |
This is a multi processor system in case that's important. |
I wonder if this could be because we are using the Telegraf time instead of the counter time. @vlastahajek We should bring in the UsePerfCounterTime feature from #4076. |
I'm available for testing of required |
When using Telegraf 1.6, are the results ok? |
In 1.6 there was a different issue with this counter: #4198 |
…om perf counters
Thank you for your effort folks. |
@matthenning Would it be possible for you to test out the fix with a nightly builds to see if it helps with the accuracy? The change was a little larger than I would normally backport for a patch release but if it solves the issue and is well tested we can consider it. |
Will do. I'll report back. |
I fear the problem isn't solved yet. Query:
Relevant Telegraf config:
Telegraf v1.8.0~abfbf4f4 (git: master abfbf4f) |
Taskmgr values almost match your typeperf data. |
The main difference is that taskmgr shows even different scale of values. The values there reach 100%, which wasn't noted by any other method of measurement. I didn't investigated the science behind CPU activity measurement, so it is hard to say what is wrong and what is right. For sure we can say that Telegraf provides good results with respect to the way how it collects data from performance counters. Is there any other metrics you are interested in we can look at and compare? |
Relevant telegraf.conf:
System info:
Telegraf: 1.7.0-rc1
Windows: Server 2012 R2
Steps to reproduce:
Compare % Process Time in Grafana (or the graphing tool of your choice) and perfmon
Expected behavior:
The collected values are identical to the actual values on the system
Actual behavior:
During the time span where CPU Load was at 100% in the following screenshot perfmon reported a steady load of 82-88% of the same exact counter, so 12-15% less than the collected value.
The text was updated successfully, but these errors were encountered: