
#1109 Make queue size of MetricJsonListener configurable #1114

Merged
merged 3 commits into from
Mar 11, 2016

Conversation

Contributor

@anuragw anuragw commented Mar 2, 2016

The queue size of the MetricJsonListener used by the HystrixMetricsStreamServlet has been made configurable in two ways:

  • new property "hystrix.stream.defaultMetricListenerQueueSize", which if unspecified will be set to 1000
  • optional request parameter "queueSize", which can override the default queue size for each request

The changes have been made on the 1.4.x branch, but can also be propagated to the 1.5.x RC branch. Please let me know if you need any further information or would like to suggest changes.
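The resolution order described above (property-backed default, optionally overridden per request) can be sketched as follows. This is an illustrative standalone sketch, not the PR's exact code; the names `QueueSizeResolver`, `resolveQueueSize`, and `DEFAULT_QUEUE_SIZE` are hypothetical.

```java
// Hypothetical sketch of the queue-size resolution described in this PR:
// a property-backed default (1000) that an optional "queueSize" request
// parameter may override on a per-request basis.
public class QueueSizeResolver {
    // Default value of hystrix.stream.defaultMetricListenerQueueSize
    static final int DEFAULT_QUEUE_SIZE = 1000;

    // param is the raw value of the optional "queueSize" request
    // parameter; it is null when the parameter is absent.
    static int resolveQueueSize(String param) {
        if (param == null) {
            return DEFAULT_QUEUE_SIZE;
        }
        try {
            int requested = Integer.parseInt(param);
            // Reject non-positive sizes and keep the configured default.
            return requested > 0 ? requested : DEFAULT_QUEUE_SIZE;
        } catch (NumberFormatException e) {
            // Malformed parameter: fall back to the configured default.
            return DEFAULT_QUEUE_SIZE;
        }
    }

    public static void main(String[] args) {
        System.out.println(resolveQueueSize(null));    // 1000
        System.out.println(resolveQueueSize("5000"));  // 5000
        System.out.println(resolveQueueSize("bogus")); // 1000
    }
}
```

Falling back to the default on malformed input (rather than failing the request) matches how a metrics endpoint would typically prefer degraded configuration over a 4xx response.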

@cloudbees-pull-request-builder

NetflixOSS » Hystrix » Hystrix-pull-requests #369 SUCCESS
This pull request looks good

MetricJsonListener jsonListener = new MetricJsonListener();
int queueSize = defaultMetricListenerQueueSize.get();
try {
    String q = request.getParameter("queueSize");
    if (q != null) queueSize = Integer.parseInt(q); // optional per-request override
} catch (NumberFormatException e) {
    // malformed "queueSize" parameter: keep the property-backed default
}
Contributor

Why would you let the request set the queue size?

Contributor Author

If a system talks to multiple, dynamically scalable endpoints, it could be cumbersome to change the queue size only via a property. Providing a request parameter would give a visualization service (hystrix-dashboard) or an aggregation service (turbine) an easy way to reconfigure the queue size by just updating parameters.

Contributor

IMO, the velocity of metrics is independent of who is consuming the metrics. What problem do you intend to solve by allowing the request to set the queue size?

Contributor Author

As I mentioned in my previous comment, there doesn't seem to be a way of scaling the queue size when the number of backend endpoints is scaled up or down.

For instance, a user-facing email service may only need 25 instances of a backend email retrieval service during off-peak but auto-scale to 100-150 during peak periods. In such a case, we would need to estimate the peak load, and set the queue size accordingly. If the volume of metrics exceeded that, the hystrix stream servlet would need to be reconfigured each time.

The consumer of metrics would be in a better position to anticipate typical load patterns, and could adjust accordingly if it noticed that stats collection was failing because the queue size cap was being hit.

@mattrjacobs
Contributor

@anuragw I don't follow your reasoning.

The HystrixMetricsPoller gets invoked within an infinite loop in HystrixMetricsStreamServlet. It loops over all of the metrics and gets the current value of the metric. There's a delay parameter in the URL that determines how frequently to generate these metrics (default is 500ms). Call this value D.

The HystrixMetricsPoller produces 1 String for each command + threadpool + collapser in your running system. Call that value M. So the rate at which the queue fills up is M/D.

The metrics consumption happens in the servlet thread and also has the same delay. It reads all values in the queue.

So the delay parameter to the servlet governs how quickly to both produce/consume metrics. But the number of metrics in the system doesn't really change over time, unless you're adding commands/threadpools/collapsers dynamically.
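The M/D rate above can be made concrete with a back-of-envelope calculation. The figures here are illustrative assumptions, not measurements from a real deployment:

```java
// Back-of-envelope for the fill rate described above: the poller produces
// M metric strings every D milliseconds, so the queue receives M/D strings
// per millisecond. M = 150 and D = 500 below are illustrative values
// (D = 500 ms is the servlet's default delay).
public class MetricRate {
    public static void main(String[] args) {
        int m = 150;       // commands + threadpools + collapsers (M)
        int delayMs = 500; // poller delay in ms (D)

        // Strings produced per second = M * (1000 / D)
        double stringsPerSecond = m * (1000.0 / delayMs);
        System.out.println((int) stringsPerSecond); // 300
    }
}
```

Because the servlet thread drains the whole queue on the same cadence, the queue depth stays near M per cycle as long as the consumer keeps up; a backlog only accumulates if the consumer stalls or M grows.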

@anuragw
Contributor Author

anuragw commented Mar 10, 2016

@mattrjacobs your last line says it all: "unless you're adding commands/threadpools/collapsers dynamically". Our system does have this behavior.

One other option might be to configure the queueSize property to 100K or 1M, something huge, then use a very tiny delay, say 1ms. But wouldn't this tiny delay be very close to normal processing overheads in the servlet, making it hard to interpret the reported metrics?

@cloudbees-pull-request-builder

NetflixOSS » Hystrix » Hystrix-pull-requests #378 SUCCESS
This pull request looks good

@anuragw anuragw closed this Mar 10, 2016
@cloudbees-pull-request-builder

NetflixOSS » Hystrix » Hystrix-pull-requests #379 SUCCESS
This pull request looks good

awazalwar added 2 commits March 10, 2016 13:59
…tional queueSize=<int> parameter in stream query
…y hystrix.stream.defaultMetricListenerQueueSize (still defaults to 1000 if unspecified)
@cloudbees-pull-request-builder

NetflixOSS » Hystrix » Hystrix-pull-requests #380 FAILURE
Looks like there's a problem with this pull request

@anuragw anuragw reopened this Mar 10, 2016
@cloudbees-pull-request-builder

NetflixOSS » Hystrix » Hystrix-pull-requests #382 SUCCESS
This pull request looks good

@anuragw
Contributor Author

anuragw commented Mar 10, 2016

I've fixed the issue I had on my forked branch, and have pushed again. The PR got auto-closed when I reset to the base fork before replaying my two commits, and so I've reopened it. I'm working on the build failure now and will update once that's done.

Can we make a call soon on whether or not to include the queueSize in the request parameters? I wouldn't mind keeping the property as the sole way of setting the queueSize, but I do think the request parameter could be useful in some scenarios.

@anuragw
Contributor Author

anuragw commented Mar 10, 2016

I'm not sure what the issue is; the Gradle build passes on my system. Any ideas why this may be failing?

@mattrjacobs
Contributor

Even in the case where commands are being added dynamically, there's still a single set of metrics for the JVM. So I still don't see how defining a queueSize per request helps you.

I'm trying to clean up a bunch of the unit tests to get rid of the flaky failures. They almost always have to do with Travis running things more slowly than our local machines. I hope to have that significantly cleaned up in the next couple of days.

@anuragw
Contributor Author

anuragw commented Mar 11, 2016

@mattrjacobs Let's agree to disagree on whether including the queueSize in the request makes sense. I'll remove the request parameter logic and push the new code today. Once the unit tests are fixed, is it safe to assume that the PR merge shouldn't take much longer?

@cloudbees-pull-request-builder

NetflixOSS » Hystrix » Hystrix-pull-requests #408 SUCCESS
This pull request looks good

mattrjacobs added a commit that referenced this pull request Mar 11, 2016
#1109 Make queue size of MetricJsonListener configurable
@mattrjacobs mattrjacobs merged commit 6f55c0a into Netflix:1.4.x Mar 11, 2016
@mattrjacobs
Contributor

Thanks for the good discussion and the contribution, @anuragw. I plan on getting a release with this change out early next week.

@anuragw
Contributor Author

anuragw commented Mar 14, 2016

@mattrjacobs, I'm always looking forward to contributing! Thanks for your help too, and I'll keep an eye out for the next 1.4.x release.
