-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fail variable_width_histogram that collects from many #58619
Fail variable_width_histogram that collects from many #58619
Conversation
Adds an explicit check to `variable_width_histogram` to stop it from trying to collect from many buckets because it can't. I tried to make it do so but that is more than an afternoon's project, sadly. So for now we just disallow it. Relates to elastic#42035
Pinging @elastic/es-analytics-geo (:Analytics/Aggregations) |
I think to fix this we'd want to think about replacing |
Hey, good catch with this! I think the main cause of this problem relates to how the
This will certainly lead to an explosion of bucket if the aggregation is nested, since each sub aggregation could have 50k buckets. One way to fix this could be to just lower the default size of the I'm not quite sure what you mean by
would you mind explaining? |
Its pretty much the TODO that you left. I'm not 100% sure it is needed, but it feels like it'd be fairly important. The trouble is right here. That assertion isn't valid when collecting from many buckets. The second parameter which I usually call It'd be fairly simple to handle this by changing the The trouble comes when merging buckets. Its both tricky to implement and inefficient. Tricky because the
I think it'd be ok doing it the "inefficient but correct" way. But just getting the correct way is tricky. And now we come to my comment about |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍, and cheers for the explanation. Makes sense to lock this down for now until we modify it to handle multiple buckets. It does seem reasonably tricky to do that efficiently, compared to auto-date
Thanks for the detailed explanation! That makes a lot of sense :) |
Adds an explicit check to `variable_width_histogram` to stop it from trying to collect from many buckets because it can't. I tried to make it do so but that is more than an afternoon's project, sadly. So for now we just disallow it. Relates to elastic#42035
Adds an explicit check to `variable_width_histogram` to stop it from trying to collect from many buckets because it can't. I tried to make it do so but that is more than an afternoon's project, sadly. So for now we just disallow it. Relates to #42035
Adds an explicit check to
variable_width_histogram
to stop it fromtrying to collect from many buckets because it can't. I tried to make it
do so but that is more than an afternoon's project, sadly. So for now we
just disallow it.
Relates to #42035