
[Monitoring] telemetry fetchers use broken pagination logic #91654

Closed · afharo opened this issue Feb 17, 2021 · 12 comments
Labels: chore, Feature:Stack Monitoring, Feature:Telemetry, impact:low, loe:small, Team:Infra Monitoring UI - DEPRECATED

Comments

@afharo (Member) commented Feb 17, 2021

As @robbavey pointed out, this is following the same approach as the get_beats_stats piece of logic.

@chrisronline, do you know if .monitoring-* indices have a custom index.max_result_window setting that allows pagination for more than 10k docs? Or if it's even possible that we reach 10k docs on these types of queries?

Originally posted by @afharo in #90850 (comment)

Let's revisit the pagination logic for Beats and Logstash to make sure it works for 10k+ docs. We can use either the Scroll or the Search After approach.
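
For reference, a minimal sketch of the Scroll option, assuming the 8.x @elastic/elasticsearch client; the index, query, and page size are illustrative, not the actual fetcher code:

```typescript
import { Client } from '@elastic/elasticsearch';

const client = new Client({ node: 'http://localhost:9200' });

// Hypothetical helper: scrolls through every beats_stats document in pages of 1000.
async function fetchAllBeatsStats(): Promise<unknown[]> {
  const docs: unknown[] = [];

  // Open a scroll context; each page keeps the context alive for another 30s.
  let response = await client.search({
    index: '.monitoring-beats-*',
    scroll: '30s',
    size: 1000,
    query: { term: { type: 'beats_stats' } },
  });

  while (response.hits.hits.length > 0) {
    docs.push(...response.hits.hits);
    // Fetch the next page using the scroll cursor returned by the previous call.
    response = await client.scroll({ scroll_id: response._scroll_id!, scroll: '30s' });
  }

  // Release the scroll context once we've read everything.
  await client.clearScroll({ scroll_id: response._scroll_id! });
  return docs;
}
```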

@elasticmachine (Contributor)

Pinging @elastic/kibana-telemetry (Team:KibanaTelemetry)

@afharo (Member, Author) commented Feb 17, 2021

Ping @elastic/stack-monitoring-ui

@chrisronline (Contributor)

We don't have anything like that, AFAIK. It seems like it would come up more often, but I honestly can't recall the last time I encountered someone with an issue around this.

@afharo (Member, Author) commented Feb 18, 2021

This is about collecting telemetry, so any errors fail silently and telemetry simply isn't sent. I don't think anyone would actually notice.

I also think it's an edge case. It would mean we are dealing with massive clusters with more than 10k Beats instances, Logstash instances, or Logstash pipeline ephemeral IDs.

@chrisronline (Contributor)

Makes sense. I meant that we don't hear from customers that they aren't able to see their 10k+ Beats in the Stack Monitoring UI, so I'm not sure how often anyone exceeds 10k instances of anything. It seems like an isolated use case, but maybe something worth noting in the docs somewhere.

@sgrodzicki added the bug label Jun 7, 2021
@simianhacker added the chore label and removed the bug label Jun 23, 2021
@simianhacker (Member)

I'm not sure what the issue is here. Are we trying to figure out if the UI can handle 10K+ logstash instances?

@Bamieh (Member) commented Jun 25, 2021

@simianhacker telemetry runs queries against the .monitoring-* indices and reports the results back to our cluster.

The current logic to paginate over the indices does not work because the max window (from + size) can't exceed 10k docs by default in ES (index.max_result_window). This means that, with the current implementation, no pagination happens even if Beats/Logstash report more than 10k documents.
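
To make the failure mode concrete, here is a minimal sketch of that from + size loop, assuming the 8.x @elastic/elasticsearch client; the index and page size are illustrative, not the actual collector code:

```typescript
import { Client } from '@elastic/elasticsearch';

const client = new Client({ node: 'http://localhost:9200' });
const PAGE_SIZE = 1000;

// Hypothetical from + size pagination over a monitoring index.
async function fetchWithFromSize(index: string): Promise<unknown[]> {
  const docs: unknown[] = [];
  for (let page = 0; ; page++) {
    // Once from + size exceeds index.max_result_window (10,000 by default),
    // Elasticsearch rejects the request; the telemetry collectors swallow the
    // error, so everything past the first 10k documents is never collected.
    const response = await client.search({
      index,
      from: page * PAGE_SIZE,
      size: PAGE_SIZE,
    });
    const hits = response.hits.hits;
    docs.push(...hits);
    if (hits.length < PAGE_SIZE) break;
  }
  return docs;
}
```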

@lukeelmers added the Team:Core label Oct 1, 2021
@elasticmachine (Contributor)

Pinging @elastic/kibana-core (Team:Core)

@exalate-issue-sync bot added the impact:low and loe:small labels Nov 4, 2021
@lukeelmers removed the Team:Core label Jan 11, 2022
@afharo added the Team:Infra Monitoring UI - DEPRECATED label and removed the Team:Monitoring label Mar 4, 2022
@elasticmachine (Contributor)

Pinging @elastic/infra-monitoring-ui (Team:Infra Monitoring UI)

@jasonrhodes (Member)

@Bamieh @afharo @robbavey we just re-discovered this ticket and we aren't sure whether this is a "bug" or not. We are trying to determine if we need to pull this in...

@afharo (Member, Author) commented Mar 15, 2022

@jasonrhodes in #127273 I removed the "broken" pagination logic: it basically doesn't paginate anymore.

However, ideally, the requests in there should use PIT pagination to retrieve all the available data in a less expensive way. For more context on why we prefer PIT requests over size: 1000 searches, please refer to #93770.
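
For illustration, a minimal sketch of PIT + search_after pagination, assuming the 8.x @elastic/elasticsearch client; this is not the code from #127273, and the index and page size are placeholders:

```typescript
import { Client } from '@elastic/elasticsearch';

const client = new Client({ node: 'http://localhost:9200' });

// Hypothetical helper: pages through an index with a point-in-time + search_after cursor.
async function fetchAllWithPit(index: string): Promise<unknown[]> {
  const pit = await client.openPointInTime({ index, keep_alive: '1m' });
  const docs: unknown[] = [];
  let searchAfter: any[] | undefined;

  try {
    while (true) {
      const response = await client.search({
        size: 1000,
        pit: { id: pit.id, keep_alive: '1m' },
        // _shard_doc is a cheap, unique tiebreaker available inside a PIT search.
        sort: ['_shard_doc'],
        search_after: searchAfter,
      });
      const hits = response.hits.hits;
      if (hits.length === 0) break;
      docs.push(...hits);
      // The sort values of the last hit become the cursor for the next page.
      searchAfter = hits[hits.length - 1].sort;
    }
  } finally {
    // Always close the PIT so Elasticsearch can release its resources.
    await client.closePointInTime({ id: pit.id });
  }
  return docs;
}
```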

@smith (Contributor) commented Apr 12, 2022

Closing. If it's something we want to do, we can start with:

> However, ideally, the requests in there should use PIT pagination to retrieve all the available data in a less expensive way. For more context on why we prefer PIT requests over size: 1000 searches, please refer to #93770.

@smith closed this as completed Apr 12, 2022