[Ingest Manager] Better default value for fleet long polling timeout #76393

nchaulet · 2020-09-01T15:32:51Z

Summary

Resolve #75552

Increasing the long polling timeout allow Fleet to support a higher number of agents.

Done in this PR:

Set the default long polling timeout to 5 minutes instead of 1 minutes
Set the socket idle timeout for the agent checkin route to the xpack.ingestManager.fleet.pollingRequestTimeout. (so we can have a different timeout for Fleet than the rest of Kibana)

elasticmachine · 2020-09-01T15:32:54Z

Pinging @elastic/ingest-management (Team:Ingest Management)

jen-huang · 2020-09-01T17:18:28Z

x-pack/plugins/ingest_manager/server/services/agents/checkin/state_new_actions.ts

+        // Set a timeout 3s before the real timeout to have a chance to respond an empty response before socket timeout
+        Math.max((appContextService.getConfig()?.fleet.pollingRequestTimeout ?? 0) - 3000, 3000)


does this mean we can run into an edge case where the response can come in at almost the request time out limit? let's say 2 seconds before the limit. will this logic cause that to be ignored and treated as an empty response?

Yes we can have an edge case here
The arbitrary 3s delay try to mitigate the risk of not sending any response (the way Kibana handle socket timeout the socket is destroyed immediately so we cannot send any data), but it can still happen if we some huge delay in the event loop.
If the response is happening like 2 seconds before the socket timeout we will send an empty response but it will be fixed during the next checkin.

@jen-huang let me know if I can still clarify that :)

it will be fixed during the next checkin

ah got it. that sounds acceptable to me 👍

kibanamachine · 2020-09-01T18:01:49Z

💚 Build Succeeded

continuous-integration/kibana-ci/pull-request
Commit: 33e7f7d

Build metrics

page load bundle size

id	value	diff	baseline
ingestManager	467.8KB	+321.0B	467.5KB

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

…lastic#76393)

* master: (340 commits) [data.search.SearchSource] Remove legacy ES client APIs. (elastic#75943) [release notes] automatically retry on Github API 5xx errors (elastic#76447) [es_ui_shared] Fix eslint exhaustive deps rule (elastic#76392) [i18n] Integrate 7.9.1 Translations (elastic#76391) [APM] Update aggregations to support script sources (elastic#76429) [Security Solution] Refactor Network Top Countries to use Search Strategy (elastic#76244) Document security settings available on ESS (elastic#76513) [Ingest Manager] Add input revision to the config send to the agent (elastic#76327) [DOCS] Identifies cloud settings for Monitoring (elastic#76579) [DOCS] Identifies Cloud settings in Dev Tools (elastic#76583) [Ingest Manager] Better default value for fleet long polling timeout (elastic#76393) [data.indexPatterns] Fix broken rollup index pattern creation (elastic#76593) [Ingest Manager] Split Registry errors into Connection & Response (elastic#76558) [Security Solution] add an excess validation instead of the exact match (elastic#76472) Introduce TS incremental builds & move src/test_utils to TS project (elastic#76082) fix bad merge (elastic#76629) [Newsfeed] Ensure the version format when calling the API (elastic#76381) remove server_extensions mixin (elastic#76606) Remove legacy applications and legacy mode (elastic#75987) [Discover] Fix sidebar element focus behavior when adding / removing columns (elastic#75749) ...

…76393) (#76644)

[Ingest Manager] Better default value for fleet long polling timeout

33e7f7d

nchaulet added v8.0.0 release_note:skip Skip the PR/issue when compiling release notes v7.10.0 Team:Fleet Team label for Observability Data Collection Fleet team Ingest Management:beta2 labels Sep 1, 2020

nchaulet requested a review from a team September 1, 2020 15:32

nchaulet self-assigned this Sep 1, 2020

jen-huang reviewed Sep 1, 2020

View reviewed changes

nchaulet requested a review from jen-huang September 2, 2020 17:42

jen-huang approved these changes Sep 3, 2020

View reviewed changes

nchaulet merged commit 3d91157 into elastic:master Sep 3, 2020

nchaulet deleted the feature-fleet-better-polling-timeout branch September 3, 2020 15:06

nchaulet mentioned this pull request Sep 3, 2020

[7.x] [Ingest Manager] Better default value for fleet long polling timeout (#76393) #76644

Merged

nchaulet added a commit to nchaulet/kibana that referenced this pull request Sep 3, 2020

[Ingest Manager] Better default value for fleet long polling timeout (e…

1e1f2dc

…lastic#76393)

nchaulet added a commit that referenced this pull request Sep 3, 2020

[Ingest Manager] Better default value for fleet long polling timeout (#…

2312fa9

…76393) (#76644)

nchaulet mentioned this pull request Sep 9, 2020

[Ingest Manager] Increase kibana client timeout to 5 minutes elastic/beats#21037

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Ingest Manager] Better default value for fleet long polling timeout #76393

[Ingest Manager] Better default value for fleet long polling timeout #76393

nchaulet commented Sep 1, 2020

elasticmachine commented Sep 1, 2020

jen-huang Sep 1, 2020

nchaulet Sep 1, 2020

nchaulet Sep 2, 2020 •

edited

Loading

jen-huang Sep 3, 2020

kibanamachine commented Sep 1, 2020

		// Set a timeout 3s before the real timeout to have a chance to respond an empty response before socket timeout
		Math.max((appContextService.getConfig()?.fleet.pollingRequestTimeout ?? 0) - 3000, 3000)

[Ingest Manager] Better default value for fleet long polling timeout #76393

[Ingest Manager] Better default value for fleet long polling timeout #76393

Conversation

nchaulet commented Sep 1, 2020

Summary

elasticmachine commented Sep 1, 2020

jen-huang Sep 1, 2020

Choose a reason for hiding this comment

nchaulet Sep 1, 2020

Choose a reason for hiding this comment

nchaulet Sep 2, 2020 • edited Loading

Choose a reason for hiding this comment

jen-huang Sep 3, 2020

Choose a reason for hiding this comment

kibanamachine commented Sep 1, 2020

💚 Build Succeeded

Build metrics

page load bundle size

nchaulet Sep 2, 2020 •

edited

Loading