forked from elastic/elasticsearch
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ML] Reduce chance of timeout in serverless ML autoscaling
If ML serverless autoscaling fails to return a response within the configured timeout period then the control plane autoscaler will log an error. Too many of these errors will raise an alert, therefore as much as possible should be done on the ML side to _not_ time out. Previously there were two possible causes of timeouts: 1. If a request for node stats from all ML nodes timed out 2. If a request to refresh the ML memory tracker timed out The first case can happen if a node leaves the cluster at a bad time and the message sent to it gets lost. The second case can happen if searching the ML results indices for model size stats documents is slow. We can avoid timeouts in these two situations as follows: 1. There was no need to use the API to get the only value from the node stats that the autoscaler needs to know - the total amount of memory on each ML node is stored in a node attribute on startup so exists in cluster state 2. When we refresh the ML memory tracker we can just return stats that instruct the autoscaler to do nothing until the refresh is complete - this is functionally the same as timing out each request, but without generating error messages
- Loading branch information
1 parent
31736fc
commit 30926e5
Showing
3 changed files
with
55 additions
and
256 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.