-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data Frame Analytics memory estimation ignores Query #49454
Comments
Pinging @elastic/ml-core (:ml) |
adding the exception as well so other people running into this can find this issue easier: |
dimitris-athanasiou
added a commit
to dimitris-athanasiou/elasticsearch
that referenced
this issue
Nov 24, 2019
dimitris-athanasiou
added a commit
to dimitris-athanasiou/elasticsearch
that referenced
this issue
Nov 25, 2019
dimitris-athanasiou
added a commit
that referenced
this issue
Nov 25, 2019
dimitris-athanasiou
added a commit
to dimitris-athanasiou/elasticsearch
that referenced
this issue
Nov 25, 2019
dimitris-athanasiou
added a commit
that referenced
this issue
Nov 25, 2019
dimitris-athanasiou
added a commit
that referenced
this issue
Dec 2, 2019
jpountz
changed the title
[ML] Data Frame Analytics memory estimation ignores Query
Data Frame Analytics memory estimation ignores Query
Dec 18, 2019
This was referenced Feb 3, 2020
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The Memory estimation process uses the extractor against the source index before the data frame analytics process runs. But, in doing this, it ignores the the data frame analytics source query.
Extractor simply does a match_all: https://github.com/elastic/elasticsearch/blob/1dd816f030e256f497c1726695d1209a481b9d8e/x-pack/plugin/ml/src/main/java/org/elasticsearch/xpack/ml/dataframe/extractor/DataFrameDataExtractorFactory.java#L42..L58
And the estimation process uses the extractor to determine row and column counts: https://github.com/elastic/elasticsearch/blob/12528b351a99941ffc1d44dd2c0265768ef79635/x-pack/plugin/ml/src/main/java/org/elasticsearch/xpack/ml/dataframe/process/MemoryUsageEstimationProcessManager.java#L59..L60
The memory estimator needs to account for a defined query when estimating against the source index.
The text was updated successfully, but these errors were encountered: