-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Collect separate statistics for failed tasks #11327
Conversation
This is initial work towards #10734. An alternative approach is in #11317 but I like this one better. It is a bit more code but it is more straightforward as we are only adding fields we need to. The #11317 added whole |
@arhimondr please skim if it looks like a good direction to you. |
private final DataSize userMemoryReservation; | ||
private final DataSize totalMemoryReservation; | ||
private final Duration totalCpuTime; | ||
private final Duration failedCpuTime; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: maybe "wasted" (here and in other places)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i'd prefer "failed" too.
btw do you plan anything like speculative execution, where work could become "wasted" without being "failed"?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
eg "cancelled"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah - that will come up. But even then "wasted" is wrong term really. It has negative connotation, and if we are up to kill some tasks to speed up the execution of whole query it should not be seen as we are doing something wrong. And "wasting" resources feels wrong. So maybe some other term - neither "failed" nor "wasted" - but nothing comes to my mind.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FWIW, I think "failed" is ok. The failure could be intrinsic (cluster / resource / system issues) or induced (task was forced to abort). In either case, it failed to complete.
081329d
to
cde5595
Compare
core/trino-main/src/main/resources/webapp/src/components/QueryDetail.jsx
Show resolved
Hide resolved
eea094d
to
288e251
Compare
288e251
to
a574043
Compare
Description
Collect separate statistics for failed tasks.
Show aggregated CPU/Scheduled time and cumulative memory for failed tasks in Web UI.
improvement
query engine/UI
Related issues, pull requests, and links
fixes: #10734
improves on: #10754
Documentation
(x) No documentation is needed.
( ) Sufficient documentation is included in this PR.
( ) Documentation PR is available with #prnumber.
( ) Documentation issue #issuenumber is filed, and can be handled later.
Release notes
( ) No release notes entries required.
(x) Release notes entries required with the following suggested text: