[Data] Incorrect StageSummaryStats
execution time calculated
#37105
Labels
bug
Something that is supposed to be working; but isn't
data
Ray Data-related issues
P0
Issues that should be fixed in short order
Ray 2.6
release-blocker
P0 Issue that blocks the release
What happened + What you expected to happen
The stats for a Dataset generated from
Read->SplitBlocks(n)->MapBatches
contains an incorrectly duplicated execution time summary; see example below.Initial hypothesis is that this is caused from inheriting incorrect stats information during operator fusion / stats generation.
Versions / Dependencies
ray master
Reproduction script
The block execution summary string (first line) is duplicated incorrectly for the two stages:
We should expect Stage 2 to take close to 5 seconds, due to the
time.sleep(5)
in the mapped function. However, we see the same time as Stage 1.Issue Severity
None
The text was updated successfully, but these errors were encountered: