[QST] Performance question #5333

Answered by jlowe
eyalhir74 asked this question in General

> I'm using RAPIDS Accelerator 21.12.0 using cudf 21.12.0

Given the relatively high cost of the contiguous_split kernel in your traces, I suggest updating to the 22.02 release of the RAPIDS Accelerator and cudf. That release includes some performance fixes for contiguous_split that may help your use case.
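As a rough illustration of what the upgrade changes on the command line, the sketch below builds the jar and plugin arguments for `spark-submit`. The jar file names, the `/opt/sparkRapidsPlugin` directory, and the `cuda11` classifier are assumptions made for the example and should be checked against the files actually downloaded for the release; only the `spark.plugins=com.nvidia.spark.SQLPlugin` setting is the plugin's standard entry point.

```python
def rapids_submit_args(version="22.02.0",
                       jar_dir="/opt/sparkRapidsPlugin",
                       cuda="cuda11"):
    """Build spark-submit arguments for a given RAPIDS Accelerator release.

    The jar naming pattern and jar_dir are assumptions for this sketch;
    verify them against the downloaded release artifacts.
    """
    plugin_jar = f"{jar_dir}/rapids-4-spark_2.12-{version}.jar"
    cudf_jar = f"{jar_dir}/cudf-{version}-{cuda}.jar"
    return [
        "--jars", f"{plugin_jar},{cudf_jar}",
        # Standard way to enable the RAPIDS Accelerator SQL plugin.
        "--conf", "spark.plugins=com.nvidia.spark.SQLPlugin",
    ]


if __name__ == "__main__":
    # Print the arguments so they can be pasted after `spark-submit`.
    print(" ".join(rapids_submit_args()))
```

Keeping the plugin and cudf versions in one parameter makes it harder to mix a 22.02 plugin jar with a 21.12 cudf jar by accident.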

> Going through the explain plan, I see a few non-GPU compliant ops

According to the physical plan and stage runtime statistics, I don't think the impact of this is significant. The only operations that aren't running on the GPU are the Project and CollectLimit occurring right at the end of the query. This is the last stage of the job, which appears to have taken only 0.4 seconds, while the ma…
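For anyone who wants to repeat this check on their own query, here is a minimal sketch that scans the text of a physical plan and lists the operators whose names do not start with `Gpu`. The sample plan is made up for illustration and is not the plan from this question; the helper name `non_gpu_operators` is hypothetical, and the parsing assumes one operator per line in the usual `+-` tree layout, so wrapped continuation lines in a real plan may be misread.

```python
import re

# Operator name at the start of a plan line, after tree-drawing characters
# ("+-", ":", "|", spaces) and an optional whole-stage-codegen marker "*(3) ".
_OPERATOR = re.compile(r"^[\s:+\-|]*(?:\*\(\d+\)\s*)?([A-Za-z]\w*)")


def non_gpu_operators(plan_text):
    """Return operator names in plan_text that do not start with 'Gpu'."""
    found = []
    for line in plan_text.splitlines():
        match = _OPERATOR.match(line)
        if match and not match.group(1).startswith("Gpu"):
            found.append(match.group(1))
    return found


# Made-up plan text, shaped like the case described above: only the last
# two operators of the query run on the CPU.
SAMPLE_PLAN = """\
== Physical Plan ==
CollectLimit 21
+- Project [cast(total#10L as string) AS total#20]
   +- GpuColumnarToRow false
      +- GpuProject [total#10L]
         +- GpuFilter (total#10L > 0)
"""

if __name__ == "__main__":
    print(non_gpu_operators(SAMPLE_PLAN))  # ['CollectLimit', 'Project']
```

The plugin can also report this itself: setting `spark.rapids.sql.explain=NOT_ON_GPU` makes it log the operators that stay on the CPU along with the reason, which is usually more informative than parsing plan text.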

Answer selected by sameerz
Labels: question (Further information is requested)
This discussion was converted from issue #4894 on April 27, 2022 15:33.