Optimizing Expand+Aggregate in sqls with many count distinct #10798
Conversation
build
Signed-off-by: Hongbin Ma (Mahone) <mahongbin@apache.org>
force-pushed from d4a8261 to cdc867d
build
@revans2 @abellina @winningsix can you please take a look at this PR? We're going to pack a debug build based on it.
```scala
case class NullVecKey(d: DataType, n: Int)

class NullVecCache(private val maxNulls: Int)
  extends util.LinkedHashMap[NullVecKey, GpuColumnVector](100, 0.75f, true) {
```
I really don't understand why we are extending a map instead of wrapping it, or better yet using some other cache data structure built for this type of use case.
If we wrapped it, we could get true LRU functionality and be able to reset the priority on a read. It would also let us avoid overriding remove so that it throws; that API would simply not exist.
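A minimal sketch of the wrapping approach described here, reusing the NullVecKey and GpuColumnVector types from the diff; the class name, the entry-count eviction policy, and the close-on-evict behavior are illustrative assumptions, not the PR's actual code:

```scala
import java.util

// Wraps a LinkedHashMap (access order = true) instead of extending it.
// Only get/put/clear are exposed, so callers cannot remove entries
// directly, and every get refreshes the entry's LRU priority.
class LruNullVecCache(maxEntries: Int) {
  private val underlying =
    new util.LinkedHashMap[NullVecKey, GpuColumnVector](100, 0.75f, true) {
      override def removeEldestEntry(
          eldest: util.Map.Entry[NullVecKey, GpuColumnVector]): Boolean = {
        val evict = size() > maxEntries
        if (evict) {
          eldest.getValue.close() // release the vector being evicted
        }
        evict
      }
    }

  def get(key: NullVecKey): Option[GpuColumnVector] =
    Option(underlying.get(key))

  def put(key: NullVecKey, value: GpuColumnVector): Unit =
    underlying.put(key, value)

  def clear(): Unit = {
    underlying.values().forEach(v => v.close())
    underlying.clear()
  }
}
```

Because the map itself is private, the remove-throws override in the current code becomes unnecessary: there is simply no way for a caller to remove an entry out from under the cache.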
```scala
// This is only for ExpandExec which will generate a lot of null vectors
case class NullVecKey(d: DataType, n: Int)

class NullVecCache(private val maxNulls: Int)
```
The data stored in the cache needs to be spillable in some form. Eventually it would be nice to be able to simply delete values from the cache instead of spilling them, but in the short term we need to make sure that everything stored in the cache is spillable.
It would also be really nice to have a timeout of some kind: if an entry is unused for a specific amount of time, it should be deleted to avoid adding more memory pressure to the system.
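One way to get the timeout behavior suggested here is to stamp each entry with its last access time and evict stale entries opportunistically on each access; a rough sketch under that assumption (all names are illustrative, and a real implementation would still need the stored vectors to be spillable):

```scala
import java.util.concurrent.ConcurrentHashMap

// Evicts entries that have not been read for ttlMillis. Eviction piggybacks
// on get/put rather than running on a background thread, to keep the sketch
// simple.
class TtlCache[K, V <: AutoCloseable](ttlMillis: Long) {
  private final class Entry(val value: V, @volatile var lastAccess: Long)
  private val map = new ConcurrentHashMap[K, Entry]()

  def get(key: K): Option[V] = {
    evictStale()
    Option(map.get(key)).map { e =>
      e.lastAccess = System.currentTimeMillis()
      e.value
    }
  }

  def put(key: K, value: V): Unit = {
    evictStale()
    val old = map.put(key, new Entry(value, System.currentTimeMillis()))
    if (old != null) old.value.close() // close any value we displaced
  }

  private def evictStale(): Unit = {
    val now = System.currentTimeMillis()
    val it = map.entrySet().iterator()
    while (it.hasNext) {
      val e = it.next()
      if (now - e.getValue.lastAccess > ttlMillis) {
        e.getValue.value.close()
        it.remove()
      }
    }
  }
}
```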
Please retarget to 24.08
build
@wjxiz1992 query perf passes
@GaryShen2008, I suggest moving this PR to 24.10 because of the quoted reason
Please retarget to the 24.10 branch.
…ze_expand Signed-off-by: Hongbin Ma (Mahone) <mahongbin@apache.org>
Signed-off-by: Hongbin Ma (Mahone) <mahongbin@apache.org>
Hi @revans2, I simplified the code so that we no longer need to worry about the side effects of globally caching null vectors. The cache reuse ratio will be lower than in the previous version, but it should suffice for our customer's use case (a query with a lot of count distincts). Please review again.
I have a few nits, but I think there are still some comments from @abellina that need to be addressed
```scala
val newColumns = boundExprs.safeMap(_.columnarEval(cb)).toArray[ColumnVector]
new ColumnarBatch(newColumns, cb.numRows())
try {
  GpuExpressionsUtils.cachedNullVectors.get.clear()
```
There are more ways than just this method to run an expression, so I don't trust this to fix it every time. It is probably good enough in practice, but I don't like the precedent it is setting. At a minimum I want to see some comments here explaining what is happening and why, preferably with a follow-on issue to fix this once we have the ability to delete a buffer from the spill framework instead of spilling it.
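The kind of explanatory comment being asked for might look something like this (hypothetical wording, anchored at the clear() call in the snippet above):

```scala
// Clear the null-vector cache before evaluating the projection so that
// vectors cached for a previous batch do not stay alive across batches.
// NOTE: expressions can be evaluated through code paths that never reach
// this method, so this is best-effort. A follow-on issue should remove the
// need for this once the spill framework can delete a cached buffer
// instead of spilling it.
GpuExpressionsUtils.cachedNullVectors.get.clear()
```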
Signed-off-by: Hongbin Ma (Mahone) <mahongbin@apache.org>
build
build
build
close #10799
Fixes #10799. This PR optimizes the Expand and Aggregate execs in the first stage of a SQL query with many count distinct measures.
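For context, the kind of query shape that produces this plan; the table and column names below are made up for illustration, assuming a SparkSession named spark:

```scala
// Spark rewrites multiple distinct aggregates into an Expand that replicates
// each input row once per distinct aggregate (plus one row for non-distinct
// aggregates), filling the columns that do not apply to a replica with nulls.
// That is why this plan generates so many null vectors, which the cache in
// this PR targets.
spark.sql("""
  SELECT category,
         COUNT(DISTINCT user_id),
         COUNT(DISTINCT session_id),
         COUNT(DISTINCT item_id)
  FROM events
  GROUP BY category
""")
```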
The optimizations in this PR include: