
[NSE-732] Support Map complex type in Shuffle #749

Merged 16 commits into oap-project:master on Mar 8, 2022

Conversation

zhixingheyi-tian (Collaborator) commented Mar 2, 2022

What changes were proposed in this pull request?

  • Implement Map type support in JNI buffer building (a usage sketch follows this list)
  • Support arbitrarily deep nesting of the same complex type (e.g. a Map inside a Map)
  • Support arbitrarily deep nesting across different complex types (e.g. a Map inside a Struct inside an Array)
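
For illustration, a minimal, hypothetical Spark snippet (not one of the PR's UTs) that pushes Map-typed and nested Map columns through an explicit repartition; with the columnar plugin enabled, that exchange is the shuffle path this change targets:

```scala
// Hypothetical sketch: shuffle Map and nested-Map columns as payload.
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

object MapShuffleSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("map-shuffle-sketch")
      .master("local[2]")
      .getOrCreate()
    import spark.implicits._

    val df = Seq(
      (1, Map("a" -> 1), Map("x" -> Seq(1, 2))), // map<string,int>, map<string,array<int>>
      (2, Map("b" -> 2), Map("y" -> Seq(3)))
    ).toDF("id", "m", "nested")

    // Repartitioning on "id" forces a shuffle; the Map columns travel as shuffle payload.
    df.repartition(4, col("id")).collect()

    spark.stop()
  }
}
```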

How was this patch tested?

(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

github-actions bot commented Mar 2, 2022

#732

zhixingheyi-tian (Collaborator, Author):

@zhouyuan
Passed the Jenkins SparkSQL UTs and other workloads.

zhouyuan (Collaborator) commented Mar 7, 2022

@zhixingheyi-tian these configurations are not set in the new tests, so we may lose some coverage. Is this intended?

        .set("spark.oap.sql.columnar.rowtocolumnar", "false")
        .set("spark.oap.sql.columnar.columnartorow", "false")

zhixingheyi-tian (Collaborator, Author) commented Mar 7, 2022

> @zhixingheyi-tian these configurations are not set in the new tests, so we may lose some coverage. Is this intended?
>
>         .set("spark.oap.sql.columnar.rowtocolumnar", "false")
>         .set("spark.oap.sql.columnar.columnartorow", "false")

The native row-to-columnar and columnar-to-row conversions do not yet fully support complex types, so the non-native conversions are used here.
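
For context, a sketch of how such a test configuration looks; the two spark.oap.* keys are the ones quoted above, while the plugin class name and off-heap settings are assumptions about a typical Gazelle setup, not taken from this thread:

```scala
import org.apache.spark.SparkConf

object FallbackConfSketch {
  val conf: SparkConf = new SparkConf()
    .set("spark.plugins", "com.intel.oap.GazellePlugin") // assumed plugin entry point
    .set("spark.memory.offHeap.enabled", "true")         // assumed: native buffers use off-heap memory
    .set("spark.memory.offHeap.size", "2g")
    // Disable the native C2R/R2C operators so vanilla Spark conversions handle
    // complex types that the native path does not fully support yet.
    .set("spark.oap.sql.columnar.rowtocolumnar", "false")
    .set("spark.oap.sql.columnar.columnartorow", "false")
}
```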

zhouyuan (Collaborator) commented Mar 7, 2022

> @zhixingheyi-tian these configurations are not set in the new tests, so we may lose some coverage. Is this intended?
>
>         .set("spark.oap.sql.columnar.rowtocolumnar", "false")
>         .set("spark.oap.sql.columnar.columnartorow", "false")
>
> The native row-to-columnar and columnar-to-row conversions do not yet fully support complex types, so the non-native conversions are used here.

Can you also add the missing build_check() in C2R and R2C?

zhixingheyi-tian (Collaborator, Author):

> @zhixingheyi-tian these configurations are not set in the new tests, so we may lose some coverage. Is this intended?
>
>         .set("spark.oap.sql.columnar.rowtocolumnar", "false")
>         .set("spark.oap.sql.columnar.columnartorow", "false")
>
> The native row-to-columnar and columnar-to-row conversions do not yet fully support complex types, so the non-native conversions are used here.
>
> Can you also add the missing build_check() in C2R and R2C?

Yes, done.
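
A hedged sketch of the kind of build_check() guard being asked for here; the exact signature and placement in the C2R/R2C operators are assumptions, not the PR's actual code:

```scala
// Illustrative only: rejecting complex types in the native C2R/R2C operators makes
// the planner fall back to the vanilla Spark conversions instead of failing at runtime.
import org.apache.spark.sql.types.{ArrayType, MapType, StructType}

object ConversionBuildCheckSketch {
  def buildCheck(schema: StructType): Unit = {
    schema.fields.foreach { field =>
      field.dataType match {
        case _: MapType | _: ArrayType | _: StructType =>
          throw new UnsupportedOperationException(
            s"Native row/columnar conversion does not support ${field.dataType} yet")
        case _ => // primitive type: supported, nothing to check
      }
    }
  }
}
```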

zhixingheyi-tian (Collaborator, Author) commented Mar 7, 2022

@zhouyuan
Also added an operator check on the SQL executedPlan.
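
A minimal sketch of what such an executedPlan check could look like; it only uses Spark 3.1+'s generic ShuffleExchangeLike, since the plugin's concrete columnar exec class name is not spelled out in this thread:

```scala
// Illustrative plan inspection, not the PR's actual UT helper.
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.execution.exchange.ShuffleExchangeLike

object PlanCheckSketch {
  def assertHasShuffle(df: DataFrame): Unit = {
    val plan = df.queryExecution.executedPlan
    val shuffles = plan.collect { case s: ShuffleExchangeLike => s }
    assert(shuffles.nonEmpty, "expected a shuffle exchange in the executed plan")
    // A test for this PR would additionally assert the exchange is the columnar
    // variant when the shuffled schema contains Map/Array/Struct payload columns.
  }
}
```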

zhouyuan (Collaborator) commented Mar 8, 2022

note:

  • Columnar shuffle works fine with array/struct/map as payload.
  • Columnar shuffle fails with array/struct/map as shuffle keys, because Gandiva expressions do not support these types (a rare case).

zhixingheyi-tian (Collaborator, Author):

@zhouyuan
Added a partitioning-key check.
Also added fallback UTs for complex types in partitioning keys.
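
A hedged sketch of a partitioning-key guard along these lines; the helper name and the exact fallback wiring are illustrative assumptions, not the PR's code:

```scala
// Illustrative only: if any hash-partitioning key expression has a complex type,
// report the columnar shuffle as unsupported so the plan falls back to the
// row-based shuffle (Gandiva cannot evaluate hash expressions on these types).
import org.apache.spark.sql.catalyst.plans.physical.{HashPartitioning, Partitioning}
import org.apache.spark.sql.types.{ArrayType, MapType, StructType}

object PartitionKeyCheckSketch {
  def supportsColumnarShuffle(partitioning: Partitioning): Boolean = partitioning match {
    case HashPartitioning(exprs, _) =>
      exprs.forall { expr =>
        expr.dataType match {
          case _: MapType | _: ArrayType | _: StructType => false // Gandiva cannot hash these
          case _ => true
        }
      }
    case _ => true // this sketch only guards hash-partitioning keys
  }
}
```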

zhouyuan merged commit 20379cd into oap-project:master on Mar 8, 2022