Implement Spark’s FunctionCatalog for Existing Transformations #5349

kbendick · 2022-07-24T06:33:50Z

We need to implement Spark’s FunctionCatalog so that we can use the partition transformation functions in queries.

This allows for using the partition transforms on non-partition columns in generated code.

This is necessary in order to write Catalyst rules which will pass bucket, so that storage partitioned joins (aka bucketed joins) can be implemented.

See also:

FunctionCatalog
ScalarFunction interface, which has a practical description of what is needed for code generation and of its benefits.

The functions we have that are likely highest priority:

truncate
bucket
zorder
date transformations
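
As a rough illustration of what the first of these functions computes, here is a minimal, Spark-free sketch of truncate semantics in Java. It assumes the behavior described in the Iceberg table spec (integers are rounded down to a multiple of the width with a non-negative remainder; strings keep their first L code points). The names `TruncateSketch`, `truncateInt`, and `truncateString` are hypothetical and are not the Iceberg or Spark API; a real implementation would be exposed through Spark's ScalarFunction interface rather than as plain static methods.

```java
// Hypothetical sketch of truncate semantics; not the actual Iceberg implementation.
final class TruncateSketch {

    private TruncateSketch() {}

    // Rounds value down to the nearest multiple of width.
    // Math.floorMod keeps the remainder non-negative, so negative inputs
    // round toward negative infinity (e.g. -1 with width 10 gives -10).
    static int truncateInt(int width, int value) {
        if (width <= 0) {
            throw new IllegalArgumentException("width must be positive: " + width);
        }
        return value - Math.floorMod(value, width);
    }

    // Keeps at most the first `length` Unicode code points of value.
    static String truncateString(int length, String value) {
        if (length <= 0) {
            throw new IllegalArgumentException("length must be positive: " + length);
        }
        if (value.codePointCount(0, value.length()) <= length) {
            return value;
        }
        // offsetByCodePoints avoids cutting a surrogate pair in half.
        return value.substring(0, value.offsetByCodePoints(0, length));
    }

    public static void main(String[] args) {
        System.out.println(truncateInt(10, 25));          // 20
        System.out.println(truncateInt(10, -1));          // -10
        System.out.println(truncateString(3, "iceberg")); // ice
    }
}
```

Because these methods are pure functions of their arguments, the same logic can be applied to any column, not only to a partition column, which is the use case this issue is after.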


kbendick · 2022-07-24T07:11:03Z

This will allow us to make use of Spark’s storage partitioned joins (aka bucket joins, which are one subset of the possible join optimizations on transformed columns): https://issues.apache.org/jira/browse/SPARK-37166

kbendick · 2022-07-27T19:57:22Z

This relates to #430

kbendick changed the title ~~Implement Spark’s FunctionCatalog for Existing Transform functions~~ Implement Spark’s FunctionCatalog for Existing Transformations Jul 24, 2022

kbendick mentioned this issue Jul 24, 2022

iceberg bucketing via writeTo api? bucket on a non partitioned column? #4646

Closed

kbendick mentioned this issue Jul 27, 2022

Spark - Implement FunctionCatalog and Truncate #5305

Closed

kbendick mentioned this issue Aug 4, 2022

Spark: Support truncate in FunctionCatalog #5431

Merged

aokolnychyi closed this as completed in #5431 Aug 12, 2022

kbendick mentioned this issue Aug 12, 2022

Spark 3.3 - Support bucket in FunctionCatalog #5513

Merged
