Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Support GetArrayStructFields expression #993

Merged
merged 8 commits into from
Oct 7, 2024

Conversation

Kimahriman
Copy link
Contributor

Which issue does this PR close?

Closes #992

Rationale for this change

Support the last complex type extractor for arrays and structs.

What changes are included in this PR?

Adds support for the GetArrayStructFields expression in Spark which extracts a nested struct field of an array of structs as an array of the inner field type.

How are these changes tested?

New expression UT.

@Kimahriman Kimahriman changed the title Array struct fields Support GetArrayStructFields expression Oct 2, 2024
@Kimahriman Kimahriman changed the title Support GetArrayStructFields expression feat: Support GetArrayStructFields expression Oct 2, 2024
@codecov-commenter
Copy link

Codecov Report

Attention: Patch coverage is 72.72727% with 3 lines in your changes missing coverage. Please review.

Project coverage is 54.84%. Comparing base (c3023c5) to head (7fe6963).
Report is 19 commits behind head on main.

Files with missing lines Patch % Lines
.../scala/org/apache/comet/serde/QueryPlanSerde.scala 72.72% 2 Missing and 1 partial ⚠️
Additional details and impacted files
@@              Coverage Diff              @@
##               main     #993       +/-   ##
=============================================
+ Coverage     34.03%   54.84%   +20.81%     
+ Complexity      875      858       -17     
=============================================
  Files           112      109        -3     
  Lines         43289    10841    -32448     
  Branches       9572     2101     -7471     
=============================================
- Hits          14734     5946     -8788     
+ Misses        25521     3821    -21700     
+ Partials       3034     1074     -1960     
Flag Coverage Δ
?

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@@ -2542,6 +2542,25 @@ object QueryPlanSerde extends Logging with ShimQueryPlanSerde with CometExprShim
None
}

case GetArrayStructFields(child, _, ordinal, _, _) =>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To help other reviewers, here is Spark's definition of GetArrayStructFields:

case class GetArrayStructFields(
    child: Expression,
    field: StructField,
    ordinal: Int,
    numFields: Int,
    containsNull: Boolean)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah those other three fields aren't relevant for actual execution

Copy link
Member

@andygrove andygrove left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks @Kimahriman

@andygrove andygrove merged commit b131cc3 into apache:main Oct 7, 2024
75 of 77 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Support GetArrayStructFields
3 participants