[SQL][TEST][FOLLOWUP] Re-run collation benchmark (NonASCII) #47054

uros-db · 2024-06-21T08:22:20Z

What changes were proposed in this pull request?

Following up on #47030, re-running the collation benchmark for NonASCII.

Why are the changes needed?

We've changed the meaning of LCASE collation in Spark, and also modified how equality checks, hashing, and expressions work with this collation, so we need to re-run the benchmarks and identify areas for improvement.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Existing tests.

Was this patch authored or co-authored using generative AI tooling?

No.

cloud-fan · 2024-06-24T06:31:24Z

thanks, merging to master!

### What changes were proposed in this pull request?

Following up on apache#47030, re-running the collation benchmark for NonASCII.

### Why are the changes needed?

We've changed the meaning of LCASE collation in Spark, and also modified how equality checks / hashing/ expressions work with this collation, so we need to re-run the benchmarks and identify areas of improvement.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Existing tests.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#47054 from uros-db/collation-benchmarks-nonascii.

Authored-by: Uros Bojanic <157381213+uros-db@users.noreply.github.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>

NonASCII results

8a98a15

github-actions bot added the SQL label Jun 21, 2024

uros-db changed the title ~~[SQL][TEST][FOLLOWUP] Re-run collation benchmark (NonASCII)~~ [WIP][SQL][TEST][FOLLOWUP] Re-run collation benchmark (NonASCII) Jun 21, 2024

Update results

2470636

uros-db changed the title ~~[WIP][SQL][TEST][FOLLOWUP] Re-run collation benchmark (NonASCII)~~ [SQL][TEST][FOLLOWUP] Re-run collation benchmark (NonASCII) Jun 21, 2024

cloud-fan approved these changes Jun 24, 2024

cloud-fan closed this in 31fa9d8 Jun 24, 2024
