-
Notifications
You must be signed in to change notification settings - Fork 5.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add hudi integeration tests and bump version #17463
Conversation
569fb47
to
c79bb33
Compare
Some tests are expected to fail till #17477 merged. |
@codope @pratyakshsharma @arunthirupathi would you like to take a review on this PR? |
Holding. To be fixed after hudi 0.11 is released. |
379d7f0
to
e931539
Compare
@codope @pratyakshsharma @arunthirupathi This PR is ready for review. Could you take a review on this PR? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@7c00 Thanks for upgrading Hudi version and adding more coverage for Hudi tables. Overall looks good to me, just a few minor comments
presto-hive/src/main/java/com/facebook/presto/hive/util/HudiRealtimeSplitConverter.java
Show resolved
Hide resolved
presto-hive/src/test/java/com/facebook/presto/hive/hudi/HudiTestingDataGenerator.java
Outdated
Show resolved
Hide resolved
@arunthirupathi Could you please take a look when you get a chance? This PR simply upgrades Hudi version, and adds more coverage for hudi table querying. |
@codope Thanks for your comments. Have updated the code. Please take a second look. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @7c00 for addressing the comments. Looks good to me.
@kewang1024 @arunthirupathi Can you please review this PR? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If #17149 gets merge, this PR's content needs to be deleted and moved to Hudi connector's specific directory right
Do we know the timeline for merging Hudi's connector?
presto-hive/src/test/java/com/facebook/presto/hive/hudi/HudiTestUtils.java
Outdated
Show resolved
Hide resolved
assertQuery(format(sqlTemplate, "stock_ticks_mor_rt"), sqlResult); | ||
assertQuery(format(sqlTemplate, "stock_ticks_morn_ro"), sqlResultReadOptimized); | ||
assertQuery(format(sqlTemplate, "stock_ticks_morn_rt"), sqlResult); | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we comment somewhere the structures of those tables?
For other people, it seems hard to understand what we're testing against and can't verify if the expected the results are correct
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I added a doc hudi-testing-data.md
to describe the testing data, and inserted a comment at the begining of HudiTestingDataGenerator#generateData
to guide people to the doc.
@kewang1024 Thanks for reviewing. The tests added here will continue to cover the current integration of Hudi with presto-hive connector. We will refactor some parts of this PR to enhance reusability across presto-hive and presto-hudi. Once we merge this PR, then we can rebase the presto-hudi connector PR and try to merge that asap contingent on reviews. |
@7c00 let me know if the comments make sense? |
Hi, @kewang1024 Thanks for your comments! I have addressed the issues. Could you pelase take a second review? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, thanks for the changes
One Nit: can you move the recent change to the first commit instead of the second commit
(removing the main function and adding test description file)
Thanks @kewang1024. The two commits are refined.
|
presto-hive/src/test/java/com/facebook/presto/hive/hudi/HudiTestingDataGenerator.java
Outdated
Show resolved
Hide resolved
presto-hive/src/test/java/com/facebook/presto/hive/hudi/HudiTestingDataGenerator.java
Outdated
Show resolved
Hide resolved
presto-hive/src/test/java/com/facebook/presto/hive/hudi/HudiTestingDataGenerator.java
Outdated
Show resolved
Hide resolved
presto-hive/src/test/java/com/facebook/presto/hive/hudi/HudiTestingDataGenerator.java
Outdated
Show resolved
Hide resolved
@highker Thanks for your comments. I solved the issues mentioned. Lets merge this PR? |
The PR is merged. Thanks @codope @kewang1024 @highker ❤️ |
This PR adds integration tests for querying Hudi tables. It could help prevent minor bugs when upgrading Hudi dependency versions.
The testing data is generated followed by the tutor at https://hudi.apache.org/docs/docker_demo (step1-step7).
This PR is extracted from #17149, and is to be shared by Hudi connector until Hudi connector gets production ready and the code for Hudi support is removed from Hive connector.
Test plan - Tests added.