-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[HUDI-915] Add missing partititonpath to records COW #8666
[HUDI-915] Add missing partititonpath to records COW #8666
Conversation
7510bce
to
5d1b90a
Compare
...nt/hudi-client-common/src/main/java/org/apache/hudi/table/action/commit/BaseMergeHelper.java
Outdated
Show resolved
Hide resolved
...lient/hudi-flink-client/src/main/java/org/apache/hudi/table/HoodieFlinkCopyOnWriteTable.java
Outdated
Show resolved
Hide resolved
...-client/hudi-spark-client/src/main/java/org/apache/hudi/client/bootstrap/PartitionUtils.java
Outdated
Show resolved
Hide resolved
...-client/hudi-spark-client/src/main/java/org/apache/hudi/client/bootstrap/PartitionUtils.java
Outdated
Show resolved
Hide resolved
...n/java/org/apache/hudi/client/clustering/run/strategy/MultipleSparkJobExecutionStrategy.java
Outdated
Show resolved
Hide resolved
...-client/hudi-spark-client/src/main/java/org/apache/hudi/client/bootstrap/PartitionUtils.java
Outdated
Show resolved
Hide resolved
hudi-common/src/main/java/org/apache/hudi/io/storage/HoodieBootstrapRecordIterator.java
Outdated
Show resolved
Hide resolved
hudi-common/src/test/java/org/apache/hudi/common/testutils/HoodieTestUtils.java
Show resolved
Hide resolved
...-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/functional/TestBootstrapRead.java
Outdated
Show resolved
Hide resolved
|
||
private static Stream<Arguments> testArgs() { | ||
Stream.Builder<Arguments> b = Stream.builder(); | ||
//TODO: add dash partitions false with [HUDI-4944] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please add more details here.
Also, you could try adding a sleep of 2s and check if timeline server returns correct fs view.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- fixed
- I added a 5 second delay between bootstrapping and the update and was still getting failures at the same rate
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok let's tackle that issue separately but I don't see more details in the comment.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I changed the line below to:
Boolean[] dashPartitions = {true/,false/};
The code is already there to test that, it just doesn't work currently because of the double decoding issue
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jonvex Looks in better shape now. Can you refactor merge helper as suggested in the comments?
...nt/hudi-client-common/src/main/java/org/apache/hudi/table/action/commit/BaseMergeHelper.java
Outdated
Show resolved
Hide resolved
hudi-common/src/main/java/org/apache/hudi/io/storage/HoodieBootstrapRecordIterator.java
Outdated
Show resolved
Hide resolved
hudi-common/src/test/java/org/apache/hudi/common/testutils/HoodieTestUtils.java
Show resolved
Hide resolved
|
||
private static Stream<Arguments> testArgs() { | ||
Stream.Builder<Arguments> b = Stream.builder(); | ||
//TODO: add dash partitions false with [HUDI-4944] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok let's tackle that issue separately but I don't see more details in the comment.
Change Logs
In a bootstrapped table, If there was an upsert in a file group, the records in the file group that were not upserted would have null values for their partition column.
Impact
Now works correctly
Risk level (write none, low medium or high below)
Medium
Lots of tests written
Documentation Update
N/A
Contributor's checklist