[HUDI-6039] Fixing FS based listing for full cleaning in clean Planner #8384

Merged

nsivabalan
Contributor

Change Logs

When the clean planner falls back to full partition cleaning, it performs FS-based listing even though metadata is enabled. This behavior was added in #5169, mainly because of how delete_partition was designed at the time. The delete_partition logic has since evolved, so this listing can now be metadata-based where applicable.
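
The intended behavior can be sketched roughly as follows. This is an illustrative Python sketch, not Hudi's actual code: the function `partitions_for_full_clean`, the `PartitionLister` alias, and the two stand-in listers are hypothetical names introduced only to show the choice between listing sources.

```python
from typing import Callable, List

# Hypothetical alias: a lister returns partition paths relative to the table base path.
PartitionLister = Callable[[], List[str]]


def partitions_for_full_clean(
    metadata_enabled: bool,
    list_from_metadata: PartitionLister,
    list_from_fs: PartitionLister,
) -> List[str]:
    """Choose the listing source when the planner falls back to full cleaning."""
    if metadata_enabled:
        # Metadata-based listing: no scan of the underlying file system.
        return list_from_metadata()
    # Metadata disabled: FS-based listing is the only option.
    return list_from_fs()


# Stand-in listers that record how often each source is consulted.
calls = {"metadata": 0, "fs": 0}


def fake_metadata_lister() -> List[str]:
    calls["metadata"] += 1
    return ["2023/04/05", "2023/04/06"]


def fake_fs_lister() -> List[str]:
    calls["fs"] += 1
    return ["2023/04/05", "2023/04/06"]


result = partitions_for_full_clean(True, fake_metadata_lister, fake_fs_lister)
```

With metadata enabled, only the metadata lister is consulted. Before this change, the full-cleaning fallback would have behaved like the FS branch regardless of the flag.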

Impact

Improves clean planning latency for large tables by avoiding FS-based listing when metadata is enabled.

Risk level (write none, low, medium, or high below)

low.

Documentation Update

N/A

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@nsivabalan nsivabalan force-pushed the clean_planner_revert_fs_based_listing branch from e0515dc to 98414ab Compare April 5, 2023 06:18
@yihua
Contributor

yihua commented Apr 6, 2023

cc @umehrot2

@nsivabalan nsivabalan force-pushed the clean_planner_revert_fs_based_listing branch from 5f94019 to f8aa7b9 Compare April 10, 2023 21:08
@nsivabalan
Contributor Author

@codope @yihua: could one of you review this?

@bvaradar bvaradar self-assigned this Apr 14, 2023
@nsivabalan nsivabalan force-pushed the clean_planner_revert_fs_based_listing branch from 9d90f97 to 08ec5be Compare April 18, 2023 14:17
@nsivabalan nsivabalan force-pushed the clean_planner_revert_fs_based_listing branch from 27c25af to 4064bbe Compare April 19, 2023 04:01

@nsivabalan nsivabalan merged commit 52efac9 into apache:master Apr 21, 2023
yihua pushed a commit to yihua/hudi that referenced this pull request May 15, 2023
apache#8384)

- Looks like when we fallback to full partition cleaning in clean planner, we do FS based listing even though metadata is enabled. It was added in apache#5169 mainly due to how delete_partition was designed back then. Later delete_partition logic evolved and now we should be good to make this metadata based if applicable.