Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Don't write metadata file #875

Merged
merged 2 commits into from
Feb 24, 2024
Merged

Don't write metadata file #875

merged 2 commits into from
Feb 24, 2024

Conversation

RobbeSneyders
Copy link
Member

We reintroduced writing the metadata file in #864 to preserve the divisions of the data when writing and reading again. We turned this behavior off in the past, but without proper documentation of the reason.

I'm now running into issues with Dask workers dying when writing large datasets though, presumably because of the metadata file, as documented in these Dask issues:

Also, while I ran into issues with the preservation of divisions before, I can't reproduce this locally with a small example. Let's turn writing metadata off again and validate if we are still having issues with this.

@RobbeSneyders RobbeSneyders merged commit c2b717d into main Feb 24, 2024
8 of 9 checks passed
@RobbeSneyders RobbeSneyders deleted the feature/no-metadata-file branch February 24, 2024 10:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant