-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement serde for CSV and Parquet FileSinkExec #8646
Conversation
@devinjdangelo fyi, this is now ready for review |
@@ -1220,20 +1222,22 @@ message ParquetWriterOptions { | |||
} | |||
|
|||
message CsvWriterOptions { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We have not released a version of DataFusion that contains CsvWriterOptions
yet, so it is safe to change the field numbers here.
This LGTM @andygrove! I am planning to attempt a more significant refactor in #8667 as discussed with @alamb and might be able to simplify adding support for some of the more advanced file writing options, including externally defined file types is the goal. |
Thanks for the reviews @avantgardnerio and @devinjdangelo |
Which issue does this PR close?
Closes #8645
Rationale for this change
Needed by Ballista, so that we can support
DataFrame:write_xxx
again.What changes are included in this PR?
Implement serde for CSV and Parquet FileSinkExec, based on existing support for JSON FileSinkExec.
Are these changes tested?
Yes, new tests added.
Are there any user-facing changes?