-
-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(rust!): Rename to CsvParserOptions
to CsvReaderOptions
, use in CsvReader
#15919
Conversation
} else { | ||
None | ||
CommentPrefix::Multi(prefix.to_string()) | ||
} | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I removed the arbitrary 5 character limit here. Not sure why it was there in the first place 🤔 the original PR/issue do not mention anything about the limit: #12519
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The API mentions the 5 character limit here, so that should probably updated too.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point, updated. Thanks!
ac72a9e
to
79c3cc8
Compare
CodSpeed Performance ReportMerging #15919 will improve performances by 22.17%Comparing Summary
Benchmarks breakdown
|
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #15919 +/- ##
==========================================
+ Coverage 81.24% 81.25% +0.01%
==========================================
Files 1382 1382
Lines 176628 176615 -13
Branches 3032 3032
==========================================
+ Hits 143494 143505 +11
+ Misses 32649 32627 -22
+ Partials 485 483 -2
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
To keep going back and forth, I am more fan of |
We can revert before the next Rust release if you want. But I don't think CsvParserOptions makes sense if we consider all file formats. Parquet will have an options struct but not really a 'parser' since it's not flat text. So ParquetParserOptions would be wrong. I think all Parser structs will accept multiple options structs: CloudOptions, PartitioningOptions, perhaps others. And options specific to that file format. Since the options differ between reading and writing, it makes sense to have ParquetReaderOptions and ParquetWriterOptions, same for Csv/Ipc/etc. I don't think it's confusing for those to exist next to a CloudOptions struct and possibly others. Maybe |
@stinodego , @ritchie46 , I'm currently doing some work to improve the way we read lists of CSV files. As part of that I need to do some refactoring around the options structs, mainly so that they're cheaper to clone. Can I take it from here? |
@nameexhaustion go for it! |
Changes
CsvParserOptions
toCsvReaderOptions
CsvReaderOptions
toCsvReader
.There are many more places internally where we can use
CsvReaderOptions
- this will be implemented in followup PRs