crash on binary data, native support for compressed csv? #2301
Comments
I would consider this a feature :-)-O and on Linux and the Mac you can easily filter for that and then put it into a pipe in front of The binary format supported is While I like |
Crashing with a |
Hi @wardi, As such, its top goal is performance. That's why I chose to support Snappy instead of more popular compression formats like gz and zip. Another goal of qsv is composability, so as you and @ondohotola pointed out, qsv can easily be used with other purpose-built command-line tools. But you're right, qsv should at least check for supported formats and fail gracefully rather than panic. Currently, it already has logic to detect CSV, TSV/TAB and SSV formats and their Snappy-compressed variants (csv.sz, tsv.sz, tab.sz and ssv.sz), set the default delimiter accordingly, and compress/decompress automatically. That logic could easily be extended. In the meantime, you may want to use |
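For readers unfamiliar with the extension-based detection described above, here is a minimal sketch in Rust. It is not qsv's actual code: the `InputFormat` struct and `detect_format` function are hypothetical names, and the semicolon delimiter for SSV is an assumption about what "SSV" means here.

```rust
use std::path::Path;

/// Hypothetical result type for this sketch; not qsv's internals.
#[derive(Debug, PartialEq)]
struct InputFormat {
    delimiter: u8,
    snappy: bool,
}

/// Maps a file name to a default delimiter and a Snappy flag,
/// or returns None when the extension is not recognised.
fn detect_format(path: &str) -> Option<InputFormat> {
    let name = Path::new(path).file_name()?.to_str()?.to_ascii_lowercase();
    // A trailing ".sz" marks a Snappy-compressed variant (csv.sz, tsv.sz, ...).
    let (stem, snappy) = match name.strip_suffix(".sz") {
        Some(rest) => (rest, true),
        None => (name.as_str(), false),
    };
    // The remaining extension selects the default delimiter.
    let ext = stem.rsplit_once('.')?.1;
    let delimiter = match ext {
        "csv" => b',',
        "tsv" | "tab" => b'\t',
        "ssv" => b';', // assumption: SSV is semicolon-separated
        _ => return None,
    };
    Some(InputFormat { delimiter, snappy })
}

fn main() {
    for p in ["data.csv", "data.tsv.sz", "mybigdata.csv.gz"] {
        match detect_format(p) {
            Some(f) => println!("{}: {:?}", p, f),
            // Report the problem instead of panicking.
            None => eprintln!("{}: unsupported input format", p),
        }
    }
}
```

Returning `None` for anything unrecognised, such as `.csv.gz`, gives the caller a chance to print a clear message rather than hand binary data to the CSV parser.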
Thanks @jqnatividad, maybe when I finally get into Rust I could send a PR with some more automatic stream compression/decompression formats |
Hi @wardi , Also added a more human-friendly panic handler with the |
Ended up simplifying input format checking to just check for supported file extensions, and removed the MIME-type sniffing as it was causing false-positive failures in the CI property tests. |
The
this was made possible by enabling polars For suite-wide auto decompression support beyond |
Describe the bug
qsv crashes if given binary data
To Reproduce
Expected behavior
Report an error with the expected format, or for bonus points handle .gz, .bz2, .xz etc automatically
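As an illustration of the requested behavior, here is a rough Rust sketch that recognises common compressed formats by their leading magic bytes and returns an error instead of crashing. This is not what qsv does (the thread notes that content sniffing was dropped in favor of extension checks). The function names `sniff_compression` and `check_input` are hypothetical.

```rust
use std::io::Read;

/// Returns the name of a known compression format if `head`
/// starts with that format's magic bytes.
fn sniff_compression(head: &[u8]) -> Option<&'static str> {
    const MAGIC: [(&[u8], &str); 4] = [
        (&[0x1f, 0x8b], "gzip"),
        (b"BZh", "bzip2"),
        (&[0xfd, b'7', b'z', b'X', b'Z', 0x00], "xz"),
        (&[0x28, 0xb5, 0x2f, 0xfd], "zstd"),
    ];
    MAGIC
        .iter()
        .find(|(sig, _)| head.starts_with(sig))
        .map(|(_, name)| *name)
}

/// Reads the first few bytes and reports compressed input as an error.
/// Note: this consumes those bytes, and a single read may return fewer
/// than six bytes on a pipe; a real tool would peek through a buffer.
fn check_input(mut input: impl Read) -> Result<(), String> {
    let mut head = [0u8; 6];
    let n = input.read(&mut head).map_err(|e| e.to_string())?;
    match sniff_compression(&head[..n]) {
        Some(kind) => Err(format!(
            "input looks like {} data; decompress it first (e.g. with zcat)",
            kind
        )),
        None => Ok(()),
    }
}

fn main() {
    let gz: &[u8] = &[0x1f, 0x8b, 0x08, 0x00];
    let csv: &[u8] = b"a,b\n1,2\n";
    println!("{:?}", check_input(gz));
    println!("{:?}", check_input(csv));
}
```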
Screenshots/Backtrace/Sample Data
Desktop (please complete the following information):
Additional context
Happily, qsv does work fine in a pipeline like:
zcat mybigdata.csv.gz | qsv stats