Support additional output types #31
Comments
Hi @JohnPaton. At work we plan to extend the CLI to handle parquet files. Are you interested in a PR? It would be a post-processing CLI script. We'll base our work on #38, as Poetry allows console scripts that depend on an extra. That way we could further integrate with the existing CLI and download utilities, but we can discuss the details in the eventual PR...
Hey, I think more output formats would be great and parquet is an obvious choice, though I guess we'll need to make some smart choices about partitioning.
Maybe we could start a separate module for post-processing, and add it to the CLI in a follow-up PR?
Sure. This can be done with a plugin architecture that lets a separate package add post-processing formats by declaring entry points. I have done something like this on two projects before. I will prepare a draft PR to illustrate the approach.
Alright, I have no experience in this direction so I'm happy to see what you come up with!
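
For readers unfamiliar with the entry-point approach mentioned above, here is a minimal sketch of how discovery could work. It is not the draft PR: the group name `example_cli.postprocessors`, the `Writer` signature, and the function names are all hypothetical. A plugin package using Poetry would register itself under a `[tool.poetry.plugins."example_cli.postprocessors"]` table in its `pyproject.toml`, and the main package would pick it up at runtime with the standard library's `importlib.metadata`.

```python
import csv
import io
from importlib.metadata import entry_points
from typing import Callable, Dict, Iterable, TextIO

# Hypothetical entry point group; the real project may choose another name.
GROUP = "example_cli.postprocessors"

# Hypothetical plugin contract: take an iterable of row dicts and an open text file.
Writer = Callable[[Iterable[dict], TextIO], None]


def write_csv(rows: Iterable[dict], out: TextIO) -> None:
    """Built-in writer, so the registry is never empty."""
    rows = list(rows)
    if not rows:
        return
    writer = csv.DictWriter(out, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)


def discover_writers(group: str = GROUP) -> Dict[str, Writer]:
    """Return the built-in writers plus any registered by installed plugins."""
    writers: Dict[str, Writer] = {"csv": write_csv}
    eps = entry_points()
    if hasattr(eps, "select"):
        # Python 3.10+ exposes a selectable collection.
        selected = eps.select(group=group)
    else:
        # Python 3.8 and 3.9 return a plain dict keyed by group name.
        selected = eps.get(group, [])
    for ep in selected:
        # ep.load() imports the plugin module and returns the referenced object.
        writers[ep.name] = ep.load()
    return writers


if __name__ == "__main__":
    buffer = io.StringIO()
    discover_writers()["csv"]([{"a": 1, "b": 2}], buffer)
    print(buffer.getvalue())
```

With this layout the core package needs no parquet dependency: a format only shows up in the registry once the package that provides it is installed.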
Right now we only support CSV, which is what the portal provides. We could convert to other file formats (parquet, avro) on the fly for easier processing later.
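
As a rough illustration of converting on the fly, the sketch below streams a CSV file into a parquet file in fixed-size batches so that the whole download never has to sit in memory. It assumes `pyarrow` is installed as an optional dependency, stores every column as a string for simplicity, and does not address partitioning; the function names and the batch size are hypothetical.

```python
import csv
from itertools import islice
from typing import Iterable, Iterator, List, TextIO


def iter_batches(rows: Iterable, batch_size: int) -> Iterator[List]:
    """Yield consecutive lists of at most batch_size items."""
    iterator = iter(rows)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def csv_to_parquet(csv_file: TextIO, parquet_path: str, batch_size: int = 50_000) -> int:
    """Stream CSV rows into a parquet file and return the number of rows written."""
    # Imported lazily so that pyarrow stays an optional dependency (assumption).
    import pyarrow as pa
    import pyarrow.parquet as pq

    reader = csv.DictReader(csv_file)
    if reader.fieldnames is None:
        raise ValueError("CSV input has no header row")
    # Every column is written as a string; type inference is left out of this sketch.
    schema = pa.schema([(name, pa.string()) for name in reader.fieldnames])
    total = 0
    with pq.ParquetWriter(parquet_path, schema) as writer:
        for batch in iter_batches(reader, batch_size):
            writer.write_table(pa.Table.from_pylist(batch, schema=schema))
            total += len(batch)
    return total
```

Each batch becomes its own row group in the output file, so the batch size also influences how efficiently the file can be read later.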