-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Python][Parquet] ParquetWriter reports incorrect schema when specified at wrong order #39241
Comments
Seems this report the reason. The If you want you can add a |
I think Parquet spec doesn't require the schema to have same order. Did I misunderstand the spec? EDIT: https://issues.apache.org/jira/browse/PARQUET-188 OK, maybe I misread the spec |
Though I linked to the issue, I am unable to actually find any mention of order in specs, however. |
Emmmm as a file format, parquet cannot reordering, it needs to maintain all schema in a file as same. So in parquet standard, it cannot reorder within a file. During library writing to parquet, we don't support reorder now because handling this in parquet writer might be tricky. So I think a |
I see one comment in a source file saying that it is ordered. Could this info be added to the API documentation? |
If you think it's confusion to not allow ordering, I think we can add this info |
Describe the bug, including details regarding any error messages, version, and platform.
When I try to write a table to a
ParquetWriter
, specifying schema in a different order gives an error:Component(s)
Parquet, Python
The text was updated successfully, but these errors were encountered: