-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Python][CI] Tests involving fastparquet are never run #37853
Comments
Context: the parquet cross-compat test appears to be failing when enabled in the conda recipe tests: #37624 (comment) And I can actually reproduce this locally, both with running our test, as with the following small snippet:
Update: the above is with the 14 dev version of pyarrow. When writing the file with 13.0 or older, the roundtrip is actually perfectly fine. So this might be related to change to use RLE by default for boolean values: #36955 |
Yes, so it's the different encoding that makes fastparquet fail:
This is getting a bit off-topic for this issue, but maybe that's a good argument to actually do run those tests on our CI, then we would have noticed this compat issue earlier. Opened dask/fastparquet#884 on the fastparquet side |
Do we still care about this or should we close this issue? |
We have a
fastparquet
pytest marker for tests that require fastparquet, and we have two such tests, but from a search in our code base, it doesn't seem that we include fastparquet anywhere in one of our CI test builds.The two tests are:
test_fastparquet_cross_compatibility
in the parquet tests, added in https://issues.apache.org/jira/browse/ARROW-6683 (I know that pandas has similar cross compat tests)test_fastparquet_read_with_hdfs
in test_hdfs.py: ensuring fastparquet can use our HDFS filesystem -> this is something that fsspec / fastparquet can test themselves I think, and this is also in our legacy HDFS tests that we will remove once removing the legacy HDFS bindingsGiven the above, we could also consider removing the tests alltogether (although adding fastparquet in one of the nightly builds should also be easy)
The text was updated successfully, but these errors were encountered: