-
Notifications
You must be signed in to change notification settings - Fork 244
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] test_array_intersect failed in databricks 10.4 runtime and Spark 3.3+ #6208
Comments
This also failed in other CI jobs (non databricks) for 22.08 or 22.10. |
confirmed the failure is in Spark 3.3.0 |
I am seeing a related failure in our Yarn test for
|
The array_intersect test failure looks like a bug in array_intersect that was introduced in Spark 3.3.0. Here is an example of a test failure with this strange output on the CPU:
It probably came from here: apache/spark#34077 I'm guessing Databricks might have pulled in the same "fix" I will look into the array_union issue but that looks like a separate issue from this one. |
Filed https://issues.apache.org/jira/browse/SPARK-39976
|
Hmm, I replicated the test scenario on the YARN cluster, and all the tests all passed with the same spark version and via YARN and UCX. This looks like some kind of ephemeral failure at this point.
|
This failure occurred again. I'm going to file a new issue to track this one.
|
Filed #6249 for the array_union failure
|
@NVnavkumar please file a follow up issue to revert the changes in #6223 when a fix goes into Spark, and then close this issue. |
Filed a followup issue #6285 to revert the changes once the fix is made in Apache Spark. |
Describe the bug
related to #5958
AssertionError: CPU and GPU list have different lengths at [0, 'sort_array(array_intersect(a, b), true)'] CPU: 1 GPU: 0�
BTW databricks 9.1 runtime works fine.
The text was updated successfully, but these errors were encountered: