Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[SPARK-48751][INFRA][PYTHON][TESTS] Re-balance `pyspark-pandas-connec…
…t` tests on GA ### What changes were proposed in this pull request? The pr aims to `re-balance` `pyspark-pandas-connect` tests on `GA`. ### Why are the changes needed? Make the execution cost time of `pyspark-pandas-connect-part[0-3]` testing to a relatively average level, avoiding the occurrence of long tails and resulting in higher overall GA execution cost time. Here are some currently observed examples: - https://github.com/apache/spark/pull/47135/checks?check_run_id=26784966983 <img width="311" alt="image" src="https://github.com/apache/spark/assets/15246973/45d627bc-f0e7-4a76-bfd5-edc6e821e427"> Most of them are around `1 hour`, but `part2` cost `1h 49m`, `part3` cost `2h 16m` - https://github.com/panbingkun/spark/actions/runs/9693237300 <img width="296" alt="image" src="https://github.com/apache/spark/assets/15246973/6837622a-3ff3-42d7-9725-e548c161277e"> Most of them are around `1 hour`, but `part2` cost `1h 47m`, `part3` cost `2h 20m` ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Manually observing the cost time of `pyspark-pandas-connect-part[0-3]`. ### Was this patch authored or co-authored using generative AI tooling? No. Closes #47137 from panbingkun/split_pyspark_tests_to_5. Authored-by: panbingkun <panbingkun@baidu.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
- Loading branch information