test(python): Reorganize benchmark test folder #6695

stinodego · 2023-02-05T23:38:18Z

Related to #6364

Changes:

- Added a threshold for the H2O AI benchmark - strings test.
  - I figured, if there is no threshold set, there is not much use in running the benchmark at all.
- Made other benchmark tests into proper pytest tests and marked them as benchmark tests.
  - These are now run as part of the benchmark workflow, and their run durations are reported in the logs.
- Renamed the folder `db-benchmark` to `benchmark` and added some documentation to the benchmark script.
- Renamed `docs/run_doc_examples.py` to `docs/run_doctest.py`.
- Added a `README.md` to the `py-polars/tests` folder, and linked to it in the CONTRIBUTING guide. Feels good to make a bunch of this knowledge explicit to future contributors ☺️
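
As a rough illustration of the "threshold" idea above, here is a minimal sketch of a timing check written as a pytest-style test function. Everything in it is an assumption for illustration: `THRESHOLD_SECONDS`, `run_workload`, and `timed` are hypothetical names, the workload is a stand-in rather than the H2O AI benchmark, and the actual threshold value used in this PR is not shown in this thread. In the real suite such a function would presumably also carry the benchmark marker mentioned above.

```python
import time

# Hypothetical threshold; the real value used in the PR is not shown here.
THRESHOLD_SECONDS = 5.0


def run_workload() -> int:
    """Stand-in workload: sort a modest list of strings."""
    words = [f"id{i % 1000:04d}" for i in range(100_000)]
    return len(sorted(words))


def timed(func):
    """Return (result, elapsed seconds) for a single call of func."""
    start = time.perf_counter()
    result = func()
    return result, time.perf_counter() - start


def test_workload_under_threshold() -> None:
    # Without the threshold assertion, the run would only produce a timing
    # and could never fail on a performance regression.
    result, elapsed = timed(run_workload)
    assert result == 100_000
    assert elapsed < THRESHOLD_SECONDS, f"took {elapsed:.2f}s"
```

The function name starts with `test_`, so pytest would collect it as-is; calling it directly works too.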

stinodego · 2023-02-05T23:47:13Z

py-polars/tests/unit/test_series.py

-def test_mean_overflow() -> None:
-    arr = np.array([255] * (1 << 17), dtype="int16")
-    assert arr.mean() == 255.0


This doesn't test anything Polars-related, so I removed it.

It does! It was testing that the mean doesn't overflow. A naive sum/count would overflow the i16.

Hmm, but it's just numpy code though? 🤔 We don't rely on this to calculate our mean, right?

Reverted this removal, but I still don't understand why this test is in there!
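
To make the overflow argument concrete, here is a minimal sketch in plain Python. It is not how numpy or Polars compute a mean; `wrap_int16`, `naive_mean_int16`, and `widened_mean` are hypothetical helpers that only simulate what happens when the running sum is kept in a signed 16-bit integer versus a wider type.

```python
def wrap_int16(value: int) -> int:
    """Wrap an integer into the signed 16-bit range [-32768, 32767]."""
    return (value + 2**15) % 2**16 - 2**15


def naive_mean_int16(values: list[int]) -> float:
    """Mean computed as sum/count with the sum held in a simulated int16."""
    total = 0
    for v in values:
        total = wrap_int16(total + v)  # the accumulator silently wraps around
    return total / len(values)


def widened_mean(values: list[int]) -> float:
    """Mean computed with a float accumulator, which does not wrap."""
    return sum(float(v) for v in values) / len(values)


# Same data as the test under discussion: 2**17 copies of 255.
values = [255] * (1 << 17)

naive = naive_mean_int16(values)  # 255 * 2**17 is a multiple of 2**16, so the sum wraps to 0
safe = widened_mean(values)

print(naive)  # 0.0
print(safe)   # 255.0
```

The true sum is 255 * 131072 = 33,423,360, far outside the int16 range, which is why the accumulator type matters here.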

ritchie46

The slow tests were explicitly run in the benchmark folder because Polars is compiled as a release build there, so we can test really slow code.

py-polars/tests/db-benchmark/lazy_vs_eager.py

py-polars/tests/db-benchmark/various.py

stinodego · 2023-02-06T11:29:12Z

Ah interesting, learned a lot from your comments 😄 I'm gonna revert a lot of this stuff and add a bit of documentation in that folder on what's going on there.

ritchie46 · 2023-02-06T16:05:40Z

Yeah, maybe the ideal would be to have those loose files also converted to pytest functions, and to run those "release-build" tests only in CI once the benchmark has finished. That same binary is then used for those tests.

stinodego · 2023-02-08T21:16:06Z

> Yeah, maybe the ideal would be to have those loose files also converted to pytest functions, and to run those "release-build" tests only in CI once the benchmark has finished. That same binary is then used for those tests.

Done, and documented in the new Test Suite README 😄

ritchie46

Nice cleanup. Especially formalizing the release tests is a good one.

I hope we can add the full TPCH benchmarks in the future.

ritchie46 · 2023-02-09T08:42:42Z

py-polars/tests/benchmark/test_release.py

@@ -0,0 +1,148 @@
+"""


Nice, that's cleaner indeed. :)

alexander-beedie · 2023-02-09T10:36:51Z

Really nice.

github-actions bot added the `python` (Related to Python Polars) and `test` (Related to the test suite) labels Feb 5, 2023

ritchie46 reviewed Feb 6, 2023

stinodego marked this pull request as draft February 6, 2023 11:35

stinodego force-pushed the test-organization branch from c2fbc7b to d52fc8b Compare February 8, 2023 01:33

Rename doctest folder

0657920

stinodego force-pushed the test-organization branch from d6dae24 to 85a950a Compare February 8, 2023 19:33

stinodego added 5 commits February 8, 2023 22:00

Rename and restructure benchmark folder

2da12e4

Update other code

288e6d4

Driveby fixture change

6436ea1

Add hypothesis marker

ab89ca0

Add README to tests folder

8c4ce49

stinodego force-pushed the test-organization branch from 85a950a to 8c4ce49 Compare February 8, 2023 21:00

Add reference in the contributing guide

ca2e1a1

stinodego marked this pull request as ready for review February 8, 2023 21:05

stinodego requested a review from ritchie46 February 8, 2023 21:15

ritchie46 approved these changes Feb 9, 2023

ritchie46 merged commit 0a1c1bc into pola-rs:master Feb 9, 2023

Vincenthays pushed a commit to Vincenthays/polars that referenced this pull request Feb 9, 2023

test(python): Reorganize benchmark test folder (pola-rs#6695)

2dee8fa

stinodego deleted the test-organization branch February 22, 2023 18:34
