refactor(tpc): add tpc-ds tests #9467

cpcloud · 2024-06-28T14:10:46Z

First 27 TPC-DS queries running against DuckDB, Trino, Snowflake, and DataFusion, sans the ones that requires ROLLUP.

More fail on DataFusion than the others. Those are marked with appropriate xfail marker.

cpcloud · 2024-07-01T16:07:32Z

If this PR is too large I can split up the things into a separate PR for each backend, and disable the runs until the refactoring is done.

gforsyth · 2024-07-01T16:24:50Z

Nah, I'm most of the way through reviewing it

gforsyth

I think the changeset here is good -- do we want to keep the tpch and tpc-ds snapshots around? Since we're comparing results against hand-written SQL, I don't know that we're getting much out of them

cpcloud · 2024-07-01T17:51:32Z

They're gone.

cpcloud · 2024-07-01T17:53:11Z

The empty result sets are questionable, but I couldn't find a SF (up to sf=10) where all queries were non-empty. I believe that q17 is supposed to empty, based on the fact that it's always empty for the larger scale factors.

So, basically what I did was selectively allow empty queries in the tpc_test marker.

gforsyth · 2024-07-01T17:58:24Z

I believe that q17 is supposed to empty, based on the fact that it's always empty for the larger scale factors.

lol, I have an optimization for O(1) compute for query 17 at any scale factor

gforsyth · 2024-07-01T19:09:11Z

snowflake is passing:

🐚 pytest -m snowflake ibis/backends/tests/tpc/ds/test_queries.py -v
================================ test session starts ================================
platform linux -- Python 3.10.14, pytest-8.2.2, pluggy-1.5.0 -- /nix/store/1lj814h1vbfi6p7m6vr2rvcvkdqxc426-python3-3.10.14-env/bin/python3.10
cachedir: .pytest_cache
hypothesis profile 'dev' -> deadline=None, max_examples=50, suppress_health_check=[HealthCheck.too_slow], database=DirectoryBasedExampleDatabase(PosixPath('/home/gil/github.com/ibis-project/ibis/.hypothesis/examples'))
Using --randomly-seed=3164630463
benchmark: 4.0.0 (defaults: timer=time.perf_counter disable_gc=False min_rounds=5 min_time=0.000005 max_time=1.0 calibration_precision=10 warmup=False warmup_iterations=100000)
rootdir: /home/gil/github.com/ibis-project/ibis
configfile: pyproject.toml
plugins: hypothesis-6.104.2, snapshot-0.9.0, anyio-4.4.0, randomly-3.15.0, mock-3.14.0, benchmark-4.0.0, timeout-2.3.1, cov-5.0.0, repeat-0.9.3, clarity-1.0.1, pytest_httpserver-1.0.10, xdist-3.6.1
collected 540 items / 513 deselected / 27 selected                                  

ibis/backends/tests/tpc/ds/test_queries.py::test_15[snowflake] PASSED    [  3%]
ibis/backends/tests/tpc/ds/test_queries.py::test_04[snowflake] PASSED    [  7%]
ibis/backends/tests/tpc/ds/test_queries.py::test_07[snowflake] PASSED    [ 11%]
ibis/backends/tests/tpc/ds/test_queries.py::test_23[snowflake] XFAIL     [ 14%]
ibis/backends/tests/tpc/ds/test_queries.py::test_03[snowflake] PASSED    [ 18%]
ibis/backends/tests/tpc/ds/test_queries.py::test_20[snowflake] PASSED    [ 22%]
ibis/backends/tests/tpc/ds/test_queries.py::test_24[snowflake] PASSED    [ 25%]
ibis/backends/tests/tpc/ds/test_queries.py::test_18[snowflake] XFAIL     [ 29%]
ibis/backends/tests/tpc/ds/test_queries.py::test_10[snowflake] PASSED    [ 33%]
ibis/backends/tests/tpc/ds/test_queries.py::test_16[snowflake] PASSED    [ 37%]
ibis/backends/tests/tpc/ds/test_queries.py::test_26[snowflake] PASSED    [ 40%]
ibis/backends/tests/tpc/ds/test_queries.py::test_05[snowflake] XFAIL     [ 44%]
ibis/backends/tests/tpc/ds/test_queries.py::test_17[snowflake] PASSED    [ 48%]
ibis/backends/tests/tpc/ds/test_queries.py::test_02[snowflake] PASSED    [ 51%]
ibis/backends/tests/tpc/ds/test_queries.py::test_08[snowflake] PASSED    [ 55%]
ibis/backends/tests/tpc/ds/test_queries.py::test_12[snowflake] PASSED    [ 59%]
ibis/backends/tests/tpc/ds/test_queries.py::test_01[snowflake] PASSED    [ 62%]
ibis/backends/tests/tpc/ds/test_queries.py::test_25[snowflake] PASSED    [ 66%]
ibis/backends/tests/tpc/ds/test_queries.py::test_27[snowflake] PASSED    [ 70%]
ibis/backends/tests/tpc/ds/test_queries.py::test_21[snowflake] PASSED    [ 74%]
ibis/backends/tests/tpc/ds/test_queries.py::test_13[snowflake] PASSED    [ 77%]
ibis/backends/tests/tpc/ds/test_queries.py::test_06[snowflake] PASSED    [ 81%]
ibis/backends/tests/tpc/ds/test_queries.py::test_22[snowflake] XFAIL     [ 85%]
ibis/backends/tests/tpc/ds/test_queries.py::test_09[snowflake] PASSED    [ 88%]
ibis/backends/tests/tpc/ds/test_queries.py::test_19[snowflake] PASSED    [ 92%]
ibis/backends/tests/tpc/ds/test_queries.py::test_11[snowflake] PASSED    [ 96%]
ibis/backends/tests/tpc/ds/test_queries.py::test_14[snowflake] XFAIL     [100%]

============= 22 passed, 513 deselected, 5 xfailed in 124.88s (0:02:04) =============

gforsyth

Boy do I not understand waht asceding is about, but it's clearly intentional.

This looks good, and makes it easy for other contributors to tackle individual tpc-ds queries.

cpcloud added this to the 9.2 milestone Jun 28, 2024

cpcloud added the tests Issues or PRs related to tests label Jun 28, 2024

cpcloud requested review from jcrist and gforsyth June 28, 2024 14:10

cpcloud force-pushed the tpc-ds-tests branch 5 times, most recently from 22a07b6 to 53747a4 Compare June 29, 2024 12:42

cpcloud added datafusion The Apache DataFusion backend duckdb The DuckDB backend snowflake The Snowflake backend trino The Trino backend labels Jun 29, 2024

gforsyth reviewed Jul 1, 2024

View reviewed changes

refactor(tpc): add tpc-ds tests

ae57b1e

cpcloud force-pushed the tpc-ds-tests branch from 53747a4 to ae57b1e Compare July 1, 2024 17:22

gforsyth approved these changes Jul 1, 2024

View reviewed changes

gforsyth merged commit d2dff68 into ibis-project:main Jul 1, 2024
80 checks passed

cpcloud deleted the tpc-ds-tests branch July 1, 2024 23:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(tpc): add tpc-ds tests #9467

refactor(tpc): add tpc-ds tests #9467

cpcloud commented Jun 28, 2024 •

edited

Loading

cpcloud commented Jul 1, 2024

gforsyth commented Jul 1, 2024

gforsyth left a comment

cpcloud commented Jul 1, 2024

cpcloud commented Jul 1, 2024 •

edited

Loading

gforsyth commented Jul 1, 2024

gforsyth commented Jul 1, 2024

gforsyth left a comment

refactor(tpc): add tpc-ds tests #9467

refactor(tpc): add tpc-ds tests #9467

Conversation

cpcloud commented Jun 28, 2024 • edited Loading

cpcloud commented Jul 1, 2024

gforsyth commented Jul 1, 2024

gforsyth left a comment

Choose a reason for hiding this comment

cpcloud commented Jul 1, 2024

cpcloud commented Jul 1, 2024 • edited Loading

gforsyth commented Jul 1, 2024

gforsyth commented Jul 1, 2024

gforsyth left a comment

Choose a reason for hiding this comment

cpcloud commented Jun 28, 2024 •

edited

Loading

cpcloud commented Jul 1, 2024 •

edited

Loading