refactor(benchmarks): remove pandas benchmarking and replace with more-representative duckdb version #8322

cpcloud · 2024-02-12T21:37:38Z

This PR removes the pandas execution benchmarks, whose performance has gotten much
worse after the `the-epic-split` merge.

IMO, there are good reasons to avoid benchmarking backend execution in Ibis,
the main one being that execution is not a part of Ibis that we can control.

That isn't true of the pandas backend, but since the DuckDB backend can now do
everything the pandas backend could do after merging `the-epic-split`,
including `asof_join`, we are not going to spend any time improving the
performance of the pandas backend.

I would actually vote to remove all execution benchmarks here, and keep only the
ones that benchmark expression compilation and construction.

refactor(benchmarks): remove pandas benchmarking and replace with more-representative duckdb version

8c6d1c1

cpcloud added this to the 9.0 milestone Feb 12, 2024

cpcloud added the refactor and performance labels Feb 12, 2024

cpcloud requested a review from kszucs February 12, 2024 21:37

kszucs approved these changes Feb 12, 2024

kszucs merged commit e540575 into ibis-project:main Feb 12, 2024
76 of 78 checks passed
