refactor: logical op constructor+builder boundary #3684

kevinzwang · 2025-01-14T23:58:40Z

The problem

Plan ops are created for various reasons through our code - from our dataframe or sql interfaces, to optimization rules, to even op constructors themselves which can sometimes create other ones. All of these cases generally go through the same new/try_new constructor for each op, which tries to accommodate all of these use cases. This creates complexity, adds unnecessary compute to planning time, and also conflates user input errors with Daft internal errors.

For example, I don't expect any optimization rules to create unresolved expressions, expression resolution should only be done for the builder. Another example is the Join op, where inputs such as join_prefix and join_suffix are only applicable for renaming columns, which should also only happen via the builder. We recently added another initializer to some ops for that reason, but it bypasses the validation that is typically done and is not standardized across ops.

My solution

Every op should provide a try_new constructor which contain explicit checks for all the requirements about the op's state (one example would be that all expression columns exist in the schema), but otherwise should simply put those values into the struct without any modification and return it.

Functions such as LogicalPlan::with_new_children will just call try_new.
Other constructors/helpers may exist that explicitly provide additional functionality and ultimately call try_new. E.g. a Join::rename_right_columns to rename the right side columns that conflict with the left side, called to update the right side schema before calling try_new.
User input normalization, such as expression resolution, should be handled by the logical plan builder. After the logical plan op has been constructed, everything should be in a valid state from there on.

codecov · 2025-01-15T01:24:26Z

Codecov Report

Attention: Patch coverage is 98.34025% with 4 lines in your changes missing coverage. Please review.

Project coverage is 77.79%. Comparing base (feab49a) to head (9304075).
Report is 11 commits behind head on main.

Files with missing lines	Patch %	Lines
src/daft-dsl/src/expr/mod.rs	60.00%	2 Missing ⚠️
src/daft-logical-plan/src/ops/filter.rs	85.71%	1 Missing ⚠️
src/daft-logical-plan/src/ops/join.rs	97.36%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3684      +/-   ##
==========================================
- Coverage   77.82%   77.79%   -0.03%     
==========================================
  Files         728      732       +4     
  Lines       89919    90457     +538     
==========================================
+ Hits        69975    70368     +393     
- Misses      19944    20089     +145

Files with missing lines	Coverage Δ
src/daft-dsl/src/lib.rs	`100.00% <ø> (ø)`
src/daft-dsl/src/python.rs	`91.07% <ø> (-0.03%)`	⬇️
src/daft-logical-plan/src/builder/mod.rs	`92.71% <100.00%> (ø)`
src/daft-logical-plan/src/builder/resolve_expr.rs	`89.20% <100.00%> (ø)`
src/daft-logical-plan/src/builder/tests.rs	`100.00% <ø> (ø)`
src/daft-logical-plan/src/display.rs	`98.06% <100.00%> (ø)`
src/daft-logical-plan/src/lib.rs	`100.00% <100.00%> (ø)`
src/daft-logical-plan/src/logical_plan.rs	`74.27% <100.00%> (+0.63%)`	⬆️
...rc/daft-logical-plan/src/ops/actor_pool_project.rs	`36.73% <100.00%> (-5.86%)`	⬇️
src/daft-logical-plan/src/ops/agg.rs	`62.50% <100.00%> (-5.20%)`	⬇️
... and 15 more

... and 40 files with indirect coverage changes

kevinzwang · 2025-01-15T01:37:21Z

Note to reviewers: Join has been pretty broken and I don't want to change that behavior in this PR. This is going to be directly followed by PR to fix various join issues, so don't worry too much about any bugs with join you find in this PR.

codspeed-hq · 2025-01-15T01:46:00Z

CodSpeed Performance Report

Merging #3684 will degrade performances by 34.92%

_{Comparing kevin/logical-plan-builder-refactor (9304075) with main (f97902a)}

Summary

⚡ 4 improvements
❌ 1 regressions
✅ 22 untouched benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

	Benchmark	`main`	`kevin/logical-plan-builder-refactor`	Change
⚡	`test_count[1 Small File]`	3.7 ms	3.3 ms	+11.75%
⚡	`test_iter_rows_first_row[100 Small Files]`	214.8 ms	186.2 ms	+15.4%
❌	`test_show[100 Small Files]`	15.7 ms	24.1 ms	-34.92%
⚡	`test_tpch[1-in-memory-native-2]`	106.6 ms	96.4 ms	+10.6%
⚡	`test_tpch_sql[1-in-memory-native-2]`	226.9 ms	204.7 ms	+10.87%

universalmind303

overall looks good. I do think a builder pattern may be better suited for Join in the long term though. Right now we have 8 arguments to the constructor.

Something like this I think would be a bit more intuitive

JoinBuilder::new(left, right)
  .left_on(left_on)
  .right_on(right_on)
  // could also add a `.on(join_keys)` instead of needing to always supply left and right keys. 
  .join_type(join_type)
  .join_suffix(suffix) // optional
  .join_prefix(prefix) // optional
  .rename_right_columns(true) // optional
  .keep_join_keys(true)  // optional
  .finish()

rchowell · 2025-01-15T19:36:21Z

src/daft-logical-plan/src/builder.rs

@@ -188,11 +188,19 @@ impl LogicalPlanBuilder {
    }

    pub fn select(&self, to_select: Vec<ExprRef>) -> DaftResult<Self> {
+        let expr_resolver = ExprResolver::builder().allow_actor_pool_udf(true).build();


Out of curiosity, why does each (or some) method create their own expr_resolver? Maybe the builder could hold a single expr_resolver for its schema, then resolve_single takes a single arg and closes over self.schema().

But there may be more to this I am not familiar with yet, just curious.

Each logical op may resolve expressions differently. For example, Agg expects expressions to be aggregation expressions, whereas Project expects no aggregation expressions. Take a look at the parameters to the ExprResolver builder!

rchowell · 2025-01-15T19:38:48Z

Some general comments as I familiarize myself with this.

expression resolution should only be done for the builder

Agreed, rewrites/transforms f on the logical domain L so f(L) -> L shouldn't create unresolved expressions (afaik) – once resolved to L you don't leave L.

Every op should provide a try_new constructor which contain explicit checks for all the requirements about the op's state

Do you anticipate every logical operator having both a try_new and a builder? The builders may need to perform checks incrementally as well as on the final .build().

kevinzwang · 2025-01-15T19:53:31Z

@universalmind303 @rchowell I'm open to having individual builders for logical ops, but I'm not sure yet how that would fit into our current structure.

We already have a LogicalPlanBuilder, which is the interface that our external APIs use to construct logical plans. It would not make much sense to create a builder abstraction for each op but have, say, our dataframe API still create joins by passing in all the arguments to builder.join(...) -- we would probably want to expose the op builder directly somehow.

We should probably think about this on a case-by-case basis. Most ops are pretty basic and do not require builders. Moreover, I'm not super happy about this current LogicalPlanBuilder abstraction either, it tries to work for both dataframe and sql but is becoming a little unwieldy.

kevinzwang

Did a bit of cleanup along with this PR:

moved resolve_expr to inside builder because we don't want anything other than the builder to use it anymore
removed various .context(CreationSnafu)? because conversion to logical_plan::Result is actually done automatically for DaftResult if using ?, so this is not needed

chore: refactor logical op constructor+builder boundary

562bc4e

github-actions bot added the chore label Jan 14, 2025

kevinzwang changed the title ~~chore: refactor logical op constructor+builder boundary~~ refactor: logical op constructor+builder boundary Jan 15, 2025

github-actions bot added refactor and removed chore labels Jan 15, 2025

fix tests

78e8ae8

kevinzwang requested review from universalmind303, rchowell and jaychia January 15, 2025 01:07

fix unnest subquery

30e9a81

universalmind303 approved these changes Jan 15, 2025

View reviewed changes

rchowell reviewed Jan 15, 2025

View reviewed changes

kevinzwang added 2 commits January 15, 2025 17:53

refactor other ops + move resolve_expr

49e6896

remove asdf

c153f1e

kevinzwang commented Jan 16, 2025

View reviewed changes

kevinzwang added 2 commits January 15, 2025 18:00

remove unused dep

01d6388

fix test

9304075

kevinzwang requested review from rchowell and universalmind303 January 16, 2025 18:37

universalmind303 approved these changes Jan 16, 2025

View reviewed changes

kevinzwang merged commit 3720c2a into main Jan 16, 2025
42 of 43 checks passed

kevinzwang deleted the kevin/logical-plan-builder-refactor branch January 16, 2025 19:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: logical op constructor+builder boundary #3684

refactor: logical op constructor+builder boundary #3684

kevinzwang commented Jan 14, 2025 •

edited

Loading

codecov bot commented Jan 15, 2025 •

edited

Loading

kevinzwang commented Jan 15, 2025

codspeed-hq bot commented Jan 15, 2025 •

edited

Loading

universalmind303 left a comment

rchowell Jan 15, 2025

kevinzwang Jan 15, 2025

rchowell commented Jan 15, 2025

kevinzwang commented Jan 15, 2025 •

edited

Loading

kevinzwang left a comment

refactor: logical op constructor+builder boundary #3684

refactor: logical op constructor+builder boundary #3684

Conversation

kevinzwang commented Jan 14, 2025 • edited Loading

The problem

My solution

codecov bot commented Jan 15, 2025 • edited Loading

Codecov Report

kevinzwang commented Jan 15, 2025

codspeed-hq bot commented Jan 15, 2025 • edited Loading

CodSpeed Performance Report

Merging #3684 will degrade performances by 34.92%

Summary

Benchmarks breakdown

universalmind303 left a comment

Choose a reason for hiding this comment

rchowell Jan 15, 2025

Choose a reason for hiding this comment

kevinzwang Jan 15, 2025

Choose a reason for hiding this comment

rchowell commented Jan 15, 2025

kevinzwang commented Jan 15, 2025 • edited Loading

kevinzwang left a comment

Choose a reason for hiding this comment

kevinzwang commented Jan 14, 2025 •

edited

Loading

codecov bot commented Jan 15, 2025 •

edited

Loading

codspeed-hq bot commented Jan 15, 2025 •

edited

Loading

kevinzwang commented Jan 15, 2025 •

edited

Loading