move functionality out of PhysicalPlanBuilder into other classes and add tests #405

dguy · 2017-10-23T17:17:10Z

No description provided.

hjafarpour · 2017-10-23T17:51:12Z

@dguy you have moved physical-layer functionality into the logical layer. We want to keep the logical plan phase independent of the physical plan phase.

apurvam · 2017-10-24T01:15:43Z

@hjafarpour, not sure I follow your point. I understand that the PlanNode hierarchy was the logical layer, and this was converted to a stream topology by the physical plan builder.

However, I have been looking at the physical plan builder code, and it really is quite complicated. In particular, the recursive calls into kafkaStreamsDsl are quite hard to follow. This PR seems to disentangle some of that by adding a buildStream method to each PlanNode type, which builds the DSL for that particular type of node.

While this does mix the strictly logical PlanNode with the physical DSL, it does so through a clean boundary (i.e. the buildStream method).
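To make the shape of this proposal concrete, here is a minimal sketch of a per-node buildStream method. Only the names PlanNode and buildStream come from this thread; the signature, the SketchStream stand-in, and the SourceNode and FilterNode classes are hypothetical and are not taken from the PR. A plain list of step descriptions stands in for the Kafka Streams DSL so that the example is self-contained.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for the physical stream being assembled.
// The real code targets the Kafka Streams DSL; this only records steps.
final class SketchStream {
    private final List<String> steps;

    SketchStream(List<String> steps) {
        this.steps = Collections.unmodifiableList(new ArrayList<>(steps));
    }

    // Returns a new stream with one more step appended.
    SketchStream then(String step) {
        List<String> next = new ArrayList<>(steps);
        next.add(step);
        return new SketchStream(next);
    }

    List<String> steps() {
        return steps;
    }
}

// Each node type knows how to build its own part of the stream.
abstract class PlanNode {
    abstract SketchStream buildStream();
}

// Hypothetical leaf node: reads from a topic.
final class SourceNode extends PlanNode {
    private final String topic;

    SourceNode(String topic) {
        this.topic = topic;
    }

    @Override
    SketchStream buildStream() {
        return new SketchStream(new ArrayList<>()).then("source(" + topic + ")");
    }
}

// Hypothetical inner node: builds its child first, then adds its own step.
final class FilterNode extends PlanNode {
    private final PlanNode source;
    private final String predicate;

    FilterNode(PlanNode source, String predicate) {
        this.source = source;
        this.predicate = predicate;
    }

    @Override
    SketchStream buildStream() {
        return source.buildStream().then("filter(" + predicate + ")");
    }
}
```

In this sketch, calling buildStream on the root node recurses through the children, so the builder itself no longer needs a case for every node type. The cost is the one raised above: the node classes now reference the physical stream type.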

The proposed structure makes things a bit easier to read and reason about. More importantly, when we add more types of PlanNode in the future, the logic would also be compartmentalized in those new nodes. It would result in a simpler PhysicalPlanBuilder over time. So there are many benefits.

Is it really that important that the logical and physical layers are separated in the previous way? What are the advantages of that approach in your mind?

hjafarpour · 2017-10-24T04:35:06Z

@apurvam yes, keeping the logical plan independent of the physical plan layer is essential, since in the logical layer we should not care how the plan is implemented in the destination physical layer. Our default physical layer is Kafka Streams at the moment, but in the future we will also consider other execution contexts such as Connect or third-party systems. So it is essential to keep any dependency on the underlying physical plan out of the logical plan layer and keep that layer abstract.

dguy · 2017-10-24T09:28:05Z

I (obviously) agree with @apurvam. This makes the code easier to reason about. The logic is actually in the classes it needs to be in, rather than all dumped in the PhysicalPlanBuilder.
As for the future... Well, if we had some interfaces then this would not be a problem. Maybe we'll get there, maybe we won't. I don't think a possible future should hold this up, as it is an improvement in readability and testability.

hjafarpour · 2017-10-24T14:57:21Z

I totally agree with the readability and testability argument, but we would be violating a more important principle here, which is keeping the logical layer independent of the physical layer. I would suggest breaking up the PhysicalPlanBuilder by creating new classes in the same layer instead of spilling the physical implementation code into the abstraction layer. All data management systems have kept their logical planning code independent of implementation details and we should not violate this principle.
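One way to read this suggestion is a visitor that lives in the physical layer while the plan nodes stay free of physical code. The sketch below is only an illustration under that assumption; the thread does not say which structure was intended, and every name in it (PlanVisitor, LogicalNode, LogicalSource, LogicalFilter, StreamsDescriptionBuilder) is hypothetical. A string description stands in for the real stream topology.

```java
// Generic visitor: the logical layer depends only on this interface,
// not on any particular execution backend.
interface PlanVisitor<R> {
    R visitSource(LogicalSource node);
    R visitFilter(LogicalFilter node);
}

// Logical layer: nodes hold data and dispatch to a visitor.
interface LogicalNode {
    <R> R accept(PlanVisitor<R> visitor);
}

final class LogicalSource implements LogicalNode {
    final String topic;

    LogicalSource(String topic) {
        this.topic = topic;
    }

    @Override
    public <R> R accept(PlanVisitor<R> visitor) {
        return visitor.visitSource(this);
    }
}

final class LogicalFilter implements LogicalNode {
    final LogicalNode source;
    final String predicate;

    LogicalFilter(LogicalNode source, String predicate) {
        this.source = source;
        this.predicate = predicate;
    }

    @Override
    public <R> R accept(PlanVisitor<R> visitor) {
        return visitor.visitFilter(this);
    }
}

// Physical layer: one visitor per execution backend. This one only
// produces a textual description instead of a real stream topology.
final class StreamsDescriptionBuilder implements PlanVisitor<String> {
    @Override
    public String visitSource(LogicalSource node) {
        return "source(" + node.topic + ")";
    }

    @Override
    public String visitFilter(LogicalFilter node) {
        // Build the child first, then append this node's step.
        return node.source.accept(this) + " -> filter(" + node.predicate + ")";
    }
}
```

With this layout a second backend would be another PlanVisitor implementation and the node classes would not change. The trade-off is that adding a new node type means touching the visitor interface and every implementation of it.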

dguy · 2017-10-24T15:43:17Z

As it is in this PR, if we add new extensions of PlanNode we only have to add one class that encapsulates the behaviour. If we do it another way, we need to add multiple classes every time we change something. IMO, this is not a good design. If we put the logic with the data we can just add a single class and it works.
As it stands, there is no abstraction layer; it is just a bunch of data containers that do nothing.

hjafarpour

LGTM.

bluemonk3y

leggit

move functionality out of PPB into other classes

972a689

dguy requested review from bluemonk3y and hjafarpour October 23, 2017 17:17

hjafarpour suggested changes Oct 23, 2017

hjafarpour approved these changes Oct 30, 2017

bluemonk3y approved these changes Oct 31, 2017

dguy merged commit 93dba91 into confluentinc:4.0.x Oct 31, 2017

dguy deleted the physical-plan-builder branch October 31, 2017 11:34
