-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Simplify Expr::map_children #9876
Simplify Expr::map_children #9876
Conversation
This is now ready for review. cc @alamb, @berkaysynnada |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @peter-toth -- I think this looks quite nice and easier to undertand
I left some suggestions on comments / naming that might help, but I also think we could do them as a follow on PR
datafusion/common/src/tree_node.rs
Outdated
@@ -532,7 +532,7 @@ impl<T> Transformed<T> { | |||
} | |||
} | |||
|
|||
/// Transformation helper to process tree nodes that are siblings. | |||
/// Transformation helper to process sequence of iterable tree nodes that are siblings. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I found the reason / logic for continue / jump quite subtle when trying to make TreeNodeMutator. I have some suggested comments below to explain the rationale that might help
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @alamb, merged your suggestion.
datafusion/common/src/tree_node.rs
Outdated
/// Transformation helper to process sequence of tree node containing expressions. | ||
/// This macro is very similar to [TransformedIterator::map_until_stop_and_collect] to | ||
/// process nodes that are siblings, but it accepts an initial transformation and a | ||
/// sequence of pairs of an expression and its transformation. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is quite clever. I have some suggestions on naming that would have helped me:
Use F
to mirror the nomenclature of TransformedIterator
. I thought $TRANSFORMED_EXPR
was an actually expr (but it is actually a closure that is invoked I think 🤔 )
$TRANSFORMED_EXPR_0 --> F0
$TRANSFORMED_EXPR --> F.
I think it would help to document each parameter and the return value (specifically it looks like it returns Transformed<(data, ..., data)>
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, let me fix these and come back yo you.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Renamed in a36f1aa and added rustdoc to the macro. Let me know if it needs more details.
datafusion/common/src/tree_node.rs
Outdated
macro_rules! map_until_stop_and_collect { | ||
($TRANSFORMED_EXPR_0:expr, $($EXPR:expr, $TRANSFORMED_EXPR:expr),*) => {{ | ||
$TRANSFORMED_EXPR_0.and_then(|Transformed { data: data0, mut transformed, mut tnr }| { | ||
let data = ( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe we could name this something other than data
so it was clearer what type it is (maybe all_datas
?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed in a36f1aa
null_treatment, | ||
)) | ||
}), | ||
}) => map_until_stop_and_collect!( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is very clever
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
The test failure seems unrelated to this PR and looks like a non-deterministic test... |
I agree https://github.com/apache/arrow-datafusion/actions/runs/8543135777/job/23406267036?pr=9876
I restarted it |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks again @peter-toth
The CI test passed on rerun. I filed #9930 to track the non deterministic test (and we are trying to get to issue 10,000!) |
Thanks @alamb for the review! |
Which issue does this PR close?
Closes #9457, part of #8913
Rationale for this change
The current implementation of
Expr::map_children()
is very complex, this PR tries to simplify its code.What changes are included in this PR?
This PR adds a
map_until_stop_and_collect()
macro, which is similar toTransformedIterator::map_until_stop_and_collect()
to process sibling tree nodes, but it can process heterogeneous tree node containing expressions.Are these changes tested?
Yes, with existing UTs.
Are there any user-facing changes?
No.