-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Relay] Register layout conversion function to more reduce ops #9048
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
52a123a
to
84353c5
Compare
84353c5
to
1088b45
Compare
@comaniac I've updated this PR. I found a bug in |
@comaniac There is one thing that's very odd about what There is nothing that prevents reducing over |
I wasn't involved in that PR so I'm not really sure the root cause of that decision, but what you described makes sense to me. I'll merge this PR first and you could send a follow up one to fix the NCHWc -> NCHW issue. cc @anijain2305 @zhiics @yzhliu it would be good if any of you folks could help provide more background s. |
Thanks @comaniac |
* main: (37 commits) [ONNX] [Relay] Dynamic squeeze (apache#9095) [Meta Schedule][M3b] Database (apache#9061) [Bugfix] Add nullptr checking for `AttrStmt` with `coproc_uop_scope` attr key (apache#9123) [Codegen] Swap out analyzer when outlining (apache#9117) [CI] bash.sh, build.sh: add option to set the container name and hostname (apache#9110) Ensure google-mock is installed and setup (apache#9107) Arm(R) Ethos(TM)-U NPU TIR to CS for Conv2D (apache#8811) Frontend: add onnx GlobalLpPool op (apache#8845) [LLVM] Refactor MakeCallPacked, NFC (apache#9118) prevent casting handle to other types (apache#9114) fix annotation of tir generic (apache#9119) [Relay] Register layout conversion function to more reduce ops (apache#9048) Fix the missing `dtype` attribute of `tir.Shuffle` in Python level (apache#9131) add `multiply` and remove `subtract` for dnnl json runtime (apache#9120) relu of dnnl json runtime only support 4-dims input (apache#9122) [Meta Schedule][M3a] SpaceGenerator (apache#9079) [TensorIR][Bugfix] Disallow fusing loops with dependency (apache#9112) adding Jorn to reviewers list (apache#9105) [BYOC] Fix incorrect conv2d padding handling of `dnnl with c source runtime` (apache#9097) [Frontend][TFLite] fix apache#9078 (apache#9099) ...
…e#9048) * Register layout conversion function to more reduce ops * bug fix for exclude=True case, the original code compute wrong axes * properly handle variance op, which has two inputs * update test expected output
…e#9048) * Register layout conversion function to more reduce ops * bug fix for exclude=True case, the original code compute wrong axes * properly handle variance op, which has two inputs * update test expected output
This PR simply adds missing registrations of
ReduceInferCorrectLayout
, introduced in #4059, to all of reduce ops. I found that the efficientnet v2 model has manymean
op, which introduces 89layout_transform
. After this PR, there is only onelayout_transform
(at the beginning).@comaniac @anijain2305