-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug][MXNet] MXNet dot not working with non 2D tensors #11691
Comments
Can you confirm me that my solution is acceptable? If it is, I will write the tests to check combinations of different tensor's dimensions and create PR for this issue. Additionally, is there some reason for not using transpose_a in this scenario? If the reason doesn't exist, I can handle it, too. If you have some better ideas or some advices of upgrading my solution, please tell me. |
@petuca Thanks for looking into it. According to how it is fixed in the two PRs you linked, I guess the workaround is to simply return the following line of code after
PR is welcome or I can help you patch that when I am more available. |
Hi @ganler , Thank you for comment and advices. I saw your changes and took it as a start idea for solving this, but I found optimal structure with minimal conditions on this way to obtain all possible cases of tensor's dimensions. Eventually, I could swap nn.dense with nn.matmul where I could reduce my code for some extra lines. I will sent the PR tomorrow. @masahi |
Yes, @padreofthegame this issue has been fixed in PR #11760 . |
Thanks for participating in the TVM community! We use https://discuss.tvm.ai for any general usage questions and discussions. The issue tracker is used for actionable items such as feature proposals discussion, roadmaps, and bug tracking. You are always welcomed to post on the forum first 😸
Issues that are inactive for a period of time may get closed. We adopt this policy so that we won't lose track of actionable issues that may fall at the bottom of the pile. Feel free to reopen a new one if you feel there is an additional problem that needs attention when an old one gets closed.
Here we have a dot operation problem established in MXNet model with non 2D tensors.
For example here we want to do dot product of two tensors:
Looks like this bug is very similar to those reported in #10651 and in PR #11174 for ONNX and PyTorch models.
Similar error is obtained using any shape different from 2D for any of the data and weight tensors.
Expected behavior
Should be compiled by TVM, as it follows correct MXNet specification and can be executed by MXNet.
Actual behavior
When I comment the lines for checking ranks in from_mxnet.py file I got this error:
Steps to reproduce
The code above successfully reproduce this problem.
Potential solution
Changing the _mx_dot function in from_mxnet.py with:
cc: @masahi @junrushao1994 @kevinthesun @ganler
The text was updated successfully, but these errors were encountered: