-
Notifications
You must be signed in to change notification settings - Fork 12.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compiler no longer producing single horizontal add instruction after ab2c499d #49736
Labels
Comments
Link to review of the change that caused the regression: https://reviews.llvm.org/D98714 |
To add more details: this should be fixed after Alexey's patch (https://reviews.llvm.org/D57059) for the non-power-of-two vector sizes landing. |
RKSimon
added a commit
that referenced
this issue
May 8, 2022
RKSimon
added a commit
that referenced
this issue
Oct 30, 2024
I've kept the old PR50392 tag since this is such an old issue....
smallp-o-p
pushed a commit
to smallp-o-p/llvm-project
that referenced
this issue
Nov 3, 2024
…"binop (shuffle), (shuffle)" (llvm#114101) Add foldPermuteOfBinops - to fold a permute (single source shuffle) through a binary op that is being fed by other shuffles. Fixes llvm#94546 Fixes llvm#49736
NoumanAmir657
pushed a commit
to NoumanAmir657/llvm-project
that referenced
this issue
Nov 4, 2024
I've kept the old PR50392 tag since this is such an old issue....
NoumanAmir657
pushed a commit
to NoumanAmir657/llvm-project
that referenced
this issue
Nov 4, 2024
…"binop (shuffle), (shuffle)" (llvm#114101) Add foldPermuteOfBinops - to fold a permute (single source shuffle) through a binary op that is being fed by other shuffles. Fixes llvm#94546 Fixes llvm#49736
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Extended Description
One of our internal tests compiles the following code and verifies that it generates a horizontal add instruction in the resulting assembly when optimizations are enabled and targeting btver2.
Prior to commit ab2c499, when compiled with "-g0 -O3 -march=btver2", the compiler would produce the following code:
But following the mentioned change, the compiler is now producing the following code instead:
The text was updated successfully, but these errors were encountered: