[AArch64] Only apply bool vector bitcast opt if result is scalar #81256

nikic · 2024-02-09T15:27:07Z

This optimization tries to optimize bitcasts from <N x i1> to iN, but currently also triggers for <N x i1> to <M x iK> bitcasts, if custom lowering has been requested for these for an unrelated reason. Fix this by explicitly checking that the result type is scalar.

Fixes #81216.

This optimization tries to optimize bitcasts from <N x i1> to iN, but currently also triggers for <N x i1> to <M x iK> bitcasts, if custom lowering has been requested for these for an unrelated reason. Fix this by explicitly checking that the result type is scalar. Fixes llvm#81216.

llvmbot · 2024-02-09T15:27:38Z

@llvm/pr-subscribers-backend-aarch64

Author: Nikita Popov (nikic)

Changes

This optimization tries to optimize bitcasts from <N x i1> to iN, but currently also triggers for <N x i1> to <M x iK> bitcasts, if custom lowering has been requested for these for an unrelated reason. Fix this by explicitly checking that the result type is scalar.

Fixes #81216.

Full diff: https://github.com/llvm/llvm-project/pull/81256.diff

2 Files Affected:

(modified) llvm/lib/Target/AArch64/AArch64ISelLowering.cpp (+2-1)
(modified) llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll (+28)

diff --git a/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp b/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
index 20290c958a70e9..2854dd4d96746a 100644
--- a/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
+++ b/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
@@ -24615,7 +24615,8 @@ void AArch64TargetLowering::ReplaceBITCASTResults(
     return;
   }
 
-  if (SrcVT.isVector() && SrcVT.getVectorElementType() == MVT::i1)
+  if (SrcVT.isVector() && SrcVT.getVectorElementType() == MVT::i1 &&
+      !VT.isVector())
     return replaceBoolVectorBitcast(N, Results, DAG);
 
   if (VT != MVT::i16 || (SrcVT != MVT::f16 && SrcVT != MVT::bf16))
diff --git a/llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll b/llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
index 1b22e2f900ddb7..557aa010b3a7d9 100644
--- a/llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
+++ b/llvm/test/CodeGen/AArch64/vec-combine-compare-to-bitmask.ll
@@ -489,3 +489,31 @@ define i6 @no_combine_illegal_num_elements(<6 x i32> %vec) {
   %bitmask = bitcast <6 x i1> %cmp_result to i6
   ret i6 %bitmask
 }
+
+; Only apply the combine when casting a vector to a scalar.
+define <2 x i8> @vector_to_vector_cast(<16 x i1> %arg) nounwind {
+; CHECK-LABEL: vector_to_vector_cast:
+; CHECK:       ; %bb.0:
+; CHECK-NEXT:    sub sp, sp, #16
+; CHECK-NEXT:    shl.16b v0, v0, #7
+; CHECK-NEXT:  Lloh36:
+; CHECK-NEXT:    adrp x8, lCPI20_0@PAGE
+; CHECK-NEXT:  Lloh37:
+; CHECK-NEXT:    ldr q1, [x8, lCPI20_0@PAGEOFF]
+; CHECK-NEXT:    add x8, sp, #14
+; CHECK-NEXT:    cmlt.16b v0, v0, #0
+; CHECK-NEXT:    and.16b v0, v0, v1
+; CHECK-NEXT:    ext.16b v1, v0, v0, #8
+; CHECK-NEXT:    zip1.16b v0, v0, v1
+; CHECK-NEXT:    addv.8h h0, v0
+; CHECK-NEXT:    str h0, [sp, #14]
+; CHECK-NEXT:    ld1.b { v0 }[0], [x8]
+; CHECK-NEXT:    orr x8, x8, #0x1
+; CHECK-NEXT:    ld1.b { v0 }[4], [x8]
+; CHECK-NEXT:    ; kill: def $d0 killed $d0 killed $q0
+; CHECK-NEXT:    add sp, sp, #16
+; CHECK-NEXT:    ret
+; CHECK-NEXT:    .loh AdrpLdr Lloh36, Lloh37
+  %bc = bitcast <16 x i1> %arg to <2 x i8>
+  ret <2 x i8> %bc
+}

davemgreen

LGTM

lawben

LGTM

…m#81256) This optimization tries to optimize bitcasts from `<N x i1>` to iN, but currently also triggers for `<N x i1>` to `<M x iK>` bitcasts, if custom lowering has been requested for these for an unrelated reason. Fix this by explicitly checking that the result type is scalar. Fixes llvm#81216. (cherry picked from commit 92d7992)

nikic requested review from aemerson, lawben and davemgreen February 9, 2024 15:27

llvmbot added the backend:AArch64 label Feb 9, 2024

davemgreen approved these changes Feb 9, 2024

View reviewed changes

lawben approved these changes Feb 9, 2024

View reviewed changes

nikic merged commit 92d7992 into llvm:main Feb 12, 2024
6 checks passed

nikic deleted the aarch64-bitcast-fix branch February 12, 2024 09:00

pointhex mentioned this pull request May 7, 2024

getStyleDiagHandler #91314

Closed

aemerson mentioned this pull request May 9, 2024

release/18.x: [AArc64][GlobalISel] Fix legalizer assert for G_INSERT_VECTOR_ELT - manual merge #91672

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AArch64] Only apply bool vector bitcast opt if result is scalar #81256

[AArch64] Only apply bool vector bitcast opt if result is scalar #81256

nikic commented Feb 9, 2024 •

edited

Loading

llvmbot commented Feb 9, 2024

davemgreen left a comment

lawben left a comment

[AArch64] Only apply bool vector bitcast opt if result is scalar #81256

[AArch64] Only apply bool vector bitcast opt if result is scalar #81256

Conversation

nikic commented Feb 9, 2024 • edited Loading

llvmbot commented Feb 9, 2024

davemgreen left a comment

Choose a reason for hiding this comment

lawben left a comment

Choose a reason for hiding this comment

nikic commented Feb 9, 2024 •

edited

Loading