[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode #91327

jayfoad · 2024-05-07T13:53:04Z

No description provided.

llvmbot · 2024-05-07T13:53:32Z

@llvm/pr-subscribers-backend-amdgpu

Author: Jay Foad (jayfoad)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/91327.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIISelLowering.cpp (+8)
(modified) llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll (+21-26)

diff --git a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
index ed41c10b50d32..dd2dea501380c 100644
--- a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+++ b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
@@ -1604,6 +1604,14 @@ bool SITargetLowering::isLegalAddressingMode(const DataLayout &DL,
         return false;
     }
 
+    if (AS == AMDGPUAS::CONSTANT_ADDRESS && AM.BaseOffs < 0) {
+      // Scalar (non-buffer) loads can only use a negative offset if
+      // soffset+offset is non-negative. Since the compiler can only prove that
+      // in a few special cases, it is safer to claim that negative offsets are
+      // not supported.
+      return false;
+    }
+
     if (AM.Scale == 0) // r + i or just i, depending on HasBaseReg.
       return true;
 
diff --git a/llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll b/llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
index 54dc5b8b9d3dd..b0bbd90f165b9 100644
--- a/llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
+++ b/llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
@@ -279,38 +279,30 @@ end:
 }
 
 define amdgpu_cs void @test_sink_smem_offset_neg400(ptr addrspace(4) inreg %ptr, i32 inreg %val) {
-; GFX678-LABEL: test_sink_smem_offset_neg400:
-; GFX678:       ; %bb.0: ; %entry
-; GFX678-NEXT:    s_add_u32 s0, s0, 0xfffffe70
-; GFX678-NEXT:    s_addc_u32 s1, s1, -1
-; GFX678-NEXT:  .LBB5_1: ; %loop
-; GFX678-NEXT:    ; =>This Inner Loop Header: Depth=1
-; GFX678-NEXT:    s_waitcnt lgkmcnt(0)
-; GFX678-NEXT:    s_load_dword s3, s[0:1], 0x0
-; GFX678-NEXT:    s_add_i32 s2, s2, -1
-; GFX678-NEXT:    s_cmp_lg_u32 s2, 0
-; GFX678-NEXT:    s_cbranch_scc1 .LBB5_1
-; GFX678-NEXT:  ; %bb.2: ; %end
-; GFX678-NEXT:    s_endpgm
-;
-; GFX9-LABEL: test_sink_smem_offset_neg400:
-; GFX9:       ; %bb.0: ; %entry
-; GFX9-NEXT:  .LBB5_1: ; %loop
-; GFX9-NEXT:    ; =>This Inner Loop Header: Depth=1
-; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
-; GFX9-NEXT:    s_load_dword s3, s[0:1], -0x190
-; GFX9-NEXT:    s_add_i32 s2, s2, -1
-; GFX9-NEXT:    s_cmp_lg_u32 s2, 0
-; GFX9-NEXT:    s_cbranch_scc1 .LBB5_1
-; GFX9-NEXT:  ; %bb.2: ; %end
-; GFX9-NEXT:    s_endpgm
+; GFX6789-LABEL: test_sink_smem_offset_neg400:
+; GFX6789:       ; %bb.0: ; %entry
+; GFX6789-NEXT:    s_add_u32 s0, s0, 0xfffffe70
+; GFX6789-NEXT:    s_addc_u32 s1, s1, -1
+; GFX6789-NEXT:  .LBB5_1: ; %loop
+; GFX6789-NEXT:    ; =>This Inner Loop Header: Depth=1
+; GFX6789-NEXT:    s_waitcnt lgkmcnt(0)
+; GFX6789-NEXT:    s_load_dword s3, s[0:1], 0x0
+; GFX6789-NEXT:    s_add_i32 s2, s2, -1
+; GFX6789-NEXT:    s_cmp_lg_u32 s2, 0
+; GFX6789-NEXT:    s_cbranch_scc1 .LBB5_1
+; GFX6789-NEXT:  ; %bb.2: ; %end
+; GFX6789-NEXT:    s_endpgm
 ;
 ; GFX12-LABEL: test_sink_smem_offset_neg400:
 ; GFX12:       ; %bb.0: ; %entry
+; GFX12-NEXT:    s_movk_i32 s4, 0xfe70
+; GFX12-NEXT:    s_mov_b32 s5, -1
+; GFX12-NEXT:    s_delay_alu instid0(SALU_CYCLE_1)
+; GFX12-NEXT:    s_add_nc_u64 s[0:1], s[0:1], s[4:5]
 ; GFX12-NEXT:  .LBB5_1: ; %loop
 ; GFX12-NEXT:    ; =>This Inner Loop Header: Depth=1
 ; GFX12-NEXT:    s_wait_kmcnt 0x0
-; GFX12-NEXT:    s_load_b32 s3, s[0:1], -0x190
+; GFX12-NEXT:    s_load_b32 s3, s[0:1], 0x0
 ; GFX12-NEXT:    s_add_co_i32 s2, s2, -1
 ; GFX12-NEXT:    s_delay_alu instid0(SALU_CYCLE_1)
 ; GFX12-NEXT:    s_cmp_lg_u32 s2, 0
@@ -331,3 +323,6 @@ loop:
 end:
   ret void
 }
+;; NOTE: These prefixes are unused and the list is autogenerated. Do not add tests below this line:
+; GFX678: {{.*}}
+; GFX9: {{.*}}

jayfoad · 2024-05-07T13:54:01Z

This is intended to avoid some code quality regressions I saw with #89165. Perhaps it should even be folded into that PR.

jayfoad · 2024-05-07T13:55:45Z

llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll

-; GFX9-NEXT:    s_endpgm
+; GFX6789-LABEL: test_sink_smem_offset_neg400:
+; GFX6789:       ; %bb.0: ; %entry
+; GFX6789-NEXT:    s_add_u32 s0, s0, 0xfffffe70


For GFX9 this calculation is done outside the loop. With just #89165 applied, this calculation would be done inside the loop.

arsenm · 2024-05-07T13:58:13Z

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

@@ -1604,6 +1604,14 @@ bool SITargetLowering::isLegalAddressingMode(const DataLayout &DL,
        return false;
    }

+    if (AS == AMDGPUAS::CONSTANT_ADDRESS && AM.BaseOffs < 0) {


should also cover the constant_32bit (but again, this whole thing is just a weak heuristic for might-use-scalar-loads)

I specifically didn't want to include constant_32bit because I saw a case where LLPC was using that address space for an s_buffer_load, not an s_load. But I did not look into why that was happening. I have never understood constant_32bit.

It's a CONSTANT_ADDRESS truncated to the low 32-bits, where the high bits are assumed a constant from an attribute. The addressing modes should behave the same as constant, it's just implicitly promoted to the 64-bit pointer when codegenned

should also cover the constant_32bit

Done

arsenm

lgtm with test added

arsenm · 2024-05-08T14:14:11Z

llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll

@@ -279,38 +279,30 @@ end:
 }

 define amdgpu_cs void @test_sink_smem_offset_neg400(ptr addrspace(4) inreg %ptr, i32 inreg %val) {


Add the constant 32-bit test (6)?

…relim) This is a cherry-pick of an unmerged upstream change llvm#91327 Once the upstream change is finished and merged, this will need reverting. Change-Id: Ie89236494ed38063856d881b5c2944e650ee17c4

…lvm#91327)

[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode

ba9c2c4

llvmbot added the backend:AMDGPU label May 7, 2024

jayfoad requested review from arsenm, vangthao95 and piotrAMD May 7, 2024 13:53

jayfoad commented May 7, 2024

View reviewed changes

arsenm reviewed May 7, 2024

View reviewed changes

Also handle CONSTANT_ADDRESS_32BIT

e220e1a

arsenm approved these changes May 8, 2024

View reviewed changes

Add test case

52cd925

Merge remote-tracking branch 'origin/main' into HEAD

11266ee

arsenm approved these changes Jun 25, 2024

View reviewed changes

jayfoad merged commit aaf50bf into llvm:main Jun 25, 2024
7 checks passed

jayfoad deleted the islegal-smem-offset branch June 25, 2024 16:43

AlexisPerry pushed a commit to llvm-project-tlp/llvm-project that referenced this pull request Jul 9, 2024

[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode (l…

c32a256

…lvm#91327)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode #91327

[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode #91327

jayfoad commented May 7, 2024

llvmbot commented May 7, 2024

jayfoad commented May 7, 2024

jayfoad May 7, 2024

arsenm May 7, 2024

jayfoad May 7, 2024

arsenm May 7, 2024

jayfoad Jun 25, 2024

arsenm left a comment

arsenm May 8, 2024

jayfoad Jun 25, 2024

		@@ -279,38 +279,30 @@ end:
		}

		define amdgpu_cs void @test_sink_smem_offset_neg400(ptr addrspace(4) inreg %ptr, i32 inreg %val) {

[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode #91327

[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode #91327

Conversation

jayfoad commented May 7, 2024

llvmbot commented May 7, 2024

jayfoad commented May 7, 2024

jayfoad May 7, 2024

Choose a reason for hiding this comment

arsenm May 7, 2024

Choose a reason for hiding this comment

jayfoad May 7, 2024

Choose a reason for hiding this comment

arsenm May 7, 2024

Choose a reason for hiding this comment

jayfoad Jun 25, 2024

Choose a reason for hiding this comment

arsenm left a comment

Choose a reason for hiding this comment

arsenm May 8, 2024

Choose a reason for hiding this comment

jayfoad Jun 25, 2024

Choose a reason for hiding this comment