[AMDGPU] Allow null operands in VImage tensor instructions by RyanRio · Pull Request #200911 · llvm/llvm-project

RyanRio · 2026-06-01T19:27:12Z

NULL is equivalent to passing a block of SGPRs that are set to zero, and is allowed

llvmorg-github-actions · 2026-06-01T19:27:55Z

@llvm/pr-subscribers-backend-amdgpu

Author: Ryan Mitchell (RyanRio)

Changes

NULL is equivalent to passing a block of SGPRs that are set to zero, and is allowed

Full diff: https://github.com/llvm/llvm-project/pull/200911.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/MIMGInstructions.td (+3-3)
(modified) llvm/test/MC/AMDGPU/gfx1250_asm_vimage_err.s (-36)

diff --git a/llvm/lib/Target/AMDGPU/MIMGInstructions.td b/llvm/lib/Target/AMDGPU/MIMGInstructions.td
index 0f31697f15688..a425e06a7554c 100644
--- a/llvm/lib/Target/AMDGPU/MIMGInstructions.td
+++ b/llvm/lib/Target/AMDGPU/MIMGInstructions.td
@@ -2182,9 +2182,9 @@ class VIMAGE_TENSOR_Pseudo<string opName, bit _UpTo2D = 0> :
   let hasSideEffects = 0;
 
   bit UpTo2D = _UpTo2D;
-  let InOperandList = !if(UpTo2D, (ins SReg_128_XNULL:$vaddr0, SReg_256_XNULL:$vaddr1, R128A16:$r128, CPol:$cpol),
-                                      (ins SReg_128_XNULL:$vaddr0, SReg_256_XNULL:$vaddr1, SReg_128_XNULL:$vaddr2,
-                                       SReg_128_XNULL:$vaddr3, R128A16:$r128, CPol:$cpol));
+  let InOperandList = !if(UpTo2D, (ins SReg_128:$vaddr0, SReg_256:$vaddr1, R128A16:$r128, CPol:$cpol),
+                                      (ins SReg_128:$vaddr0, SReg_256:$vaddr1, SReg_128:$vaddr2,
+                                       SReg_128:$vaddr3, R128A16:$r128, CPol:$cpol));
   string AsmOperands = " $vaddr0, $vaddr1"#!if(UpTo2D, "", ", $vaddr2, $vaddr3")#"$r128$cpol";
 }
 
diff --git a/llvm/test/MC/AMDGPU/gfx1250_asm_vimage_err.s b/llvm/test/MC/AMDGPU/gfx1250_asm_vimage_err.s
index 2f911ae79c00f..3f8a913b2c458 100644
--- a/llvm/test/MC/AMDGPU/gfx1250_asm_vimage_err.s
+++ b/llvm/test/MC/AMDGPU/gfx1250_asm_vimage_err.s
@@ -25,42 +25,6 @@ tensor_store_from_lds s[0:3], s[4:11], s[12:15], s[16:19] r128
 tensor_store_from_lds s[0:3], s[4:11], s[12:15], s[16:19] th:TH_LOAD_NT_HT scope:SCOPE_DEV
 // GFX1250-ERR: :[[@LINE-1]]:59: error: invalid th value for store instructions
 
-tensor_load_to_lds null, s[4:11]
-// GFX1250-ERR: :[[@LINE-1]]:20: error: invalid operand for instruction
-
-tensor_load_to_lds s[0:3], null
-// GFX1250-ERR: :[[@LINE-1]]:28: error: invalid operand for instruction
-
-tensor_load_to_lds null, s[4:11], s[12:15], s[16:19]
-// GFX1250-ERR: :[[@LINE-1]]:20: error: invalid operand for instruction
-
-tensor_load_to_lds s[0:3], null, s[12:15], s[16:19]
-// GFX1250-ERR: :[[@LINE-1]]:28: error: invalid operand for instruction
-
-tensor_load_to_lds s[0:3], s[4:11], null, s[16:19]
-// GFX1250-ERR: :[[@LINE-1]]:37: error: invalid operand for instruction
-
-tensor_load_to_lds s[0:3], s[4:11], s[12:15], null
-// GFX1250-ERR: :[[@LINE-1]]:47: error: invalid operand for instruction
-
-tensor_store_from_lds null, s[4:11]
-// GFX1250-ERR: :[[@LINE-1]]:23: error: invalid operand for instruction
-
-tensor_store_from_lds s[0:3], null
-// GFX1250-ERR: :[[@LINE-1]]:31: error: invalid operand for instruction
-
-tensor_store_from_lds null, s[4:11], s[12:15], s[16:19]
-// GFX1250-ERR: :[[@LINE-1]]:23: error: invalid operand for instruction
-
-tensor_store_from_lds s[0:3], null, s[12:15], s[16:19]
-// GFX1250-ERR: :[[@LINE-1]]:31: error: invalid operand for instruction
-
-tensor_store_from_lds s[0:3], s[4:11], null, s[16:19]
-// GFX1250-ERR: :[[@LINE-1]]:40: error: invalid operand for instruction
-
-tensor_store_from_lds s[0:3], s[4:11], s[12:15], null
-// GFX1250-ERR: :[[@LINE-1]]:50: error: invalid operand for instruction
-
 tensor_load_to_lds s[14:17], s[4:11]
 // GFX1250-ERR: :[[@LINE-1]]:20: error: invalid register alignment

shiltian · 2026-06-01T19:38:54Z

 tensor_store_from_lds s[0:3], s[4:11], s[12:15], s[16:19] th:TH_LOAD_NT_HT scope:SCOPE_DEV
 // GFX1250-ERR: :[[@LINE-1]]:59: error: invalid th value for store instructions

-tensor_load_to_lds null, s[4:11]


Hmm, SP3 doesn't allow this syntax.

Also, need to add them to encoding/decoding tests.

rampitec

First 2 operands can be numbered SGPRs only.

RyanRio · 2026-06-01T20:42:50Z

Yeah unfortunately that seems to be the case, even if maybe hardware doesn't check for it.

rampitec · 2026-06-01T20:48:20Z

Yeah unfortunately that seems to be the case, even if maybe hardware doesn't check for it.

The other 2 can be null though.

RyanRio · 2026-06-01T22:59:57Z

True 😄 adding the asm/dasm tests too

rampitec

LGTM

shiltian · 2026-06-02T01:33:15Z

 // GFX12-ERR: :[[@LINE-1]]:1: error: instruction not supported on this GPU (gfx1200): tensor_store_from_lds
 // GFX1250: tensor_store_from_lds s[0:3], s[4:11] th:TH_STORE_BYPASS scope:SCOPE_SYS ; encoding: [0x01,0x40,0x71,0xd0,0x00,0x00,0x3c,0x7c,0x00,0x04,0x7c,0x7c]

+tensor_store_from_lds s[0:3], s[4:11], null, null th:TH_STORE_NT_HT scope:SCOPE_DEV


There seems to be some difference between this and SP3:

tensor_store_from_lds s[0:3], s[4:11], null, null dmask:0x1 th:TH_STORE_NT_HT scope:SCOPE_DEV // 000000000000: D0714001 7C680000 7C7C0400

Hm you are right, that 0x71 should be 0x31

This is also incorrect -

tensor_store_from_lds s[0:3], s[4:11] th:TH_STORE_BYPASS scope:SCOPE_SYS
// GFX12-ERR: :[[@line-1]]:1: error: instruction not supported on this GPU (gfx1200): tensor_store_from_lds
// GFX1250: tensor_store_from_lds s[0:3], s[4:11] th:TH_STORE_BYPASS scope:SCOPE_SYS ; encoding: [0x01,0x40,0x71,0xd0,0x00,0x00,0x3c,0x7c,0x00,0x04,0x7c,0x7c]

It has the same hex, and just leaves off the opnds (which btw, seems like in this case the sp3 does not leave them off, not specifying null is an error)

Are we missing something about the reasoning for this in the tablegen -

let dmask = 1; // sp3
let dim = 1; // sp3

@rampitec @changpeng

According to SPG these fields are unused and shall be set to 0.

Right so it's a mistake they're set to 1 right now yeah

RyanRio · 2026-06-05T14:30:11Z

Ping, good to merge this, and fix dim/dmask separately?

shiltian

Sounds good to me.

Allow null operands in VIMage tensor instructions

ffb0aff

RyanRio requested review from chinmaydd, krzysz00 and shiltian June 1, 2026 19:27

llvmorg-github-actions Bot added the backend:AMDGPU label Jun 1, 2026

shiltian reviewed Jun 1, 2026

View reviewed changes

shiltian requested a review from rampitec June 1, 2026 19:39

rampitec requested changes Jun 1, 2026

View reviewed changes

RyanRio closed this Jun 1, 2026

only v2-v3 can be null

3ff2e42

RyanRio reopened this Jun 1, 2026

add dasm and asm tests

d1c0b78

RyanRio requested review from rampitec and shiltian June 1, 2026 23:07

rampitec approved these changes Jun 1, 2026

View reviewed changes

shiltian reviewed Jun 2, 2026

View reviewed changes

RyanRio requested a review from shiltian June 5, 2026 14:30

shiltian approved these changes Jun 5, 2026

View reviewed changes

RyanRio merged commit de1ff3e into llvm:main Jun 5, 2026
10 checks passed

Conversation

RyanRio commented Jun 1, 2026

Uh oh!

llvmorg-github-actions Bot commented Jun 1, 2026

Uh oh!

shiltian Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

rampitec left a comment

Choose a reason for hiding this comment

Uh oh!

RyanRio commented Jun 1, 2026

Uh oh!

rampitec commented Jun 1, 2026

Uh oh!

RyanRio commented Jun 1, 2026

Uh oh!

rampitec left a comment

Choose a reason for hiding this comment

Uh oh!

shiltian Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

RyanRio Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

RyanRio Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

RyanRio Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

rampitec Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

RyanRio Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

RyanRio commented Jun 5, 2026

Uh oh!

shiltian left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants