AMDGPU: Remove wavefrontsize64 feature from dummy target #117410

arsenm · 2024-11-23T02:42:00Z

This is a refinement for the existing hack. With this,
the default target will have neither wavefrontsize feature
present, unless it was explicitly specified. That is,
getWavefrontSize() == 64 no longer implies +wavefrontsize64.
getWavefrontSize() == 32 does imply +wavefrontsize32.

Continue to assume the value is 64 with no wavesize feature.
This maintains the codegenable property without any code
that directly cares about the wavesize needing to worry about it.

Introduce an isWaveSizeKnown helper to check if we know the
wavesize is accurate based on having one of the features explicitly
set, or a known target-cpu.

I'm not sure what's going on in wave_any.s. It's testing what
happens when both wavesizes are enabled, but this is treated
as an error in codegen. We now treat wave32 as the winning
case, so some cases that were previously printed as vcc are now
vcc_lo.

arsenm · 2024-11-23T02:42:17Z

This stack of pull requests is managed by Graphite. Learn more about stacking.

llvmbot · 2024-11-23T02:42:45Z

@llvm/pr-subscribers-mc

@llvm/pr-subscribers-backend-amdgpu

Author: Matt Arsenault (arsenm)

Changes

This is a refinement for the existing hack. With this,
the default target will have neither wavefrontsize feature
present, unless it was explicitly specified. That is,
getWavefrontSize() == 64 no longer implies +wavefrontsize64.
getWavefrontSize() == 32 does imply +wavefrontsize32.

Continue to assume the value is 64 with no wavesize feature.
This maintains the codegenable property without any code
that directly cares about the wavesize needs to worry about it.

Introduce an isWaveSizeKnown helper to check if we know the
wavesize is accurate based on having one of the features explicitly
set, or a known target-cpu.

Full diff: https://github.com/llvm/llvm-project/pull/117410.diff

4 Files Affected:

(modified) llvm/lib/Target/AMDGPU/GCNProcessors.td (+2-2)
(modified) llvm/lib/Target/AMDGPU/GCNSubtarget.cpp (+7-9)
(modified) llvm/lib/Target/AMDGPU/GCNSubtarget.h (+8)
(modified) llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.cpp (+3-3)

diff --git a/llvm/lib/Target/AMDGPU/GCNProcessors.td b/llvm/lib/Target/AMDGPU/GCNProcessors.td
index 3403cbab526d46..6241fa6e22ab8b 100644
--- a/llvm/lib/Target/AMDGPU/GCNProcessors.td
+++ b/llvm/lib/Target/AMDGPU/GCNProcessors.td
@@ -9,11 +9,11 @@
 // The code produced for "generic" is only useful for tests and cannot
 // reasonably be expected to execute on any particular target.
 def : ProcessorModel<"generic", NoSchedModel,
-  [FeatureWavefrontSize64, FeatureGDS, FeatureGWS]
+  [FeatureGDS, FeatureGWS]
 >;
 
 def : ProcessorModel<"generic-hsa", NoSchedModel,
-  [FeatureWavefrontSize64, FeatureGDS, FeatureGWS, FeatureFlatAddressSpace]
+  [FeatureGDS, FeatureGWS, FeatureFlatAddressSpace]
 >;
 
 //===------------------------------------------------------------===//
diff --git a/llvm/lib/Target/AMDGPU/GCNSubtarget.cpp b/llvm/lib/Target/AMDGPU/GCNSubtarget.cpp
index 6233ca2eb4f1dd..51361b75940560 100644
--- a/llvm/lib/Target/AMDGPU/GCNSubtarget.cpp
+++ b/llvm/lib/Target/AMDGPU/GCNSubtarget.cpp
@@ -100,14 +100,16 @@ GCNSubtarget &GCNSubtarget::initializeSubtargetDependencies(const Triple &TT,
   if (Gen == AMDGPUSubtarget::INVALID) {
     Gen = TT.getOS() == Triple::AMDHSA ? AMDGPUSubtarget::SEA_ISLANDS
                                        : AMDGPUSubtarget::SOUTHERN_ISLANDS;
-  }
-
-  if (!hasFeature(AMDGPU::FeatureWavefrontSize32) &&
-      !hasFeature(AMDGPU::FeatureWavefrontSize64)) {
+    // Assume wave64 for the unknown target, if not explicitly set.
+    if (getWavefrontSizeLog2() == 0)
+      WavefrontSizeLog2 = 6;
+  } else if (!hasFeature(AMDGPU::FeatureWavefrontSize32) &&
+             !hasFeature(AMDGPU::FeatureWavefrontSize64)) {
     // If there is no default wave size it must be a generation before gfx10,
     // these have FeatureWavefrontSize64 in their definition already. For gfx10+
     // set wave32 as a default.
     ToggleFeature(AMDGPU::FeatureWavefrontSize32);
+    WavefrontSizeLog2 = getGeneration() >= AMDGPUSubtarget::GFX10 ? 5 : 6;
   }
 
   // We don't support FP64 for EG/NI atm.
@@ -147,10 +149,6 @@ GCNSubtarget &GCNSubtarget::initializeSubtargetDependencies(const Triple &TT,
       !getFeatureBits().test(AMDGPU::FeatureCuMode))
     LocalMemorySize *= 2;
 
-  // Don't crash on invalid devices.
-  if (WavefrontSizeLog2 == 0)
-    WavefrontSizeLog2 = 5;
-
   HasFminFmaxLegacy = getGeneration() < AMDGPUSubtarget::VOLCANIC_ISLANDS;
   HasSMulHi = getGeneration() >= AMDGPUSubtarget::GFX9;
 
@@ -166,7 +164,7 @@ GCNSubtarget &GCNSubtarget::initializeSubtargetDependencies(const Triple &TT,
 
 void GCNSubtarget::checkSubtargetFeatures(const Function &F) const {
   LLVMContext &Ctx = F.getContext();
-  if (hasFeature(AMDGPU::FeatureWavefrontSize32) ==
+  if (hasFeature(AMDGPU::FeatureWavefrontSize32) &&
       hasFeature(AMDGPU::FeatureWavefrontSize64)) {
     Ctx.diagnose(DiagnosticInfoUnsupported(
         F, "must specify exactly one of wavefrontsize32 and wavefrontsize64"));
diff --git a/llvm/lib/Target/AMDGPU/GCNSubtarget.h b/llvm/lib/Target/AMDGPU/GCNSubtarget.h
index f3f96940c1f44b..5eada4f003ece7 100644
--- a/llvm/lib/Target/AMDGPU/GCNSubtarget.h
+++ b/llvm/lib/Target/AMDGPU/GCNSubtarget.h
@@ -1564,6 +1564,14 @@ class GCNSubtarget final : public AMDGPUGenSubtargetInfo,
     return getWavefrontSize() == 64;
   }
 
+  /// Returns if the wavesize of this subtarget is known reliable. This is false
+  /// only for the a default target-cpu that does not have an explicit
+  /// +wavefrontsize target feature.
+  bool isWaveSizeKnown() const {
+    return hasFeature(AMDGPU::FeatureWavefrontSize32) ||
+           hasFeature(AMDGPU::FeatureWavefrontSize64);
+  }
+
   const TargetRegisterClass *getBoolRC() const {
     return getRegisterInfo()->getBoolRC();
   }
diff --git a/llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.cpp b/llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.cpp
index 344028c4b48689..e21aa70c9859a0 100644
--- a/llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.cpp
+++ b/llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUInstPrinter.cpp
@@ -649,9 +649,9 @@ void AMDGPUInstPrinter::printDefaultVccOperand(bool FirstOperand,
                                                raw_ostream &O) {
   if (!FirstOperand)
     O << ", ";
-  printRegOperand(STI.hasFeature(AMDGPU::FeatureWavefrontSize64)
-                      ? AMDGPU::VCC
-                      : AMDGPU::VCC_LO,
+  printRegOperand(STI.hasFeature(AMDGPU::FeatureWavefrontSize32)
+                      ? AMDGPU::VCC_LO
+                      : AMDGPU::VCC,
                   O, MRI);
   if (FirstOperand)
     O << ", ";

jhuber6

Does this target get invoked with empty -mcpu?

arsenm · 2024-11-23T02:45:52Z

Yes

jhuber6

Thanks! libc tests are still happy so I'm fairly confident this doesn't break anything.

jhuber6 · 2024-11-23T03:10:25Z

The disassembler seems unhappy on CI (misclicked the comment button and accidentally closed it, sorry)

arsenm · 2024-11-23T17:16:49Z

Merge activity

Nov 23, 12:16 PM EST: A user started a stack merge that includes this pull request via Graphite.
Nov 23, 12:25 PM EST: Graphite rebased this pull request as part of a merge.
Nov 23, 12:27 PM EST: A user merged this pull request with Graphite.

This is a refinement for the existing hack. With this, the default target will have neither wavefrontsize feature present, unless it was explicitly specified. That is, getWavefrontSize() == 64 no longer implies +wavefrontsize64. getWavefrontSize() == 32 does imply +wavefrontsize32. Continue to assume the value is 64 with no wavesize feature. This maintains the codegenable property without any code that directly cares about the wavesize needs to worry about it. Introduce an isWaveSizeKnown helper to check if we know the wavesize is accurate based on having one of the features explicitly set, or a known target-cpu. I'm not sure what's going on in wave_any.s. It's testing what happens when both wavesizes are enabled, but this is treated as an error in codegen. We now treat wave32 as the winning case, so some cases that were previously printed as vcc are now vcc_lo.

arsenm requested review from AlexVlx, jayfoad, jhuber6 and rampitec November 23, 2024 02:42

arsenm added the backend:AMDGPU label Nov 23, 2024 — with Graphite App

arsenm marked this pull request as ready for review November 23, 2024 02:42

arsenm mentioned this pull request Nov 23, 2024

[llvm][AMDGPU] Fold llvm.amdgcn.wavefrontsize early #114481

Open

jhuber6 reviewed Nov 23, 2024

View reviewed changes

jhuber6 approved these changes Nov 23, 2024

View reviewed changes

arsenm mentioned this pull request Nov 23, 2024

AMDGPU: Use isWave[32|64] instead of comparing size value #117411

Merged

jhuber6 closed this Nov 23, 2024

jhuber6 reopened this Nov 23, 2024

shiltian approved these changes Nov 23, 2024

View reviewed changes

arsenm force-pushed the users/arsenm/remove-wave64-feature-from-default-target branch from dad0898 to a9b9b65 Compare November 23, 2024 05:55

arsenm changed the base branch from main to users/arsenm/amdgpu-move-disassembler-mcsubtarget-info-hack November 23, 2024 05:55

llvmbot added the mc Machine (object) code label Nov 23, 2024

This was referenced Nov 23, 2024

AMDGPU: Stop running assembler tests with default cpu #117421

Merged

AMDGPU: Move default wavesize hack for disassembler #117422

Merged

arsenm force-pushed the users/arsenm/amdgpu-move-disassembler-mcsubtarget-info-hack branch from bccd646 to 8c00007 Compare November 23, 2024 16:05

arsenm force-pushed the users/arsenm/remove-wave64-feature-from-default-target branch from 0f6254a to 79ff6e6 Compare November 23, 2024 16:06

arsenm force-pushed the users/arsenm/amdgpu-move-disassembler-mcsubtarget-info-hack branch from 8c00007 to a4f1256 Compare November 23, 2024 17:22

Base automatically changed from users/arsenm/amdgpu-move-disassembler-mcsubtarget-info-hack to main November 23, 2024 17:24

arsenm added 2 commits November 23, 2024 17:24

check if wave32 in disassembler

6fa5b9a

arsenm force-pushed the users/arsenm/remove-wave64-feature-from-default-target branch from 79ff6e6 to 6fa5b9a Compare November 23, 2024 17:25

arsenm merged commit cd20fc0 into main Nov 23, 2024
5 of 8 checks passed

arsenm deleted the users/arsenm/remove-wave64-feature-from-default-target branch November 23, 2024 17:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AMDGPU: Remove wavefrontsize64 feature from dummy target #117410

AMDGPU: Remove wavefrontsize64 feature from dummy target #117410

arsenm commented Nov 23, 2024 •

edited

Loading

arsenm commented Nov 23, 2024 •

edited

Loading

llvmbot commented Nov 23, 2024 •

edited

Loading

jhuber6 left a comment

arsenm commented Nov 23, 2024

jhuber6 left a comment

jhuber6 commented Nov 23, 2024 •

edited

Loading

arsenm commented Nov 23, 2024 •

edited

Loading

AMDGPU: Remove wavefrontsize64 feature from dummy target #117410

AMDGPU: Remove wavefrontsize64 feature from dummy target #117410

Conversation

arsenm commented Nov 23, 2024 • edited Loading

arsenm commented Nov 23, 2024 • edited Loading

llvmbot commented Nov 23, 2024 • edited Loading

jhuber6 left a comment

Choose a reason for hiding this comment

arsenm commented Nov 23, 2024

jhuber6 left a comment

Choose a reason for hiding this comment

jhuber6 commented Nov 23, 2024 • edited Loading

arsenm commented Nov 23, 2024 • edited Loading

Merge activity

arsenm commented Nov 23, 2024 •

edited

Loading

arsenm commented Nov 23, 2024 •

edited

Loading

llvmbot commented Nov 23, 2024 •

edited

Loading

jhuber6 commented Nov 23, 2024 •

edited

Loading

arsenm commented Nov 23, 2024 •

edited

Loading