[AMDGPU] Baseline gfx1250 speed model. #145217

rampitec · 2025-06-22T07:34:09Z

No description provided.

rampitec · 2025-06-22T07:34:23Z

[AMDGPU] Baseline gfx1250 speed model. #145217 👈 (View in Graphite)
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

llvmbot · 2025-06-22T07:35:38Z

@llvm/pr-subscribers-backend-amdgpu

Author: Stanislav Mekhanoshin (rampitec)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/145217.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/GCNProcessors.td (+1-1)
(modified) llvm/lib/Target/AMDGPU/SISchedule.td (+33)

diff --git a/llvm/lib/Target/AMDGPU/GCNProcessors.td b/llvm/lib/Target/AMDGPU/GCNProcessors.td
index 0b331bd3f3fb6..b5ffa64c3a4b4 100644
--- a/llvm/lib/Target/AMDGPU/GCNProcessors.td
+++ b/llvm/lib/Target/AMDGPU/GCNProcessors.td
@@ -326,6 +326,6 @@ def : ProcessorModel<"gfx12-generic", GFX12SpeedModel,
   FeatureISAVersion12_Generic.Features
 >;
 
-def : ProcessorModel<"gfx1250", GFX12SpeedModel,
+def : ProcessorModel<"gfx1250", GFX1250SpeedModel,
   FeatureISAVersion12_50.Features
 >;
diff --git a/llvm/lib/Target/AMDGPU/SISchedule.td b/llvm/lib/Target/AMDGPU/SISchedule.td
index 2a374b360b04a..1679cee320067 100644
--- a/llvm/lib/Target/AMDGPU/SISchedule.td
+++ b/llvm/lib/Target/AMDGPU/SISchedule.td
@@ -99,6 +99,7 @@ def SIDPGFX950FullSpeedModel : SISchedMachineModel;
 def GFX10SpeedModel : SISchedMachineModel;
 def GFX11SpeedModel : SISchedMachineModel;
 def GFX12SpeedModel : SISchedMachineModel;
+def GFX1250SpeedModel : SISchedMachineModel;
 
 // XXX: Are the resource counts correct?
 def HWBranch : ProcResource<1> {
@@ -455,3 +456,35 @@ def : HWWriteRes<WriteBarrier,           [HWBranch],       2000>;
 def : InstRW<[WriteCopy], (instrs COPY)>;
 
 }  // End SchedModel = GFX12SpeedModel
+
+multiclass GFX125xCommonWriteRes {
+
+def : HWWriteRes<Write32Bit,             [HWVALU, HWRC],   5>;
+def : HWWriteRes<WriteFloatCvt,          [HWVALU, HWRC],   5>;
+def : HWWriteRes<WriteTrans32,           [HWTransVALU, HWRC],   7>;
+def : HWWriteRes<WriteQuarterRate32,     [HWVALU, HWRC],   6>;
+def : HWWriteRes<WriteFloatFMA,          [HWVALU, HWRC],   5>;
+def : HWWriteRes<WritePseudoScalarTrans, [HWVALU, HWRC],   8>;
+
+def : HWWriteRes<WriteBranch,            [HWBranch],       32>;
+def : HWWriteRes<WriteExport,            [HWExport, HWRC], 16>;
+def : HWWriteRes<WriteLDS,               [HWLGKM,   HWRC], 20>;
+def : HWWriteRes<WriteSALU,              [HWSALU,   HWRC], 2>;
+def : HWWriteRes<WriteSFPU,              [HWSALU,   HWRC], 4>;
+def : HWWriteRes<WriteSMEM,              [HWLGKM,   HWRC], 20>;
+def : HWWriteRes<WriteVMEM,              [HWVMEM,   HWRC], 320>;
+def : HWWriteRes<WriteBarrier,           [HWBranch],       2000>;
+
+def : InstRW<[WriteCopy], (instrs COPY)>;
+} // End GFX125xCommonWriteRes
+
+let SchedModel = GFX1250SpeedModel in {
+defm : GFX125xCommonWriteRes;
+
+def : HWWriteRes<Write64Bit,             [HWVALU, HWRC],   7>;
+def : HWWriteRes<WriteIntMul,            [HWVALU, HWRC],   11>;
+def : HWWriteRes<WriteDouble,            [HWVALU, HWRC],   32>;
+def : HWWriteRes<WriteDoubleAdd,         [HWVALU, HWRC],   32>;
+def : HWWriteRes<WriteDoubleCvt,         [HWVALU, HWRC],   32>;
+def : HWWriteRes<WriteTrans64,           [HWVALU, HWTransVALU, HWRC], 38>;
+} // SchedModel = GFX1250SpeedModel

DadSchoorse · 2025-06-22T07:48:19Z

llvm/lib/Target/AMDGPU/SISchedule.td

+def : HWWriteRes<WriteTrans32,           [HWTransVALU, HWRC],   7>;
+def : HWWriteRes<WriteQuarterRate32,     [HWVALU, HWRC],   6>;
+def : HWWriteRes<WriteFloatFMA,          [HWVALU, HWRC],   5>;
+def : HWWriteRes<WritePseudoScalarTrans, [HWVALU, HWRC],   8>;


Why do WriteTrans32 and WritePseudoScalarTrans use different resources? And it seems unintuitive that the scalar trans cost is higher than trans32, is that correct?

According to the spec it uses VALU pipeline. I suspect it uses both, but that is really what is written. Then it is of course inherited from the gfx12 baseline.

And yes, it is correct it is higher, because you also need to move data to the pipeline.

arsenm

llvm-mca tests would be good

[AMDGPU] Baseline gfx1250 speed model.

da79056

rampitec requested review from kerbowa, changpeng and shiltian June 22, 2025 07:34

rampitec marked this pull request as ready for review June 22, 2025 07:35

llvmbot added the backend:AMDGPU label Jun 22, 2025

DadSchoorse reviewed Jun 22, 2025

View reviewed changes

arsenm approved these changes Jun 23, 2025

View reviewed changes

rampitec merged commit 89c6144 into main Jun 23, 2025
11 checks passed

rampitec deleted the users/rampitec/06-22-_amdgpu_baseline_gfx1250_speed_model branch June 23, 2025 03:26

miguelcsx pushed a commit to miguelcsx/llvm-project that referenced this pull request Jun 23, 2025

[AMDGPU] Baseline gfx1250 speed model. (llvm#145217)

5c5ff12

Jaddyen pushed a commit to Jaddyen/llvm-project that referenced this pull request Jun 23, 2025

[AMDGPU] Baseline gfx1250 speed model. (llvm#145217)

e6a73b3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AMDGPU] Baseline gfx1250 speed model. #145217

[AMDGPU] Baseline gfx1250 speed model. #145217

Uh oh!

rampitec commented Jun 22, 2025

Uh oh!

rampitec commented Jun 22, 2025

Uh oh!

llvmbot commented Jun 22, 2025

Uh oh!

DadSchoorse Jun 22, 2025

Uh oh!

rampitec Jun 22, 2025

Uh oh!

rampitec Jun 22, 2025

Uh oh!

arsenm left a comment

Uh oh!

Uh oh!

Uh oh!

[AMDGPU] Baseline gfx1250 speed model. #145217

[AMDGPU] Baseline gfx1250 speed model. #145217

Uh oh!

Conversation

rampitec commented Jun 22, 2025

Uh oh!

rampitec commented Jun 22, 2025

Uh oh!

llvmbot commented Jun 22, 2025

Uh oh!

DadSchoorse Jun 22, 2025

Choose a reason for hiding this comment

Uh oh!

rampitec Jun 22, 2025

Choose a reason for hiding this comment

Uh oh!

rampitec Jun 22, 2025

Choose a reason for hiding this comment

Uh oh!

arsenm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!