
[mlir][llvm] Add experimental.vector.interleave2 intrinsic #79270


Merged

merged 5 commits into llvm:main from c-rhodes:mlir-arm-sve-zip1-intrinsic on Jan 29, 2024

Conversation

c-rhodes
Collaborator

No description provided.

@llvmbot
Member

llvmbot commented Jan 24, 2024

@llvm/pr-subscribers-mlir-sve

@llvm/pr-subscribers-mlir

Author: Cullen Rhodes (c-rhodes)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/79270.diff

2 Files Affected:

  • (modified) mlir/include/mlir/Dialect/ArmSVE/IR/ArmSVE.td (+4)
  • (modified) mlir/test/Target/LLVMIR/arm-sve.mlir (+7)
diff --git a/mlir/include/mlir/Dialect/ArmSVE/IR/ArmSVE.td b/mlir/include/mlir/Dialect/ArmSVE/IR/ArmSVE.td
index e3f3d9e62e8fb39..754413a1ad491ec 100644
--- a/mlir/include/mlir/Dialect/ArmSVE/IR/ArmSVE.td
+++ b/mlir/include/mlir/Dialect/ArmSVE/IR/ArmSVE.td
@@ -410,4 +410,8 @@ def ConvertToSvboolIntrOp :
     /*overloadedResults=*/[]>,
     Arguments<(ins SVEPredicate:$mask)>;
 
+def Zip1IntrOp :
+  ArmSVE_IntrBinaryOverloadedOp<"zip1">,
+  Arguments<(ins AnyScalableVector, AnyScalableVector)>;
+
 #endif // ARMSVE_OPS
diff --git a/mlir/test/Target/LLVMIR/arm-sve.mlir b/mlir/test/Target/LLVMIR/arm-sve.mlir
index b63d3f06515690a..002b1f9d804a7ce 100644
--- a/mlir/test/Target/LLVMIR/arm-sve.mlir
+++ b/mlir/test/Target/LLVMIR/arm-sve.mlir
@@ -314,3 +314,10 @@ llvm.func @arm_sve_convert_to_svbool(
     : (vector<[1]xi1>) -> vector<[16]xi1>
   llvm.return
 }
+
+// CHECK-LABEL: @arm_sve_zip1
+// CHECK-NEXT: call <vscale x 8 x half> @llvm.aarch64.sve.zip1.nxv8f16(<vscale x 8 x half> %{{.*}}, <vscale x 8 x half> {{.*}})
+llvm.func @arm_sve_zip1(%arg0 : vector<[8]xf16>) -> vector<[8]xf16> {
+  %0 = "arm_sve.intr.zip1"(%arg0, %arg0) : (vector<[8]xf16>, vector<[8]xf16>) -> vector<[8]xf16>
+  llvm.return %0 : vector<[8]xf16>
+}

@llvmbot
Member

llvmbot commented Jan 24, 2024

@llvm/pr-subscribers-mlir-llvm

@MacDue
Member

MacDue commented Jan 24, 2024

Maybe add zip2 as well?
(In the 2-/4-way lowerings I think zip1, zip2 would use half the registers compared to two zip1s, which might be something we'd like to try.)
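
For illustration, a minimal LLVM IR sketch of that pairing (the AArch64 zip2 intrinsic exists in LLVM, but only zip1 gets a dialect op in this patch):

```llvm
; Two-way interleave via a zip1/zip2 pair: zip1 interleaves the low halves
; of the two inputs, zip2 the high halves.
%lo = call <vscale x 8 x half> @llvm.aarch64.sve.zip1.nxv8f16(<vscale x 8 x half> %a, <vscale x 8 x half> %b)
%hi = call <vscale x 8 x half> @llvm.aarch64.sve.zip2.nxv8f16(<vscale x 8 x half> %a, <vscale x 8 x half> %b)
```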

@dcaballe
Contributor

I've been successfully using vector.shuffle to model this pattern for Neon, and the zip instructions were generated accordingly, so I'm wondering whether we could do the same for scalable vectors. What is the way to model scalable shuffles in LLVM?
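
For fixed-width vectors the pattern in question looks something like this (a sketch; types chosen for illustration):

```mlir
// Fixed-width 2-way interleave expressed as a shuffle with a literal mask;
// on Neon this can lower to zip instructions.
%il = vector.shuffle %a, %b [0, 4, 1, 5, 2, 6, 3, 7] : vector<4xf16>, vector<4xf16>
```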

@banach-space
Contributor

I've been successfully using vector.shuffle to model this pattern for Neon, and the zip instructions were generated accordingly, so I'm wondering whether we could do the same for scalable vectors. What is the way to model scalable shuffles in LLVM?

IIUC, that's not possible for SVE. From https://llvm.org/docs/LangRef.html#id189:

For scalable vectors, the only valid mask values at present are zeroinitializer, undef and poison, since we cannot write all indices as literals for a vector with a length unknown at compile time.
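
Concretely, the fixed-width form below has no scalable counterpart, since the mask indices cannot be written out for an element count unknown at compile time (a sketch, not from the PR):

```llvm
; Valid for fixed vectors: the interleave mask is spelled out index by index.
%il = shufflevector <4 x i32> %a, <4 x i32> %b, <8 x i32> <i32 0, i32 4, i32 1, i32 5, i32 2, i32 6, i32 3, i32 7>
; For <vscale x 4 x i32> only zeroinitializer, undef, and poison masks are
; accepted, none of which can express an interleave.
```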

@MacDue
Member

MacDue commented Jan 24, 2024

IIUC, that's not possible for SVE. From https://llvm.org/docs/LangRef.html#id189:

For SVE/scalable vectors there is @llvm.experimental.vector.interleave2, which is a slightly higher-level abstraction for this:
https://llvm.org/docs/LangRef.html#llvm-experimental-vector-interleave2-intrinsic
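
For reference, a sketch of the intrinsic's shape (note the result is twice the length of each input; on SVE it lowers to a zip1/zip2 pair):

```llvm
; Target-agnostic 2-way interleave of two scalable vectors into one
; double-length result.
%res = call <vscale x 16 x half> @llvm.experimental.vector.interleave2.nxv16f16(<vscale x 8 x half> %a, <vscale x 8 x half> %b)
```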

The MLIR vector.shuffle operation currently cannot model a scalable zip, though, so in MLIR this would probably be a vector.scalable.interleave op (following vector.scalable.insert/extract, which also map to experimental LLVM intrinsics).

@c-rhodes
Collaborator Author

Maybe add zip2 as well? (In the 2-/4-way lowerings I think zip1, zip2 would use half the registers compared to two zip1s, which might be something we'd like to try.)

I tried that already but couldn't get it to work.

If we can get that to work it could be a nice future improvement, but I'd rather not add an intrinsic until we're sure it's OK, and it's not necessary for this first widening outer-product support.

@c-rhodes
Collaborator Author

IIUC, that's not possible for SVE. From https://llvm.org/docs/LangRef.html#id189:

For SVE/scalable vectors there is @llvm.experimental.vector.interleave2, which is a slightly higher-level abstraction for this: https://llvm.org/docs/LangRef.html#llvm-experimental-vector-interleave2-intrinsic

Oh cool, this is nicer, I'll use this intrinsic instead.

The MLIR vector.shuffle operation currently cannot model a scalable zip, though, so in MLIR this would probably be a vector.scalable.interleave op (following vector.scalable.insert/extract, which also map to experimental LLVM intrinsics).

Sounds good 👍 I'll update the patches to use the target-agnostic interleave intrinsic, and we can look to add an op later.
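
A rough sketch of how that might look in the LLVM dialect (the op name is assumed from the PR title; the merged form may differ):

```mlir
// Hypothetical use of the target-agnostic intrinsic from the LLVM dialect;
// the result is twice the length of each input.
llvm.func @interleave(%a : vector<[8]xf16>, %b : vector<[8]xf16>) -> vector<[16]xf16> {
  %0 = "llvm.intr.experimental.vector.interleave2"(%a, %b)
      : (vector<[8]xf16>, vector<[8]xf16>) -> vector<[16]xf16>
  llvm.return %0 : vector<[16]xf16>
}
```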

@c-rhodes changed the title from "[mlir][ArmSVE] add zip1 intrinsic" to "[mlir][llvm] add experimental.vector.interleave2 intrinsic" on Jan 26, 2024
Contributor

@banach-space left a comment

LGTM, thanks!

Contributor

@dcaballe left a comment

Cool!

@c-rhodes
Collaborator Author

@MacDue pointed out that the input type LLVM_AnyVector also admits LLVMFixedVectorType and LLVMScalableVectorType, for which cast<VectorType> will crash. I've updated the constraint to fix this.

Member

@MacDue left a comment

LGTM, thanks

@MacDue changed the title from "[mlir][llvm] add experimental.vector.interleave2 intrinsic" to "[mlir][llvm] Add experimental.vector.interleave2 intrinsic" on Jan 29, 2024
@c-rhodes c-rhodes merged commit 754a8ad into llvm:main Jan 29, 2024
@c-rhodes c-rhodes deleted the mlir-arm-sve-zip1-intrinsic branch January 29, 2024 13:22