[LV][EVL] Simplify EVL recipe transformation by using a single EVL mask. nfc #152479

Mel-Chen · 2025-08-07T10:52:25Z

The EVL mask is always defined as icmp ult (step-vector, EVL), so we only need to generate it once per plan in the header. Then, we replace all uses of the header mask with the EVL mask, and recursively optimize the users of EVL mask into EVL recipes. This way, the transformation to EVL recipes can be done with just a single loop.
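
As a rough illustration of why a single mask is enough, here is a scalar model of `icmp ult (step-vector, EVL)`. It is a sketch in plain C++, not LLVM API: `makeEVLMask`, `countActive`, and the `VF` parameter are hypothetical names, and the model assumes EVL never exceeds the vector length.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Scalar model of `icmp ult (step-vector, EVL)`.
// step-vector is <0, 1, ..., VF-1>, so lane i is active iff i < EVL.
// The function name and VF are illustrative only, not LLVM API.
std::vector<bool> makeEVLMask(uint32_t VF, uint32_t EVL) {
  assert(EVL <= VF && "this model assumes EVL does not exceed VF");
  std::vector<bool> Mask(VF);
  for (uint32_t Lane = 0; Lane < VF; ++Lane)
    Mask[Lane] = Lane < EVL;
  return Mask;
}

// Counts the active lanes of a mask produced by makeEVLMask.
uint32_t countActive(const std::vector<bool> &Mask) {
  uint32_t N = 0;
  for (bool Active : Mask)
    N += Active ? 1 : 0;
  return N;
}
```

The mask depends only on EVL, never on which header mask it replaces, which is why it can be built once and shared.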

llvmbot · 2025-08-07T10:53:09Z

@llvm/pr-subscribers-vectorizers

@llvm/pr-subscribers-llvm-transforms

Author: Mel Chen (Mel-Chen)

Changes

The EVL mask is always defined as icmp ult (step-vector, EVL), so we only need to generate it once per plan in the header. Then, we replace all uses of the header mask with the EVL mask, and recursively optimize the users of EVL mask into EVL recipes. This way, the transformation to EVL recipes can be done with just a single loop.

Full diff: https://github.com/llvm/llvm-project/pull/152479.diff

1 Files Affected:

(modified) llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp (+36-37)

diff --git a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
index 0cb704c85ba40..4afaa9c1ece53 100644
--- a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
+++ b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
@@ -2227,48 +2227,47 @@ static void transformRecipestoEVLRecipes(VPlan &Plan, VPValue &EVL) {
     }
   }
 
-  // Try to optimize header mask recipes away to their EVL variants.
+  // Replace header masks with a mask equivalent to predicating by EVL:
+  //
+  // icmp ule widen-canonical-iv backedge-taken-count
+  // ->
+  // icmp ult step-vector, EVL
+  VPRecipeBase *EVLR = EVL.getDefiningRecipe();
+  VPBuilder Builder(EVLR->getParent(), std::next(EVLR->getIterator()));
+  Type *EVLType = TypeInfo.inferScalarType(&EVL);
+  VPValue *EVLMask = Builder.createICmp(
+      CmpInst::ICMP_ULT,
+      Builder.createNaryOp(VPInstruction::StepVector, {}, EVLType), &EVL);
   for (VPValue *HeaderMask : collectAllHeaderMasks(Plan)) {
-    // TODO: Split optimizeMaskToEVL out and move into
-    // VPlanTransforms::optimize. transformRecipestoEVLRecipes should be run in
-    // tryToBuildVPlanWithVPRecipes beforehand.
-    for (VPUser *U : collectUsersRecursively(HeaderMask)) {
-      auto *CurRecipe = cast<VPRecipeBase>(U);
-      VPRecipeBase *EVLRecipe =
-          optimizeMaskToEVL(HeaderMask, *CurRecipe, TypeInfo, *AllOneMask, EVL);
-      if (!EVLRecipe)
-        continue;
-
-      [[maybe_unused]] unsigned NumDefVal = EVLRecipe->getNumDefinedValues();
-      assert(NumDefVal == CurRecipe->getNumDefinedValues() &&
-             "New recipe must define the same number of values as the "
-             "original.");
-      assert(
-          NumDefVal <= 1 &&
-          "Only supports recipes with a single definition or without users.");
-      EVLRecipe->insertBefore(CurRecipe);
-      if (isa<VPSingleDefRecipe, VPWidenLoadEVLRecipe>(EVLRecipe)) {
-        VPValue *CurVPV = CurRecipe->getVPSingleValue();
-        CurVPV->replaceAllUsesWith(EVLRecipe->getVPSingleValue());
-      }
-      ToErase.push_back(CurRecipe);
-    }
-
-    // Replace header masks with a mask equivalent to predicating by EVL:
-    //
-    // icmp ule widen-canonical-iv backedge-taken-count
-    // ->
-    // icmp ult step-vector, EVL
-    VPRecipeBase *EVLR = EVL.getDefiningRecipe();
-    VPBuilder Builder(EVLR->getParent(), std::next(EVLR->getIterator()));
-    Type *EVLType = TypeInfo.inferScalarType(&EVL);
-    VPValue *EVLMask = Builder.createICmp(
-        CmpInst::ICMP_ULT,
-        Builder.createNaryOp(VPInstruction::StepVector, {}, EVLType), &EVL);
     HeaderMask->replaceAllUsesWith(EVLMask);
     ToErase.push_back(HeaderMask->getDefiningRecipe());
   }
 
+  // Try to optimize header mask recipes away to their EVL variants.
+  // TODO: Split optimizeMaskToEVL out and move into
+  // VPlanTransforms::optimize. transformRecipestoEVLRecipes should be run in
+  // tryToBuildVPlanWithVPRecipes beforehand.
+  for (VPUser *U : collectUsersRecursively(EVLMask)) {
+    auto *CurRecipe = cast<VPRecipeBase>(U);
+    VPRecipeBase *EVLRecipe =
+        optimizeMaskToEVL(EVLMask, *CurRecipe, TypeInfo, *AllOneMask, EVL);
+    if (!EVLRecipe)
+      continue;
+
+    [[maybe_unused]] unsigned NumDefVal = EVLRecipe->getNumDefinedValues();
+    assert(NumDefVal == CurRecipe->getNumDefinedValues() &&
+           "New recipe must define the same number of values as the "
+           "original.");
+    assert(NumDefVal <= 1 &&
+           "Only supports recipes with a single definition or without users.");
+    EVLRecipe->insertBefore(CurRecipe);
+    if (isa<VPSingleDefRecipe, VPWidenLoadEVLRecipe>(EVLRecipe)) {
+      VPValue *CurVPV = CurRecipe->getVPSingleValue();
+      CurVPV->replaceAllUsesWith(EVLRecipe->getVPSingleValue());
+    }
+    ToErase.push_back(CurRecipe);
+  }
+
   for (VPRecipeBase *R : reverse(ToErase)) {
     SmallVector<VPValue *> PossiblyDead(R->operands());
     R->eraseFromParent();

alexey-bataev · 2025-08-07T12:09:29Z

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

+  VPRecipeBase *EVLR = EVL.getDefiningRecipe();
+  VPBuilder Builder(EVLR->getParent(), std::next(EVLR->getIterator()));
+  Type *EVLType = TypeInfo.inferScalarType(&EVL);
+  VPValue *EVLMask = Builder.createICmp(
+      CmpInst::ICMP_ULT,
+      Builder.createNaryOp(VPInstruction::StepVector, {}, EVLType), &EVL);


Should we first check that there is at least one user?

Given that this runs after optimization, the header mask may already have been optimized away. In that case we would end up with an unused EVLMask.

Although I'm pretty sure we do a second run of removeDeadRecipes afterwards, so it's probably OK.

I now remove the mask if it is dead: 53f0022

lukel97

LGTM. This should also make it easier to split up the parts needed for correctness and which parts are just about optimizing away the header mask.

As an aside, it's weird to begin with that there might be multiple header masks. From a quick check, it only appears to happen when the data tail folding style is used and we generate both a VPInstruction::ActiveLaneMask and an icmp ule IV, BTC. I'll see if we can change this so that we have at most one header mask.
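
A minimal sketch of why the two header masks mentioned above could plausibly be unified. It assumes the usual lane-wise reading of both forms: the active lane mask enables lane i when `IV + i < TC`, and the compare form enables it when `IV + i <= BTC`, with `BTC = TC - 1`. The helper names are hypothetical, and the 64-bit model ignores the overflow cases the real IR has to handle.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Model of an active lane mask: lane i is on iff IV + i < TC (trip count).
std::vector<bool> activeLaneMask(uint64_t IV, uint64_t TC, unsigned VF) {
  std::vector<bool> Mask(VF);
  for (unsigned Lane = 0; Lane < VF; ++Lane)
    Mask[Lane] = IV + Lane < TC;
  return Mask;
}

// Model of `icmp ule widen-canonical-iv, backedge-taken-count`:
// lane i is on iff IV + i <= BTC.
std::vector<bool> icmpUleMask(uint64_t IV, uint64_t BTC, unsigned VF) {
  std::vector<bool> Mask(VF);
  for (unsigned Lane = 0; Lane < VF; ++Lane)
    Mask[Lane] = IV + Lane <= BTC;
  return Mask;
}

// Checks that both forms agree on every vector iteration of a loop
// with trip count TC, assuming BTC = TC - 1.
bool masksAgree(uint64_t TC, unsigned VF) {
  assert(TC > 0 && VF > 0 && "model needs a non-empty loop and vector");
  for (uint64_t IV = 0; IV < TC; IV += VF)
    if (activeLaneMask(IV, TC, VF) != icmpUleMask(IV, TC - 1, VF))
      return false;
  return true;
}
```

In this model both masks select the same lanes, so replacing either one with the shared EVL mask has the same effect.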

lukel97 · 2025-08-07T12:18:52Z

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

+  VPRecipeBase *EVLR = EVL.getDefiningRecipe();
+  VPBuilder Builder(EVLR->getParent(), std::next(EVLR->getIterator()));
+  Type *EVLType = TypeInfo.inferScalarType(&EVL);
+  VPValue *EVLMask = Builder.createICmp(
+      CmpInst::ICMP_ULT,
+      Builder.createNaryOp(VPInstruction::StepVector, {}, EVLType), &EVL);


Although I'm pretty sure we do a second run of removeDeadRecipes afterwards so it's probably ok.

lukel97 · 2025-08-07T12:52:19Z

As an aside, it's weird to begin with that there might be multiple header masks. From a quick check, it only appears to happen when the data tail folding style is used and we generate both a VPInstruction::ActiveLaneMask and an icmp ule IV, BTC. I'll see if we can change this so that we have at most one header mask.

I've opened up #152489 for this

Mel-Chen · 2025-08-08T12:09:26Z

As an aside, it's weird to begin with that there might be multiple header masks. From a quick check, it only appears to happen when the data tail folding style is used and we generate both a VPInstruction::ActiveLaneMask and an icmp ule IV, BTC. I'll see if we can change this so that we have at most one header mask.

Yes, but I haven’t looked closely into the reason for having more than one header mask yet, since I’m out of office today. But I think there’s a chance we could unify them into a single header mask.

fhahn · 2025-08-11T09:14:03Z

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

+  }
+  // Remove dead EVL mask.
+  if (EVLMask->getNumUsers() == 0)
+    EVLMask->getDefiningRecipe()->eraseFromParent();


For consistency, should this also be added to ToErase?
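
To make the suggestion concrete, here is a toy model of the deferred-erasure pattern the function already uses for other recipes: dead recipes are queued in `ToErase` and erased later in reverse order. `Recipe`, `queueIfDead`, and `eraseAll` are hypothetical stand-ins, not VPlan classes, and the sketch says nothing about what the merged code ended up doing.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Toy stand-in for a recipe: a name, a user count, and an erased flag.
struct Recipe {
  std::string Name;
  unsigned NumUsers;
  bool Erased;
};

// Queue the recipe for deferred deletion only when nothing uses it,
// instead of erasing it on the spot.
void queueIfDead(Recipe &R, std::vector<Recipe *> &ToErase) {
  if (R.NumUsers == 0)
    ToErase.push_back(&R);
}

// Erase everything queued, walking the list in reverse as the loop over
// reverse(ToErase) does. Returns the names in the order they were erased.
std::vector<std::string> eraseAll(std::vector<Recipe *> &ToErase) {
  std::vector<std::string> Order;
  for (auto It = ToErase.rbegin(); It != ToErase.rend(); ++It) {
    assert(!(*It)->Erased && "recipe queued twice");
    (*It)->Erased = true;
    Order.push_back((*It)->Name);
  }
  ToErase.clear();
  return Order;
}
```

Routing the dead EVL mask through the same list keeps a single deletion point, at the cost of having to check for deadness before that final loop runs.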

Mel-Chen requested review from fhahn, lukel97, alexey-bataev and LiqinWeng August 7, 2025 10:52

llvmbot added vectorizers llvm:transforms labels Aug 7, 2025

alexey-bataev reviewed Aug 7, 2025

lukel97 approved these changes Aug 7, 2025

lukel97 mentioned this pull request Aug 7, 2025

Split VPlanTransforms::addExplicitVectorLength into variable step transformation and header mask optimisation #152541

Open

Mel-Chen added 2 commits August 8, 2025 04:40

nfc, unified evl mask

eb09e4e

remove dead mask

53f0022

Mel-Chen force-pushed the nfc-evl-mask branch from a4280e0 to 53f0022 Compare August 8, 2025 12:07

Mel-Chen requested a review from alexey-bataev August 8, 2025 12:10

alexey-bataev approved these changes Aug 8, 2025

Mel-Chen merged commit 6db3776 into llvm:main Aug 11, 2025
9 checks passed

fhahn reviewed Aug 11, 2025

Mel-Chen deleted the nfc-evl-mask branch August 15, 2025 07:59
