LLVM and SPIRV-LLVM-Translator pulldown (WW27 2025) #19233

iclsrc · 2025-07-01T09:25:23Z

LLVM: llvm/llvm-project@58987d2
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@dcfd949

…#143415) Cf. https://discourse.llvm.org/t/mlir-dead-code-analysis/67568/10 Custom analysis passes will not work properly unless both DeadCodeAnalysis and SparseConstantPropagation are loaded to the DataFlowSolver. This is intended behavior, but surprising to many users as shown in the thread. In lieu of a longer-term fix (which I am not knowledgeable enough to implement myself, yet), this commit adds a helper function that loads these two analyses, as well as providing breadcrumbs for an explanation of the problem. The existing places in the codebase where these two analyses are loaded for the purpose of running other unrelated analyses are replaced by the use of the helper. --------- Co-authored-by: Jeremy Kun <[email protected]> Co-authored-by: Oleksandr "Alex" Zinenko <[email protected]>

… NFC.

Summary: I'll probably want to use this as a more generic utility in the future. This patch reworks it to make it a top level function. I also tried to decouple this from the OpenMP utilities to make that easier in the future. Instead, I just use `-march=native` functionality which is the same thing. Needed a small hack to skip the linker stage for checking if that works. This should still create the same output as far as I'm aware.

ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to new comers, I would like to move away from the constructor and eventually remove it. This patch takes care of the mlir side of the migration, starting with straightforward places where I see ArrayRef or ValueRange nearby. Note that ValueRange has a constructor that forwards arguments to an ArrayRef constructor.

This PR adds an MCP (Model Context Protocol ) server to LLDB. For motivation and background, please refer to the corresponding RFC: https://discourse.llvm.org/t/rfc-adding-mcp-support-to-lldb/86798 I implemented this as a new kind of plugin. The idea is that we could support multiple protocol servers (e.g. if we want to support DAP from within LLDB). This also introduces a corresponding top-level command (`protocol-server`) with two subcommands to `start` and `stop` the server. ``` (lldb) protocol-server start MCP tcp://localhost:1234 MCP server started with connection listeners: connection://[::1]:1234, connection://[127.0.0.1]:1234 ``` The MCP sever supports one tool (`lldb_command`) which executes a command, but can easily be extended with more commands.

…dexp (#145048)

…LPV node use v2f64/v4f32 types When reducing v4f64/v8f32 non-lane crossing X86ISD::VPERMV nodes, we use X86ISD::VPERMILPV nodes for 128-bits, but these are only available for fp types. Fixes #145046

…… (#145014) …types usi… (#144676)" This reverts commit 68471d2. IntegralAP contains a union: union { uint64_t *Memory = nullptr; uint64_t Val; }; On 64bit systems, both Memory and Val have the same size. However, on 32 bit system, Val is 64bit and Memory only 32bit. Which means the default initializer for Memory will only zero half of Val. We fixed this by zero-initializing Val explicitly in the IntegralAP(unsigned BitWidth) constructor. See also the discussion in llvm/llvm-project#144246

Address NFC mismatches caused by running perf2bolt from under the wrapper script: https://lab.llvm.org/buildbot/#/builders/92/builds/20938 > <stdin>:2:64: note: possible intended match here > /home/worker/bolt-worker2/bolt-x86_64-ubuntu-nfc/build/bin/llvm-bolt.old: -spe is available only on AArch64. Test Plan: ninja check-bolt

`[]<list<int>>` actually produces `list<list<int>>`.

Allow the function definition line to match with and without attrbute set number. This fixes build break after PR144534: https://lab.llvm.org/buildbot/#/builders/157/builds/31331 Also move the test to the OpenMP subdirectory where it should have been from the beginning.

Adds support to MSan for `free_sized` and `free_aligned_sized` from C23. Other sanitizers will be handled with their own separate PRs. For llvm/llvm-project#144435 Signed-off-by: Justin King <[email protected]>

…s" (#145065) Reverts llvm/llvm-project#144880 Caused `TestObjCIvarsInBlocks.py` to fail on macOS CI.

Detected by CI after #143958.

…144157) See work detail: iree-org/iree#20920 Add support for f4E2M1FN in `arith.truncf` and `arith.extf` ops though a software emulation --------- Signed-off-by: Muzammiluddin Syed <[email protected]>

…16 is enabled (#144968) Fixes SWDEV-537014.

…79R0 (#144553) https://wg21.link/P3379R0 updated the value of __cpp_lib_constrained_equality, but we forgot to update it when we implemented the paper.

This patch disables unexpected_disabled_cpp17.verify.cpp under clang modules builds because it changes diagnostics criteria post #143423, causing the test to fail. This patch follows a similar style to 853059a. This was found when working on trying to land #144033.

… (#144799) isComplete previously meant different things for different conversion directions. Refactored bytes_processed to bytes_stored which now consistently increments on every push and decrements on pop making both directions more consistent with each other

…ins (#142480) Define the __dmr1024 type used to manipulate the new DMR registers introduced by the Dense Math Facility (DMF) on PowerPC, and add six Clang builtins that correspond to the integer outer-product accumulate to ACC PowerPC instructions: * __builtin_mma_dmxvi8gerx4 * __builtin_mma_pmdmxvi8gerx4 * __builtin_mma_dmxvi8gerx4pp * __builtin_mma_pmdmxvi8gerx4pp * __builtin_mma_dmxvi8gerx4spp * __builtin_mma_pmdmxvi8gerx4spp.

…a shuffle if the concat is free (#145043)

…undefined constant fixed-length vectors (#143837) This patch adds an off-by-default flag which, when enabled via `-mllvm -msan-poison-undef-vectors=true`, fixes a false negative in MSan (partially-undefined constant fixed-length vectors). It is currently off by default since, by fixing the false positive, code/tests that previously passed MSan may start failing. The default will be changed in a future patch. Prior to this patch, MSan computes that partially-undefined constant fixed-length vectors are fully initialized, which leads to false negatives; moreover, benign vector rewriting could theoretically flip MSan's shadow computation from initialized to uninitialized or vice-versa (*). `-msan-poison-undef-vectors=true` calculates the shadow precisely: for each element of the vector, the corresponding shadow is fully uninitialized if the element is undefined/poisoned, otherwise it is fully initialized. Updates the test from llvm/llvm-project#143823 (*) For example: ``` %x = insertelement <2 x i64> <i64 0, i64 poison>, i64 42, i64 0 %y = insertelement <2 x i64> <i64 poison, i64 poison>, i64 42, i64 0 ``` %x and %y are equivalent but, prior to this patch, MSan incorrectly computes the shadow of %x as <0, 0> rather than <0, -1>.

…#144097) In Aya/Rust, BPF map definitions are nested in two nested types: * A struct representing the map type (e.g., `HashMap`, `RingBuf`) that provides methods for interacting with the map type (e.g. `HashMap::get`, `RingBuf::reserve`). * An `UnsafeCell`, which informs the Rust compiler that the type is thread-safe and can be safely mutated even as a global variable. The kernel guarantees map operation safety. This leads to a type hierarchy like: ```rust pub struct HashMap<K, V, const M: usize, const F: usize = 0>( core::cell::UnsafeCell<HashMapDef<K, V, M, F>>, ); const BPF_MAP_TYPE_HASH: usize = 1; pub struct HashMapDef<K, V, const M: usize, const F: usize = 0> { r#type: *const [i32; BPF_MAP_TYPE_HASH], key: *const K, value: *const V, max_entries: *const [i32; M], map_flags: *const [i32; F], } ``` Then used in the BPF program code as a global variable: ```rust #[link_section = ".maps"] static HASH_MAP: HashMap<u32, u32, 1337> = HashMap::new(); ``` Which is an equivalent of the following BPF map definition in C: ```c #define BPF_MAP_TYPE_HASH 1 struct { int (*type)[BPF_MAP_TYPE_HASH]; typeof(int) *key; typeof(int) *value; int (*max_entries)[1337]; } map_1 __attribute__((section(".maps"))); ``` Accessing the actual map definition requires traversing: ``` HASH_MAP -> __0 -> value ``` Previously, the BPF backend only visited the pointee types of the outermost struct, and didn’t descend into inner wrappers. This caused issues when the key/value types were custom structs: ```rust // Define custom structs for key and values. pub struct MyKey(u32); pub struct MyValue(u32); #[link_section = ".maps"] #[export_name = "HASH_MAP"] pub static HASH_MAP: HashMap<MyKey, MyValue, 10> = HashMap::new(); ``` These types weren’t fully visited and appeared in BTF as forward declarations: ``` #30: <FWD> 'MyKey' kind:struct #31: <FWD> 'MyValue' kind:struct ``` The fix is to enhance `visitMapDefType` to recursively visit inner composite members. If a member is a composite type (likely a wrapper), it is now also visited using `visitMapDefType`, ensuring that the pointee types of the innermost stuct members, like `MyKey` and `MyValue`, are fully resolved in BTF. With this fix, the correct BTF entries are emitted: ``` #6: <STRUCT> 'MyKey' sz:4 n:1 #00 '__0' off:0 --> [7] #7: <INT> 'u32' bits:32 off:0 #8: <PTR> --> [9] #9: <STRUCT> 'MyValue' sz:4 n:1 #00 '__0' off:0 --> [7] ``` Fixes: #143361

CONFLICT (content): Merge conflict in clang/lib/Driver/ToolChains/Clang.cpp

…142687) Changes: * Decouple layout propagation from subgroup distribution and move it to an independent pass. * Refine layout assignment to handle control-flow ops correctly (scf.for, scf.while). * Refine test cases.

… (#144653) This allows us to better track the narrow type we need and to fix miscompiles if f16->f32 and bf16->f32 extends are mixed. Fixes #144651.

This tries to be a bit smarter for the OLD behaviour of CMP0116, to glob more relevant directories looking for possible dependencies. The changes are: - Remove some duplication of lines in the `tablegen` function. - Put CURRENT_SOURCE_DIR into `tblgen_includes` (at the front) - Glob all directories in `tblgen_includes` - Give up on `local_tds` which was wrong when using tablegen to compile a file in a different directory (as TargetParser does) - Use `EXTRA_INCLUDES` in TargetParser `tablegen` calls. This is still an under-approximation of what might be included, at least comparing the RISCVTargetParserDef.inc.d (after building `target_parser_gen`), and the list of deps in the ninja file when explicitly setting CMP0116 to OLD. Fixes #144639

Fix: Clang::CodeGenSYCL/spirv-builtins-addr-space.cpp

jsji · 2025-07-19T18:54:13Z

AlexeySachkov · 2025-07-21T14:56:09Z

Remove duplicated code -- fix new-pm tests failure @intel/dpcpp-tools-reviewers

This one looks good to me, I assume that there was some merge conflict issue which caused the duplication.

Remove case using reverted function @intel/dpcpp-spirv-reviewers

I assume that this is once again some form of cleanup after merge, because we reverted those lines in #17192. If so, then its fine by me as well

jsji · 2025-07-21T14:57:56Z

@intel/llvm-gatekeepers I think this is ready for merge. The XFAILS in Unxfail HIP tests, xfail more tests @intel/dpcpp-cfe-reviewers @intel/llvm-reviewers-cuda should be followed up in issues.

sarnex · 2025-07-21T14:58:42Z

/merge

bb-sycl · 2025-07-21T14:59:14Z

Mon 21 Jul 2025 02:59:13 PM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2025-07-21T15:09:54Z

Mon 21 Jul 2025 03:09:54 PM UTC --- Merge the branch in this PR to base automatically. Will close the PR later.

j2kun and others added 30 commits June 20, 2025 08:16

[DAG] foldMaskedMerge - fix Wparentheses operator precedence warning.…

c734377

… NFC.

[gn build] Port 9524bfb

7897191

[AMDGPU] Do not replace SALU floating point multiply with VALU-only l…

6e86b7e

…dexp (#145048)

[lldb] Fix ASCII art in CommandObjectProtocolServer (NFC)

749e4a5

[X86] SimplifyDemandedVectorEltsForTargetNode - ensure X86ISD::VPERMI…

151ee0f

…LPV node use v2f64/v4f32 types When reducing v4f64/v8f32 non-lane crossing X86ISD::VPERMV nodes, we use X86ISD::VPERMILPV nodes for 128-bits, but these are only available for fp types. Fixes #145046

[TableGen][Docs] Fix empty list syntax in TableGen doc. (#145041)

72de0e4

`[]<list<int>>` actually produces `list<list<int>>`.

msan: Support free_sized and free_aligned_sized from C23 (#144529)

bfef873

Adds support to MSan for `free_sized` and `free_aligned_sized` from C23. Other sanitizers will be handled with their own separate PRs. For llvm/llvm-project#144435 Signed-off-by: Justin King <[email protected]>

Revert "[lldb][DWARF] Remove object_pointer from ParsedDWARFAttribute…

8775119

…s" (#145065) Reverts llvm/llvm-project#144880 Caused `TestObjCIvarsInBlocks.py` to fail on macOS CI.

[InstCombine] fix hwasan mistake in "remove dead loads" (#145057)

940ff11

Detected by CI after #143958.

[mlir][arith][transforms] Adds f4E2M1FN support to truncf and extf (#…

379a609

…144157) See work detail: iree-org/iree#20920 Add support for f4E2M1FN in `arith.truncf` and `arith.extf` ops though a software emulation --------- Signed-off-by: Muzammiluddin Syed <[email protected]>

[AMDGPU] Fix a potential integer overflow in GCNRegPressure when true…

edbaf19

…16 is enabled (#144968) Fixes SWDEV-537014.

[libc++] Update the value of __cpp_lib_constrained_equality after P33…

945ce1a

…79R0 (#144553) https://wg21.link/P3379R0 updated the value of __cpp_lib_constrained_equality, but we forgot to update it when we implemented the paper.

[AMDGPU] Add another test showing unwanted VALU codegen (#145062)

6ddb3a6

[X86] combineConcatVectorOps - only concat AVX1 v4i64 shift-by-32 to …

f8ee577

…a shuffle if the concat is free (#145043)

Merge from 'main' to 'sycl-web' (133 commits)

f472adb

CONFLICT (content): Merge conflict in clang/lib/Driver/ToolChains/Clang.cpp

[RISCV] Treat bf16->f32 as separate ExtKind in combineOp_VLToVWOp_VL.…

04e2e58

… (#144653) This allows us to better track the narrow type we need and to fix miscompiles if f16->f32 and bf16->f32 extends are mixed. Fixes #144651.

mikolaj-pirog temporarily deployed to WindowsCILock July 17, 2025 18:34 — with GitHub Actions Inactive

mikolaj-pirog temporarily deployed to WindowsCILock July 17, 2025 18:35 — with GitHub Actions Inactive

Merge remote-tracking branch 'origin/sycl' into llvmspirv_pulldown

31f4aef

mikolaj-pirog force-pushed the llvmspirv_pulldown branch from e46e614 to 31f4aef Compare July 18, 2025 09:10

mikolaj-pirog temporarily deployed to WindowsCILock July 18, 2025 09:10 — with GitHub Actions Inactive

mikolaj-pirog temporarily deployed to WindowsCILock July 18, 2025 10:01 — with GitHub Actions Inactive

mikolaj-pirog had a problem deploying to WindowsCILock July 18, 2025 10:37 — with GitHub Actions Failure

mikolaj-pirog temporarily deployed to WindowsCILock July 18, 2025 10:37 — with GitHub Actions Inactive

jsji had a problem deploying to WindowsCILock July 19, 2025 16:32 — with GitHub Actions Failure

jsji temporarily deployed to WindowsCILock July 19, 2025 16:59 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock July 19, 2025 17:15 — with GitHub Actions Inactive

[NFC][Clang] Add triple to spirv-builtins-addr-space.cpp (#30183)

3e8ebcd

Fix: Clang::CodeGenSYCL/spirv-builtins-addr-space.cpp

jsji force-pushed the llvmspirv_pulldown branch from de67b57 to 3e8ebcd Compare July 19, 2025 17:31

jsji temporarily deployed to WindowsCILock July 19, 2025 17:31 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock July 19, 2025 17:32 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock July 19, 2025 18:07 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock July 19, 2025 18:10 — with GitHub Actions Inactive

bb-sycl approved these changes Jul 21, 2025

View reviewed changes

bb-sycl merged commit bab814a into sycl Jul 21, 2025
67 of 68 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW27 2025) #19233

LLVM and SPIRV-LLVM-Translator pulldown (WW27 2025) #19233

Uh oh!

iclsrc commented Jul 1, 2025

Uh oh!

jsji commented Jul 19, 2025 •

edited

Loading

Uh oh!

AlexeySachkov commented Jul 21, 2025

Uh oh!

jsji commented Jul 21, 2025

Uh oh!

sarnex commented Jul 21, 2025

Uh oh!

bb-sycl commented Jul 21, 2025

Uh oh!

bb-sycl commented Jul 21, 2025

Uh oh!

Uh oh!

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW27 2025) #19233

LLVM and SPIRV-LLVM-Translator pulldown (WW27 2025) #19233

Uh oh!

Conversation

iclsrc commented Jul 1, 2025

Uh oh!

jsji commented Jul 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexeySachkov commented Jul 21, 2025

Uh oh!

jsji commented Jul 21, 2025

Uh oh!

sarnex commented Jul 21, 2025

Uh oh!

bb-sycl commented Jul 21, 2025

Uh oh!

bb-sycl commented Jul 21, 2025

Uh oh!

Uh oh!

Uh oh!

jsji commented Jul 19, 2025 •

edited

Loading