Conversation

@LuFinch LuFinch commented Sep 11, 2025

This is a draft PR to enable the SYCL-TLA build in torch-xpu-ops so that we can test SYCL-TLA kernels' accuracy and performance in PyTorch once the SDPA/GEMM kernels are ready.

After discussion with Eikan, we decided to put the build logic in torch-xpu-ops while keeping the kernel source code in-tree in PyTorch. Please put your SYCL-TLA kernel source code in PyTorch and add its path to ATen_XPU_SYCLTLA_SRCS in torch-xpu-ops/src/ATen/CMakeLists.txt (a sketch follows).
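
For illustration, a minimal sketch of what registering a kernel source might look like in that file; the kernel path and the `PYTORCH_ROOT` variable are assumptions for the example, not names taken from this PR:

```cmake
# Hypothetical example: appending an in-tree PyTorch kernel source to the
# SYCL-TLA source list. Path and variable names are illustrative only.
list(APPEND ATen_XPU_SYCLTLA_SRCS
  ${PYTORCH_ROOT}/aten/src/ATen/native/xpu/sycltla/GemmKernel.cpp)
```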

Since SYCL-TLA requires different compilation options than the normal SYCL kernels in torch-xpu-ops, I refactored the logic in cmake/BuildFlags.cmake into a macro so that the common compilation options can be reused.
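
The idea, roughly; the macro, variable, and flag names below are illustrative, not the exact ones in BuildFlags.cmake:

```cmake
# Illustrative sketch of the macro pattern: collect the shared SYCL options
# once, then let each kernel class append its own.
macro(setup_common_sycl_flags flags_var)
  # Options shared by all SYCL compilation in torch-xpu-ops (example flags).
  list(APPEND ${flags_var} -fsycl -fsycl-targets=spir64)
endmacro()

# Normal SYCL kernels use the common options as-is.
setup_common_sycl_flags(SYCL_KERNEL_FLAGS)

# SYCL-TLA kernels reuse the common options, then append their own.
setup_common_sycl_flags(SYCLTLA_KERNEL_FLAGS)
list(APPEND SYCLTLA_KERNEL_FLAGS -std=c++17)
```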

Since there is no settled plan yet for how to import the sycl-tla repo, I git clone the main branch in CMake for debugging convenience. We can pin a commit once sycl-tla has its first release tag.
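
A configure-time clone like this can be done with FetchContent; a minimal sketch, where the repository URL is an assumption to be replaced with the actual remote:

```cmake
include(FetchContent)
FetchContent_Declare(
  sycl-tla
  GIT_REPOSITORY https://github.com/intel/sycl-tla.git # assumed URL
  GIT_TAG main # to be pinned to a commit or release tag later
)
FetchContent_MakeAvailable(sycl-tla)
```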

This depends on upgrading g++ to GCC 13; otherwise the sycltla kernels won't build.

@LuFinch LuFinch force-pushed the lfq/cutlass branch 6 times, most recently from aa10375 to e9800af on October 20, 2025 07:51
@LuFinch LuFinch changed the title from "[Cutlass] Enable Cutlass with host compiler" to "[SYCL-TLA] Enable SYCL-TLA build with host compiler" on Oct 20, 2025
@LuFinch LuFinch marked this pull request as ready for review October 21, 2025 02:33


@LuFinch LuFinch requested a review from EikanWang October 21, 2025 02:34
@LuFinch LuFinch changed the title from "[SYCL-TLA] Enable SYCL-TLA build with host compiler" to "[SYCL-TLA] Enable SYCL-TLA build" on Oct 21, 2025
LuFinch commented Oct 22, 2025

@fengyuan14 @EikanWang Could you help review and leave some comments?

LuFinch commented Nov 13, 2025

@EikanWang @guangyey Can we merge this PR?

@guangyey guangyey commented

Of course, let's move forward. We can apply a minor fix if needed.

@guangyey guangyey enabled auto-merge November 13, 2025 05:44
@guangyey guangyey disabled auto-merge November 13, 2025 05:45
@guangyey guangyey enabled auto-merge November 13, 2025 05:45
@guangyey guangyey requested a review from Copilot November 13, 2025 05:48
@guangyey guangyey disabled auto-merge November 13, 2025 05:48
Copilot AI left a comment

Pull Request Overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (3)

cmake/BuildFlags.cmake:1

  • Removed line that defines __INTEL_LLVM_COMPILER_VERSION. This flag may be required by code that checks the compiler version. If this removal is intentional, ensure that no code depends on this definition.
# Setup building flags for SYCL device and host codes.

cmake/BuildFlags.cmake:1

  • This comment line was removed but the related code logic immediately following it remains unchanged. The comment should be preserved to document the FP64 conversion emulation logic for DG2/ATS-M targets.
# Setup building flags for SYCL device and host codes.

cmake/BuildFlags.cmake:1

  • Removed initialization of SYCL_flags variable. This variable is no longer used after renaming to SYCL_COMPILE_FLAGS, but verify that no other code depends on SYCL_flags being defined.
# Setup building flags for SYCL device and host codes.


@guangyey guangyey enabled auto-merge November 13, 2025 05:49
@guangyey guangyey added this pull request to the merge queue Nov 13, 2025
Merged via the queue into main with commit 8384acf Nov 13, 2025
34 of 35 checks passed
@guangyey guangyey deleted the lfq/cutlass branch November 13, 2025 05:49
github-merge-queue bot pushed a commit that referenced this pull request Nov 26, 2025
This PR moves the sycltla kernels in
pytorch/pytorch#167056 into torch-xpu-ops.

This PR is based on #2030. Once the build PR merges, I will rebase this PR.