-
Notifications
You must be signed in to change notification settings - Fork 61
[SYCL-TLA] Enable SYCL-TLA build #2030
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
8b015d5 to
4ae8003
Compare
aa10375 to
e9800af
Compare
|
@fengyuan14 @EikanWang Could you help review and give some comments? |
|
@EikanWang @guangyey Can we merge this PR? |
|
Of course, let's move forward. We can apply a minor fix if needed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.
Comments suppressed due to low confidence (3)
cmake/BuildFlags.cmake:1
- Removed line that defines __INTEL_LLVM_COMPILER_VERSION. This flag may be required by code that checks the compiler version. If this removal is intentional, ensure that no code depends on this definition.
# Setup building flags for SYCL device and host codes.
cmake/BuildFlags.cmake:1
- This comment line was removed but the related code logic immediately following it remains unchanged. The comment should be preserved to document the FP64 conversion emulation logic for DG2/ATS-M targets.
# Setup building flags for SYCL device and host codes.
cmake/BuildFlags.cmake:1
- Removed initialization of SYCL_flags variable. This variable is no longer used after renaming to SYCL_COMPILE_FLAGS, but verify that no other code depends on SYCL_flags being defined.
# Setup building flags for SYCL device and host codes.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
This PR moves the sycltla kernels in pytorch/pytorch#167056 into torch-xpu-ops. This PR is based on #2030. When the build PR merge, I will rebase this PR.
This is a draft PR to enable SYCL-TLA build in torch-xpu-ops so that we can test SYCL-TLA kernels' accuracy/performance in Pytorch when SDPA/GEMM kernels are ready.
After discussion with Eikan, we decided to put build logic in torch-xpu-ops while put kernels source code in Pytorch in-tree. Please put your SYCL-TLA kernel source code in Pytorch and set its path as part of
ATen_XPU_SYCLTLA_SRCSintorch-xpu-ops/src/ATen/CMakeLists.txt.Since SYCL-TLA has different compilation options compared with normal SYCL kernels in torch-xpu-ops, I make the logic in
cmake/BuildFlags.cmakeas a macro so that I can reuse the common compilation options.Since there is not a determined plan of how to import sycl-tla repo, I git clone the main branch in cmake for debug convinence. We can pin commit after sycl-tla has first release tag
Depend on g++ upgrading to gcc13, otherwise the sycltla kernel won't build