[Driver][SYCL] Improve integration header usage when performing offload #3471

mdtoguchi · 2021-04-02T00:57:49Z

When generating the integration header, we were performing an additional
compilation step which is not needed, as we can create the output file
and the integration header at the same time.

Improve this behavior by tracking the integration files based on given
input files, creating and using the files accordingly when needed during
the compilation flow.
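
The tracking described above can be pictured as a small table keyed by input file. This is a minimal sketch and not the driver code from this PR: `IntegrationFileTable`, `headerFor`, and the `-header.h` suffix are hypothetical names chosen for illustration. The point it shows is that the header name is decided once per input, so the step that writes the header and the step that consumes it agree on the file without an extra compilation.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>

// Hypothetical sketch: remember one integration header name per input file.
class IntegrationFileTable {
public:
  // Returns the header name for Input, creating the entry on first use.
  const std::string &headerFor(const std::string &Input) {
    auto It = Files.find(Input);
    if (It == Files.end())
      It = Files.emplace(Input, makeName(Input)).first;
    return It->second;
  }

  std::size_t size() const { return Files.size(); }

private:
  // Illustrative naming scheme only: drop the extension, append a suffix.
  static std::string makeName(const std::string &Input) {
    std::string::size_type Dot = Input.rfind('.');
    return Input.substr(0, Dot) + "-header.h";
  }

  std::map<std::string, std::string> Files;
};

// Both the device step and the host step ask for the same input and
// receive the same header name; only one entry is ever created.
void exampleTableUsage() {
  IntegrationFileTable Table;
  const std::string &ForDevice = Table.headerFor("a.cpp");
  const std::string &ForHost = Table.headerFor("a.cpp");
  assert(ForDevice == ForHost);
  assert(Table.size() == 1);
}
```

Keying the table by input file is what lets a multi-file compilation keep a separate header per translation unit.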

bader · 2021-04-02T06:28:50Z

@mdtoguchi, great improvement!
By coincidence, we recently discussed with @Naghasan and @keryell which approach should be upstreamed to llorg.
See these two threads: https://reviews.llvm.org/D99190?id=333780#inline-935611 and https://reviews.llvm.org/D99190?id=333780#inline-935593.

In a nutshell, @Naghasan suggests we consider using a host compiler to emit the data from the integration header. OpenMP-offload uses such an approach for binding offloaded code with the host application. I assume the CUDA and HIP compilers are doing something similar. I was going to check whether we can apply this approach to SYCL and gather the pros and cons of both approaches.
This patch eliminates one of the main disadvantages of the integration header approach, but we should probably check with the community whether this approach can be accepted to llorg.

Tagging @erichkeane and @AaronBallman.

Naghasan · 2021-04-02T11:56:18Z

clang/lib/Driver/Driver.cpp

@@ -5113,6 +5107,21 @@ void Driver::BuildActions(Compilation &C, DerivedArgList &Args,

  handleArguments(C, Args, Inputs, Actions);

+  // When compiling for -fsycl, generate the integration header files that
+  // will be used during the compilation.
+  if (Args.hasFlag(options::OPT_fsycl, options::OPT_fno_sycl, false)) {


Shouldn't this also work when using -fsycl-device-only?

mdtoguchi · 2021-04-02

-fsycl-device-only doesn't perform the host compilation, so the generated integration header would not be used. Unless there is a use model for it, I don't think it's necessary.
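
The condition in the hunk above relies on the "last one wins" behavior of a positive/negative flag pair with a default of `false`. The sketch below imitates that behavior over plain strings to show why `-fsycl-device-only` on its own would not satisfy it. The helper names `lastFlagWins` and `wantsIntegrationHeader` are hypothetical, and the string matching is a simplification of real option parsing.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Simplified stand-in for a positive/negative flag query: the last
// occurrence of either spelling decides; Default applies if neither appears.
bool lastFlagWins(const std::vector<std::string> &Args, const std::string &Pos,
                  const std::string &Neg, bool Default) {
  bool Result = Default;
  for (const std::string &A : Args) {
    if (A == Pos)
      Result = true;
    else if (A == Neg)
      Result = false;
  }
  return Result;
}

// Hypothetical helper mirroring the condition in the diff: integration
// headers are set up only when -fsycl is in effect.
bool wantsIntegrationHeader(const std::vector<std::string> &Args) {
  return lastFlagWins(Args, "-fsycl", "-fno-sycl", false);
}

void exampleFlagUsage() {
  assert(wantsIntegrationHeader({"-fsycl"}));
  assert(!wantsIntegrationHeader({"-fsycl", "-fno-sycl"}));
  // In this sketch, device-only on its own does not match the -fsycl pair.
  assert(!wantsIntegrationHeader({"-fsycl-device-only"}));
}
```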

clang/include/clang/Driver/Driver.h

clang/lib/Driver/ToolChains/Clang.cpp

erichkeane · 2021-04-02T12:51:02Z

> @mdtoguchi, great improvement!
> By coincidence, we recently discussed with @Naghasan and @keryell which approach should be upstreamed to llorg.
> See these two threads: https://reviews.llvm.org/D99190?id=333780#inline-935611 and https://reviews.llvm.org/D99190?id=333780#inline-935593.
>
> In a nutshell, @Naghasan suggests we consider using a host compiler to emit the data from the integration header. OpenMP-offload uses such an approach for binding offloaded code with the host application. I assume the CUDA and HIP compilers are doing something similar. I was going to check whether we can apply this approach to SYCL and gather the pros and cons of both approaches.
> This patch eliminates one of the main disadvantages of the integration header approach, but we should probably check with the community whether this approach can be accepted to llorg.
>
> Tagging @erichkeane and @AaronBallman.

I don't think we can do it in host mode for 2 reasons:
1- the information about the actual kernels themselves is only guaranteed to be present during device compile
2- We don't necessarily control the host compiler. The SYCL design permits arbitrary host compilers, so doing this during integration header isn't particularly possible.

bader · 2021-04-02T13:00:25Z

I created #3474 to discuss the approach for upstreaming to llorg.

Naghasan · 2021-04-02T13:07:20Z

> I don't think we can do it in host mode for 2 reasons:
> 1- the information about the actual kernels themselves is only guaranteed to be present during device compile

Well, if they are missing during the host compilation, I don't see how you can invoke them.

> 2- We don't necessarily control the host compiler. The SYCL design permits arbitrary host compilers, so doing this during integration header isn't particularly possible.

The design permits this but doesn't mandate it; this is an implementation design choice, not a spec requirement. The 1.2.1 spec used to describe a single-pass compiler as an alternative to the multi-pass approach, BTW.

erichkeane · 2021-04-02T13:09:32Z

> > I don't think we can do it in host mode for 2 reasons:
> > 1- the information about the actual kernels themselves is only guaranteed to be present during device compile
>
> Well, if they are missing during the host compilation, I don't see how you can invoke them.
>
> > 2- We don't necessarily control the host compiler. The SYCL design permits arbitrary host compilers, so doing this during integration header isn't particularly possible.
>
> The design permits this but doesn't mandate it; this is an implementation design choice, not a spec requirement. The 1.2.1 spec used to describe a single-pass compiler as an alternative to the multi-pass approach, BTW.

The whole purpose of the integration header is to provide information to the host compiler that is only available at device compilation though.

As far as a single-pass compiler, great! But we don't have one. As far as a separate host compiler, our implementation has that requirement.
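
To make the role of the integration header concrete, here is a minimal sketch of the idea: the device compilation emits a header describing each kernel, and an arbitrary host compiler picks that information up by including it. The `KernelInfo` template, its members, and the name string are hypothetical and simplified for illustration; they are not the exact contents of the header this implementation generates.

```cpp
#include <cassert>
#include <cstring>

// --- Contents a generated integration header might have (hypothetical) ---
class MyKernel; // forward declaration of the kernel name type

// Primary template is left undefined: only kernels seen during device
// compilation get a specialization.
template <typename KernelName> struct KernelInfo;

template <> struct KernelInfo<MyKernel> {
  // Illustrative values standing in for data known only to the device pass.
  static constexpr const char *name() { return "_ZTS8MyKernel"; }
  static constexpr unsigned numParams() { return 2; }
};

// --- Host-side code, compiled by any C++ compiler that includes the header ---
template <typename KernelName> const char *lookupKernelName() {
  return KernelInfo<KernelName>::name();
}

void exampleHeaderUsage() {
  // The host learns the kernel's name without analyzing the kernel itself.
  assert(std::strcmp(lookupKernelName<MyKernel>(), "_ZTS8MyKernel") == 0);
}
```

Because the host side only needs ordinary templates and constants, nothing here depends on the host compiler knowing about SYCL kernels.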

AGindinson

I totally agree with @erichkeane on the subject of the host-compiler-based strategy: emitting the header during the device FE compilation is the only viable approach if we're to adhere to the "custom host compiler" feature requirement. Not to mention that the chosen approach is the simpler one. In a nutshell, I believe that if there is no strong community pushback against device-side emission of the integration header, we should stick to what this PR currently provides.

Overall, the changes look good to me: much cleaner and far less hacky than I personally expected from the filename-mapping approach. I'll need to spend some more time reading the code itself, and will hit "Approve" right after that iteration.

P. S. Coming back from a vacation and seeing an important solution like this one adds to the positive review experience :)

clang/lib/Driver/ToolChains/Clang.cpp

clang/include/clang/Driver/Driver.h

AGindinson

The implementation LGTM!

clang/test/Driver/sycl-offload-with-split.c

clang/test/Driver/sycl-offload.c

bader · 2021-04-07T08:21:21Z

@vladimirlaz, @intel/llvm-reviewers-runtime, ping.

mdtoguchi requested a review from AGindinson as a code owner April 2, 2021 00:57

Naghasan reviewed Apr 2, 2021

bader mentioned this pull request Apr 2, 2021

Host/Device interfaces: integration header vs host compiler. #3474

Open

Address review comment and update failing tests

2bae7b4

stdcpp_compat.cpp was malformed with the options used.
layout_accessors_host.cpp was expecting output from 3 compilations, which is now down to 2.

mdtoguchi requested a review from a team as a code owner April 3, 2021 00:51

mdtoguchi requested a review from vladimirlaz April 3, 2021 00:51

Merge remote-tracking branch 'otcshare_llvm/sycl' into int-header-per…

9e5c4a4

…f-fix

AGindinson reviewed Apr 5, 2021

clang/lib/Driver/ToolChains/Clang.cpp

AaronBallman reviewed Apr 5, 2021

clang/include/clang/Driver/Driver.h

AlexeySachkov reviewed Apr 5, 2021

clang/include/clang/Driver/Driver.h Outdated

Address review comments using StringRef

b4ae4ce

AGindinson previously approved these changes Apr 6, 2021

clang/test/Driver/sycl-offload-with-split.c Outdated

clang/test/Driver/sycl-offload.c Outdated

clang/test/Driver/sycl-offload.c Outdated

clang/test/Driver/sycl-offload.c

Address review comments for test improvement

d26d93a

mdtoguchi dismissed AGindinson’s stale review via d26d93a April 6, 2021 16:48

AGindinson approved these changes Apr 7, 2021

vladimirlaz approved these changes Apr 7, 2021

bader merged commit f110dd4 into intel:sycl Apr 7, 2021

AlexeySachkov mentioned this pull request May 20, 2021

SYCL Driver unable to support multiple outputs for a single triple #1382

Closed
