[SYCL] Use batched mul_mat pathway #5591

AidanBeltonS · 2024-02-19T16:48:34Z

This PR enables the batched mul_mat pathway where appropriate. Previously the single GEMM path was taken, which was not suitable for this type of operation and caused segfaults. This PR changes the dispatch to more closely match the CUDA implementation and use the batched GEMM path.

This change allows many more tests to pass on SYCL devices. There is one limitation with this approach: we cannot use non-default precision operations, because oneMKL has not yet open-sourced gemm_batch for the data types <half, half, float, float> (corresponding to <src0, src1, dst, scaling>). I have raised this with oneMKL.
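For readers unfamiliar with the term, here is a minimal reference sketch of what a strided batched GEMM computes. It is not the oneMKL call or the code in ggml-sycl.cpp: the function name `gemm_batch_ref` and its parameter layout are hypothetical, it assumes row-major matrices with no transposes, and it uses float throughout rather than the <half, half, float, float> combination discussed above.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reference implementation (illustration only).
// For every batch index i:  C[i] = alpha * A[i] * B[i] + beta * C[i]
// A[i] is m x k, B[i] is k x n, C[i] is m x n, all row-major.
// Matrix i of each operand starts at base pointer + i * stride.
void gemm_batch_ref(std::size_t m, std::size_t n, std::size_t k,
                    float alpha,
                    const float *a, std::size_t stride_a,
                    const float *b, std::size_t stride_b,
                    float beta,
                    float *c, std::size_t stride_c,
                    std::size_t batch) {
    assert(a != nullptr && b != nullptr && c != nullptr);
    for (std::size_t i = 0; i < batch; ++i) {
        const float *a_i = a + i * stride_a;
        const float *b_i = b + i * stride_b;
        float *c_i = c + i * stride_c;
        for (std::size_t row = 0; row < m; ++row) {
            for (std::size_t col = 0; col < n; ++col) {
                float acc = 0.0f;
                for (std::size_t p = 0; p < k; ++p) {
                    acc += a_i[row * k + p] * b_i[p * n + col];
                }
                c_i[row * n + col] = alpha * acc + beta * c_i[row * n + col];
            }
        }
    }
}
```

A single-GEMM call handles only one such matrix product; the batched form expresses all of them in one call, which is the shape of work the batched mul_mat path hands to the library.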

AidanBeltonS · 2024-02-19T16:49:01Z

@NeoZhangJianyu, @abhilash1910, @Alcpz, feedback would be appreciated

ggml-sycl.cpp

abhilash1910

LGTM. I think we can use this until MKL adds the dtypes for batched gemm. Pinging @airMeng @ggerganov for a look when available.
@AidanBeltonS could you please rebase? That should fix the Android build issue. Thanks.

* Use batched mul_mat pathway
* rm extra line
* Explicitly state scaled data type

---------

Co-authored-by: Abhilash Majumder <[email protected]>

This was referenced Feb 19, 2024

Missing gemm_batch data types uxlfoundation/oneMath#446

Open

[SYCL] Add support for SYCL Nvidia target #5738

Merged

abhilash1910 reviewed Feb 27, 2024


abhilash1910 approved these changes Feb 29, 2024


Aidan and others added 3 commits February 29, 2024 11:52

Use batched mul_mat pathway

1b3c1fe

rm extra line

b2aaee3

Explicitly state scaled data type

abed262

AidanBeltonS force-pushed the fix_mul_mat_pathways branch from cb21f6c to abed262 Compare February 29, 2024 11:54

abhilash1910 merged commit 38d1521 into ggml-org:master Mar 1, 2024

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

[SYCL] Use batched mul_mat pathway (ggml-org#5591)

c2308ed

* Use batched mul_mat pathway
* rm extra line
* Explicitly state scaled data type

---------

Co-authored-by: Abhilash Majumder <[email protected]>
