Commit 8e67d2d

Update sycl read-me for Nvidia target

Author: Aidan
1 parent 0231999, commit 8e67d2d

1 file changed: +26 -0 lines changed
README-sycl.md

Lines changed: 26 additions & 0 deletions
@@ -57,6 +57,29 @@ For iGPU, please make sure the shared memory from host memory is enough. For lla
 
 For dGPU, please make sure the device memory is enough. For llama-2-7b.Q4_0, recommend the device memory is 4GB+.
 
+## Nvidia GPU
+
+### Verified
+
+|Intel GPU| Status | Verified Model|
+|-|-|-|
+|Ampere Series| Support| A100|
+
+### oneMKL
+
+The current oneMKL release does not contain the oneMKL cuBlas backend.
+As a result for Nvidia GPU's oneMKL must be built from source.
+
+```
+git clone https://github.com/oneapi-src/oneMKL
+cd oneMKL
+mkdir build
+cd build
+cmake -G Ninja .. -DCMAKE_CXX_COMPILER=icpx -DCMAKE_C_COMPILER=icx -DENABLE_MKLGPU_BACKEND=OFF -DENABLE_MKLCPU_BACKEND=OFF -DENABLE_CUBLAS_BACKEND=ON
+ninja
+// Add paths as necessary
+```
+
 ## Docker
 
 Note:
@@ -170,6 +193,9 @@ source /opt/intel/oneapi/setvars.sh
 # Or, for FP32:
 cmake .. -DLLAMA_SYCL=ON -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx
 
+# For Nvidia GPUs
+cmake .. -DLLAMA_SYCL=ON -DLLAMA_SYCL_TARGET=NVIDIA -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx
+
 # Build example/main only
 #cmake --build . --config Release --target main
 
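
The added oneMKL block ends with `// Add paths as necessary` and does not say which paths. The following is a minimal sketch of one possible reading, not part of the commit. It assumes that oneMKL was cloned to `$HOME/oneMKL`, that the build places its libraries under `build/lib`, and that headers are found under `include` and `build/include`. The `ONEMKL_ROOT` variable and the `prepend_path` helper are hypothetical names introduced only for this illustration.

```shell
# Hypothetical location of the oneMKL checkout; adjust to your own clone.
ONEMKL_ROOT="${ONEMKL_ROOT:-$HOME/oneMKL}"

# Prepend a directory to a colon-separated path variable and export it.
# $1 = variable name, $2 = directory to prepend.
prepend_path() {
    eval "current=\${$1:-}"
    if [ -n "$current" ]; then
        eval "export $1=\"\$2:\$current\""
    else
        eval "export $1=\"\$2\""
    fi
}

# Assumed layout: built libraries in build/lib, headers in include and build/include.
prepend_path LD_LIBRARY_PATH "$ONEMKL_ROOT/build/lib"
prepend_path LIBRARY_PATH "$ONEMKL_ROOT/build/lib"
prepend_path CPLUS_INCLUDE_PATH "$ONEMKL_ROOT/include"
prepend_path CPLUS_INCLUDE_PATH "$ONEMKL_ROOT/build/include"
```

If the assumed layout matches your build, running these lines in the same shell before the `-DLLAMA_SYCL_TARGET=NVIDIA` configure step should let the compiler and the runtime loader locate the locally built oneMKL. Check the directories against your own build tree first.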
