You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Alternate OpenCL support via the CLBlast Netlib BLAS API
Experimental alternate OpenCL support via the CLBlast Netlib BLAS API. The performance is quite similar to the CLBlast optimized implementation when tested on the same low end / old AMD A9 APU.
CLBlast needs to be compiled with ```-DNETLIB=ON``` flag.
Rationale: Support More Hardware. This is meant to be used as a last resort for GPU acceleration when other methods don't work or are not compatible. Since OpenCL 1.x EMBEDDED PROFILE is supported, I anticipate that this could enable acceleration on Single Board Computers and Smart Phones.
Also serves as a template for pre-emptive OpenCL support for projects that use ggml. This could provide baseline GPU acceleration without custom OpenCL code or added effort due to CLBlast being a drop in BLAS with the Netlib API enabled.
More Info:
https://github.com/CNugteren/CLBlast/blob/master/doc/bindings.mdCNugteren/CLBlast#227
Usage:
```
Makefile:
cd whisper.cpp
WHISPER_CLBLAST_NETLIB=1 make
CMake:
cd whisper.cpp ; mkdir build ; cd build
cmake -DWHISPER_CLBLAST_NETLIB=ON ..
make
```
0 commit comments