Commit 96a712c

and

authored

Porting the improved K-Quant CUDA kernels to OpenCL (#1966)

* Added broken new q4k quant * xx + ib0 * Fix q2_k fast kernel * Use preprocessor for QK_K * Add q6_k fast matmul kernel * ported q3k speedup successfully * ported q2k and q5k speedups * remove old dot kernels and template * fixed global const struct types * fixing address spaces * fixed string too long CI issue --------- Co-authored-by: 0cc4m <[email protected]>

1 parent d3494bb commit 96a712cCopy full SHA for 96a712c

1 file changed

+352

-175

lines changed

ggml-opencl.cpp

1 file changed

+352

-175

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit 96a712c

1 file changed

1 file changed

File tree

1 file changed

1 file changed

0 commit comments