Skip to content

Commit 96a712c

Browse files
LostRuins0cc4m
andauthored
Porting the improved K-Quant CUDA kernels to OpenCL (#1966)
* Added broken new q4k quant * xx + ib0 * Fix q2_k fast kernel * Use preprocessor for QK_K * Add q6_k fast matmul kernel * ported q3k speedup successfully * ported q2k and q5k speedups * remove old dot kernels and template * fixed global const struct types * fixing address spaces * fixed string too long CI issue --------- Co-authored-by: 0cc4m <[email protected]>
1 parent d3494bb commit 96a712c

File tree

1 file changed

+352
-175
lines changed

1 file changed

+352
-175
lines changed

0 commit comments

Comments
 (0)