Skip to content

Commit bdfb181

Browse files
JohannesGaesslerjordankanter
authored andcommitted
CUDA: faster softmax via shared memory + fp16 math (ggml-org#4742)
1 parent c952ea9 commit bdfb181

File tree

2 files changed

+318
-26
lines changed

2 files changed

+318
-26
lines changed

0 commit comments

Comments
 (0)