Skip to content

Commit 64c46fc

Browse files
CUDA: faster softmax via shared memory + fp16 math
1 parent 540938f commit 64c46fc

File tree

2 files changed

+285
-24
lines changed

2 files changed

+285
-24
lines changed

0 commit comments

Comments
 (0)