Skip to content

Commit 8f900ab

Browse files
CUDA: faster softmax via shared memory + fp16 math (#4742)
1 parent 1fc2f26 commit 8f900ab

File tree

2 files changed

+318
-26
lines changed

2 files changed

+318
-26
lines changed

0 commit comments

Comments
 (0)