Commit 9668965
[Kernel][Triton][FP8] Adding fp8 and variable length sequence support to Triton FAv2 kernel (vllm-project#12591)
Signed-off-by: Randall Smith <[email protected]>
Signed-off-by: Mu Huai <[email protected]>1 parent f8d0820 commit 9668965
File tree
2 files changed
+1657
-604
lines changed- tests/kernels
- vllm/attention/ops
2 files changed
+1657
-604
lines changed
0 commit comments