Skip to content

Commit 8608888

Browse files
charlifugshtras
andauthored
save shape when fp8 solution not found (#123)
Co-authored-by: Gregory Shtrasberg <[email protected]>
1 parent 30f12f0 commit 8608888

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

vllm/model_executor/layers/quantization/fp8_rocm.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -242,6 +242,8 @@ def apply(
242242
k = x.shape[1]
243243

244244
solidx = self._config._tuned.get((m, n, k), 0)
245+
if solidx == 0:
246+
self._config.save_shape(m, n, k)
245247
res = ops.fp8_mm(x_quant, weight.t(), out_dtype, asf, wsf, osf,
246248
int(solidx))
247249

0 commit comments

Comments
 (0)