You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/reference/launcher.md
+3-1Lines changed: 3 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -55,7 +55,9 @@ Options:
55
55
## QUANTIZE
56
56
```shell
57
57
--quantize <QUANTIZE>
58
-
Whether you want the model to be quantized
58
+
Quantization method to use for the model. It is not necessary to specify this option for pre-quantized models, since the quantization method is read from the model configuration.
59
+
60
+
Marlin kernels will be used automatically for GPTQ/AWQ models.
0 commit comments