We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 595007b commit 966aa9bCopy full SHA for 966aa9b
examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/run_quant.sh
@@ -70,7 +70,7 @@ function run_tuning {
70
python main.py \
71
--quant_format ${quant_format-QOperator} \
72
--model_path ${input_model} \
73
- --tokenizer ${tokenizer-meta-llama/Llama-2-7b-hf} \
+ --tokenizer ${tokenizer-meta-llama/Llama-2-7b-hf} \
74
--output_model ${output_model} \
75
--batch_size ${batch_size-1} \
76
--smooth_quant_alpha ${alpha-0.6} \
0 commit comments