
Remove printing of prompt and prompt tokenization at startup #480


Closed
slaren wants to merge 1 commit from the remove-tokenization branch

Conversation

slaren (Member) commented Mar 24, 2023

Now that the tokenizer has been tested fairly well, printing the tokenization on startup adds a lot of clutter for no good reason.

This also removes printing of the prompt itself, since it is already printed as it is evaluated anyway.

ggerganov (Member) commented:
The lines have to at least be commented out so we can easily re-enable them; sometimes it is still useful to look at the tokens.
I think it is best to make this on/off via a command line arg.

anzz1 (Contributor) commented Mar 25, 2023

I do not agree; a '--quiet' option would be better instead. It is very useful to know which tokens are generated when trying out different prompts, and the tokens are vital when researching the whole thing.

Actually, it would be a cool addition to have a command line option that shows the output tokenized along with the token IDs, to get a better understanding of different models and their differences. So another option is to go the other way around: instead of '--quiet', a '-v' / '--verbose' option that shows this info.
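
For illustration, here is a minimal sketch of what such a verbose token dump might look like, written against the llama.cpp C API as it existed around this PR (llama_tokenize and llama_token_to_str); the verbose flag itself is hypothetical, not something this PR adds:

#include <cstdio>
#include <string>
#include <vector>
#include "llama.h"

// Sketch only: tokenize a prompt and, when `verbose` is set, dump each
// token id next to its text, much like the startup output discussed here.
static void dump_prompt_tokens(llama_context * ctx, const std::string & prompt, bool verbose) {
    // The token count cannot exceed the prompt length plus one for the BOS token.
    std::vector<llama_token> tokens(prompt.size() + 1);
    const int n = llama_tokenize(ctx, prompt.c_str(), tokens.data(), (int) tokens.size(), /*add_bos=*/ true);
    if (n < 0 || !verbose) {
        return; // tokenization failed, or the user did not ask for the dump
    }
    fprintf(stderr, "number of tokens in prompt = %d\n", n);
    for (int i = 0; i < n; i++) {
        fprintf(stderr, "%6d -> '%s'\n", tokens[i], llama_token_to_str(ctx, tokens[i]));
    }
}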

Also, since the "debug" and "standard" outputs are already directed to different streams, did you know that you can easily show only the generated output by redirecting stderr to a file or to the null device?

To a file:
main -m ./models/llama-13B-ggml/ggml-model-q4_0.bin 2> err.log

Silent:
main -m ./models/llama-13B-ggml/ggml-model-q4_0.bin 2>nul        (Windows)
main -m ./models/llama-13B-ggml/ggml-model-q4_0.bin 2>/dev/null  (Unix)

In any case, I agree that it definitely shouldn't be outright removed but rather made a command line option. That said, it's already easy to redirect stderr elsewhere if you want a 'clean look' when not testing or researching.

ggerganov (Member) commented:

Added a command line option for now: 502a400
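
For reference, the gating presumably looks something along these lines; this is a sketch only, since the exact flag name and plumbing in 502a400 are assumptions here (later llama.cpp versions expose this kind of switch as --verbose-prompt):

#include <cstring>

// Sketch: keep the token dump, but only emit it when the user asks for it.
// The flag name mirrors the --verbose-prompt option found in later llama.cpp;
// whether commit 502a400 used exactly this shape is an assumption.
struct example_params {
    bool verbose_prompt = false; // default off: no tokenization clutter at startup
};

static bool parse_arg(example_params & params, const char * arg) {
    if (std::strcmp(arg, "--verbose-prompt") == 0) {
        params.verbose_prompt = true;
        return true;
    }
    return false; // not ours; let the caller handle it
}

// Then, after the prompt has been tokenized at startup:
//     if (params.verbose_prompt) {
//         // print the prompt and the id -> token dump to stderr
//     }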

ggerganov closed this Mar 25, 2023
slaren deleted the remove-tokenization branch Mar 26, 2023
AAbushady pushed a commit to AAbushady/llama.cpp that referenced this pull request on Jan 27, 2024:
* Update gpttype_adapter.cpp

* use n_vocab instead of 32000 for when top k is off