Allow quantize to only copy tensors, other improvements #2931

KerfuffleV2 · 2023-08-31T11:47:08Z

You can now specify copy as the type to the quantize tool to only copy tensors, never quantize. I forget where I saw it, but it was recently mentioned that something like could be used to convert/repackage from stuff like GGUF V1 to GGUF V2.

This also improves the logic in the actual quantize function (mainly for k-quants) to be a bit smarter when requantizing to the same format. The current logic skips quantizing when quantize_type == tensor->type. However, this means the k-quants logic for stuff like counting numbers of tensors to decide whether to use more bits doesn't run. The result is if you quantize to q4_k_m and then requantize to q4_k_m stuff like requantizing q6_k tensors to q4_k will occur. Not super likely, but I recall reading a pull recently where someone was confused by behavior like that and making it work a little more intuitively is pretty easy. (Of course, if the k-quants strategy changes then you're still going to have a bad time but this change is at least as good as the status quo.)

ghost · 2023-08-31T14:14:56Z

Related: #2821 (comment)

examples/quantize/quantize.cpp

Allow quantize to only copy tensors, other improvements

b860f65

ggerganov approved these changes Sep 1, 2023

View reviewed changes

examples/quantize/quantize.cpp Outdated Show resolved Hide resolved

quantize: Use stdout for help message.

1e05731

KerfuffleV2 merged commit 5d6f19f into ggml-org:master Sep 1, 2023

KerfuffleV2 mentioned this pull request Sep 3, 2023

Converting GGML->GGUF: ValueError: Only GGJTv3 supported #2990

Closed

KerfuffleV2 deleted the feat-quantize-copy branch September 6, 2023 08:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow quantize to only copy tensors, other improvements #2931

Allow quantize to only copy tensors, other improvements #2931

Uh oh!

KerfuffleV2 commented Aug 31, 2023

Uh oh!

ghost commented Aug 31, 2023

Uh oh!

Uh oh!

Uh oh!

Allow quantize to only copy tensors, other improvements #2931

Allow quantize to only copy tensors, other improvements #2931

Uh oh!

Conversation

KerfuffleV2 commented Aug 31, 2023

Uh oh!

ghost commented Aug 31, 2023

Uh oh!

Uh oh!

Uh oh!