
Guard against all weights in a super-block being zero #3010


Merged 2 commits from ik/issue_2982 into master on Sep 5, 2023

Conversation

ikawrakow
Contributor

@cebtenzzre

Does this resolve the problem in #2982?

To trigger the assertion observed in #2982, all weights in a block of 256 must be zero. I see that I have a guard against this for all k_quants except Q6_K.

@ikawrakow
Contributor Author

ikawrakow commented Sep 4, 2023

Looking deeper into the assert observed in #2982, I think this change will most likely not fix it. The nearest_int() function is called with NaN, while this change guards against all weights in a block of 256 being zero. If that were to happen, then nearest_int() would be invoked with Inf, not NaN. The way the code is written, I can only see two possibilities to get a NaN while quantizing to Q6_K:

  • There are already NaNs present in the fp16 model being quantized
  • The weights are not all zero, but are extremely small (std::numeric_limits<float>::min() or close to it): small enough that, when computing 63 / max on this line in make_qx_quants(), the result overflows and iscale becomes -Inf. The check in nearest_int() does not trigger when the argument is -Inf, and we get a non-zero integer value back (-4194304). With this, sumlx and suml2 computed further below both become +/-Inf, and we get a scale that is NaN via scale = sumlx/suml2 = Inf/Inf = NaN.

If this analysis is correct, the "bug" observed in #2982 is more of a model problem than anything else. But just in case, I have now added a check against extremely small model weights.

@cebtenzzre
Collaborator

cebtenzzre commented Sep 5, 2023

I'm doing a little more investigation with gdb and rr right now. I haven't tried any of the potential fixes yet.

  • all x values in this block are zero:
>>> print *(float (*)[256])x
$34 = {[0] = 0 <repeats 256 times>}
  • each call to make_qx_quants returns at line 90 (!amax case)
  • make_qx_quants returns scale=0 for each of the 16 sub-blocks
  • iscale = -128.f/max_scale and max_scale is zero
  • when nearest_int(iscale*scales[ib]) is called, iscale is -inf and scales[ib] is 0, and -inf * 0 = NaN.

So, maybe the first commit from this PR is the right fix.

edit: 79f1757 is sufficient. 2d5f5d7 doesn't seem to hurt.

@ikawrakow ikawrakow requested review from howard0su and ggerganov and removed request for howard0su September 5, 2023 06:22
@ikawrakow ikawrakow merged commit d59bd97 into master Sep 5, 2023
@ikawrakow ikawrakow deleted the ik/issue_2982 branch September 5, 2023 07:55
Comment on lines +1091 to +1092
y[i].d = ggml_fp32_to_fp16(0.f);
continue;
Member


@ikawrakow

Don't we need to increase x here?

            y[i].d = ggml_fp32_to_fp16(0.f);
            x += QK_K;
            continue;

Contributor Author


Yes, we do. Great catch!

Contributor Author


Will you fix it?

Member


Yup, will push a fix in a minute directly to master.
