Skip to content

Commit e95b655

Browse files
ggerganovsw
andauthored
ggml : add Q8_0 quantization for intermediate results (#951)
* ggml : add Q8_0 quantization for intermediate results * quantize-stats : fix test + add it to Makefile default * Q8: use int8_t, AVX/AVX2 optimizations * ggml : fix quantize_row_q8_0() ARM_NEON rounding * minor : updates after rebase to latest master * quantize-stats : delete obsolete strings * ggml : fix q4_1 dot func --------- Co-authored-by: Stephan Walter <[email protected]>
1 parent aa485ce commit e95b655

File tree

3 files changed

+442
-18
lines changed

3 files changed

+442
-18
lines changed

Makefile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -133,7 +133,7 @@ $(info I CC: $(CCV))
133133
$(info I CXX: $(CXXV))
134134
$(info )
135135

136-
default: main quantize perplexity embedding
136+
default: main quantize quantize-stats perplexity embedding
137137

138138
#
139139
# Build library

0 commit comments

Comments
 (0)