Commit 884e7d7

ggerganov and slaren authored
ggml : use 8-bit precision for Q4_1 intermediate results (#1047)
* ggml : use 8-bit precision for Q4_1 intermediate results (ARM)
* ggml : optimize ggml_vec_dot_q4_1_q8_0() via vmalq_n_f32
  56 ms/token with Q4_1 !
* ggml : AVX2 implementation of ggml_vec_dot_q4_1_q8_0 (#1051)
* gitignore : ignore ppl-*.txt files

---------

Co-authored-by: slaren <[email protected]>
1 parent 7cd5c4a commit 884e7d7
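
For orientation, here is a minimal scalar sketch in C of what a Q4_1 x Q8_0 dot product with 8-bit intermediates can look like. It is not the code from this commit. The block size of 32, the struct layouts, the nibble order, and the name `vec_dot_q4_1_q8_0_ref` are all assumptions made for illustration. It relies on a Q4_1 value being `d*q + m` and a Q8_0 value being `d*q`, so the products inside each block can be accumulated as integers and scaled once per block.

```c
#include <stdint.h>

#define QK 32 /* values per block (assumed) */

/* Hypothetical simplified block layouts, for illustration only. */
typedef struct {
    float   d;            /* scale */
    float   m;            /* offset (minimum) */
    uint8_t qs[QK / 2];   /* two 4-bit quants per byte, each in [0, 15] */
} blk_q4_1;               /* dequantized value: d * q + m */

typedef struct {
    float  d;             /* scale */
    int8_t qs[QK];        /* 8-bit quants */
} blk_q8_0;               /* dequantized value: d * q */

/* Dot product over nb blocks: integer sums inside a block, float scaling per block.
 * Assumes the low nibble holds element 2*j and the high nibble element 2*j + 1. */
static float vec_dot_q4_1_q8_0_ref(int nb, const blk_q4_1 *x, const blk_q8_0 *y) {
    float sum = 0.0f;
    for (int i = 0; i < nb; i++) {
        int32_t sum_qy = 0; /* sum of q_x * q_y */
        int32_t sum_y  = 0; /* sum of q_y, needed for the offset term */
        for (int j = 0; j < QK / 2; j++) {
            const int q0 = x[i].qs[j] & 0x0F;
            const int q1 = x[i].qs[j] >> 4;
            const int y0 = y[i].qs[2 * j];
            const int y1 = y[i].qs[2 * j + 1];
            sum_qy += q0 * y0 + q1 * y1;
            sum_y  += y0 + y1;
        }
        /* sum over the block of (d_x*q_x + m_x) * (d_y*q_y) */
        sum += y[i].d * (x[i].d * (float)sum_qy + x[i].m * (float)sum_y);
    }
    return sum;
}
```

The SIMD paths named in the commit message (ARM NEON and AVX2) would compute the same quantity with vector instructions. This sketch only shows the arithmetic.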

2 files changed

+192
-194
lines changed

.gitignore

Lines changed: 8 additions & 7 deletions
@@ -1,11 +1,15 @@
 *.o
 *.a
+.DS_Store
+.build/
 .cache/
+.direnv/
+.envrc
+.swiftpm
+.venv
 .vs/
 .vscode/
-.DS_Store
 
-.build/
 build/
 build-em/
 build-debug/
@@ -30,12 +34,9 @@ models/*
 arm_neon.h
 compile_commands.json
 
-.envrc
-.direnv/
-
-.venv
 __pycache__
-.swiftpm
 
 zig-out/
 zig-cache/
+
+ppl-*.txt
