Skip to content

Commit 011e8ec

Browse files
authored
llama : fix not enough space in buffer with Qwen (#5086)
1 parent 6f9939d commit 011e8ec

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

llama.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4440,9 +4440,9 @@ static struct ggml_tensor * llm_build_kv(
44404440

44414441
// these nodes are added to the graph together so that they are not reordered
44424442
// by doing so, the number of splits in the graph is reduced
4443+
ggml_build_forward_expand(graph, q_cur);
44434444
ggml_build_forward_expand(graph, k_cur);
44444445
ggml_build_forward_expand(graph, v_cur);
4445-
ggml_build_forward_expand(graph, q_cur);
44464446

44474447
llm_build_kv_store(ctx, hparams, kv, graph, k_cur, v_cur, n_ctx, n_tokens, kv_head, cb, il);
44484448

0 commit comments

Comments
 (0)