Skip to content

Commit 1170a95

Browse files
Fixed embd when offloading non-repeating layers
1 parent a09f919 commit 1170a95

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

llama.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1654,7 +1654,7 @@ static bool llama_eval_internal(
16541654

16551655
// cur = cur*norm(broadcasted)
16561656
cur = ggml_mul(ctx0, cur, model.norm);
1657-
offload_func_nr(cur);
1657+
// offload_func_nr(cur); // TODO CPU + GPU mirrored backend
16581658
ggml_set_name(cur, "result_norm");
16591659

16601660
embeddings = cur;

0 commit comments

Comments
 (0)