You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
$ ./build/bin/llama-cli --version
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = Apple M1 Pro (MoltenVK) | uma: 1 | fp16: 1 | warp size: 32 | matrix cores: none
version: 4489 (f11cfdfd)
built with Apple clang version 16.0.0 (clang-1600.0.26.6) for arm64-apple-darwin24.2.0
Operating systems
Mac
GGML backends
Vulkan
Hardware
Apple M1 Pro, 32 GB RAM
Models
Meta Llama 3.2 Instruct 1B Q4_K_M
Problem description & steps to reproduce
In a fresh git clone:
$ cmake -B build -DGGML_VULKAN=ON -DGGML_METAL=OFF -DCMAKE_BUILD_TYPE=Release -G Ninja
$ cmake --build build --config Release -j 8
$ ./build/bin/llama-cli -m ~/llamas/Llama-3.2-1B-Instruct-Q4_K_M.gguf -p "The capital of France is " --device Vulkan0 -ngl 17 -no-cnv --version
Result: prompt is echoed, but then generation is obvious nonsense tokens.
If I omit --device Vulkan0 -ngl 17, I get reasonable output, but I see
load_tensors: offloading 0 repeating layers to GPU
load_tensors: offloaded 0/17 layers to GPU
in the logs, suggesting that the GPU is not used. Omitting -ngl 17 and keeping --device Vulkan0 has the same behavior as omitting both -ngl 17 and --device Vulkan0.
First Bad Commit
EDIT: bisect surprisingly finished; seems to bisect to d79d8f3 (#10846).
there are a lot of revs with broken builds in that range. I wrote a simple shell loop to auto-skip them, but it's skipping a lot of revs that mention changing Vulkan, so I'm giving up on bisection being helpful.
Uh oh!
There was an error while loading. Please reload this page.
Name and Version
Operating systems
Mac
GGML backends
Vulkan
Hardware
Apple M1 Pro, 32 GB RAM
Models
Meta Llama 3.2 Instruct 1B Q4_K_M
Problem description & steps to reproduce
In a fresh git clone:
Result: prompt is echoed, but then generation is obvious nonsense tokens.
If I omit
--device Vulkan0 -ngl 17
, I get reasonable output, but I seein the logs, suggesting that the GPU is not used. Omitting
-ngl 17
and keeping--device Vulkan0
has the same behavior as omitting both-ngl 17
and--device Vulkan0
.First Bad Commit
EDIT: bisect surprisingly finished; seems to bisect to d79d8f3 (#10846).
45095a6 is bad
e9e661b is good
there are a lot of revs with broken builds in that range. I wrote a simple shell loop to auto-skip them, but it's skipping a lot of revs that mention changing Vulkan, so I'm giving up on bisection being helpful.
Relevant log output
The text was updated successfully, but these errors were encountered: