Add n_gpu_layers option to talk-llama example #1475

rlapray · 2023-11-11T01:46:53Z

Enables offloading layers to the GPU with the -ngl or --n-gpu-layers option.

Previously, --use-gpu could be used, but the number of offloaded layers was limited to 0.
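
For readers who want a picture of what such an option involves, here is a minimal sketch of parsing `-ngl` / `--n-gpu-layers` into a parameter struct. It is not the code from this PR: `sketch_params` and `sketch_params_parse` are hypothetical names made up for illustration, and the default of 0 only mirrors the "limited to 0" behaviour described above. How the value is then passed on to llama is not shown on this page and is left out.

```cpp
#include <cstddef>
#include <cstdint>
#include <exception>
#include <string>
#include <vector>

// Hypothetical parameter struct (assumption, not the example's real struct).
struct sketch_params {
    int32_t n_gpu_layers = 0; // 0 = no layers offloaded to the GPU
};

// Scan the arguments for -ngl / --n-gpu-layers and store the value.
// Returns false if the flag has no value or the value is not an integer.
// Any other argument is ignored in this sketch.
bool sketch_params_parse(const std::vector<std::string> & args, sketch_params & params) {
    for (std::size_t i = 0; i < args.size(); ++i) {
        const std::string & arg = args[i];
        if (arg == "-ngl" || arg == "--n-gpu-layers") {
            if (i + 1 >= args.size()) {
                return false; // flag given without a value
            }
            try {
                params.n_gpu_layers = std::stoi(args[++i]);
            } catch (const std::exception &) {
                return false; // value was not a valid integer
            }
        }
    }
    return true;
}
```

With this sketch, an argument list of `-ngl 32` leaves `n_gpu_layers` at 32, while omitting the flag keeps the default of 0.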

add n_gpu_layers to talk-llama example

6a65e7d

ggerganov approved these changes Nov 13, 2023

ggerganov merged commit c23598e into ggml-org:master Nov 13, 2023

felrock pushed a commit to felrock/whisper.cpp that referenced this pull request Nov 18, 2023

talk-llama : add n_gpu_layers parameter (ggml-org#1475)

9a4c6fc

landtanin pushed a commit to landtanin/whisper.cpp that referenced this pull request Dec 16, 2023

talk-llama : add n_gpu_layers parameter (ggml-org#1475)

0bfcd89

iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024

talk-llama : add n_gpu_layers parameter (ggml-org#1475)

9a08578

peardox mentioned this pull request Apr 17, 2025

if GGML_BACKEND_DL defined use one of the ggml_backend_load functions #3057

Closed
