-
Notifications
You must be signed in to change notification settings - Fork 4.7k
Closed
Labels
decodingDecoding related issuesDecoding related issuesperformanceCPU and memory usage - results and comparisonsCPU and memory usage - results and comparisons
Description
GPU inference on Apple Silicon via Metal backend was recently added to llama.cpp
: ggml-org/llama.cpp#1642
We should port the changes to whisper.cpp
and allow the Decoder to run on the GPU in a similar way
lin72h, sindresorhus, neurostar, Milker90, stoneLee81 and 15 more
Metadata
Metadata
Assignees
Labels
decodingDecoding related issuesDecoding related issuesperformanceCPU and memory usage - results and comparisonsCPU and memory usage - results and comparisons