Closed
Description
This card is a tracker for ggml-org/llama.cpp#3969
This seems to happen to me as well with the llama.cpp backend only: I can reproduce it programmatically with certain text by using grammars
Update:
There is an "epic" here that we should keep an eye on: ggml-org/llama.cpp#4216