What happened?
Mainline llama.cpp just wrapped the imatrix.dat data in the GGUF format, which means Bartowski's imatrix files and mradermacher's imatrix.gguf files can no longer be used to quantize low-bit ik_llama.cpp GGUFs. What a below-the-belt hit on you, @ikawrakow.
mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-i1-GGUF/blob/main/Llama-3_3-Nemotron-Super-49B-v1_5.imatrix.gguf
ik_llama.cpp probably needs to merge the imatrix GGUF patch from mainline to maintain compatibility.
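
Until that lands, a quick way to tell which kind of imatrix file you have is to look at the first four bytes: every GGUF file starts with the magic `GGUF`. Below is a minimal sketch of that check. The helper name `is_gguf_imatrix` is made up for illustration and is not part of llama.cpp or ik_llama.cpp, and it only identifies the container, it does not validate the imatrix contents. It also assumes a legacy imatrix.dat never happens to start with those same four bytes (as far as I know it starts with a binary entry count, so that should hold in practice).

```python
GGUF_MAGIC = b"GGUF"  # the first four bytes of every GGUF file


def is_gguf_imatrix(path):
    """Return True if the file at `path` starts with the GGUF magic bytes.

    Hypothetical helper: it only inspects the container header and does
    not check that the file actually holds imatrix data.
    """
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC
```

If this returns True for your imatrix file, it is in the new mainline format and is the kind of file this issue is about.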
Name and Version
llama.cpp
What operating system are you seeing the problem on?
No response
Relevant log output