Vulkan: Fix mmq int dot float cache size #12722


Merged 1 commit into master on Apr 2, 2025

Conversation

@0cc4m (Collaborator) commented Apr 2, 2025

I don't know how I (and everyone else) missed this, considering it means models are completely incoherent when using the new int dot shaders, but here's the fix. The cache buffer for the quant dm values was too small and overflowed, leading to NaN results.

@0cc4m 0cc4m requested a review from jeffbolznv April 2, 2025 15:29
@jeffbolznv (Collaborator) left a comment


LGTM, I didn't try running it.

@github-actions bot added labels on Apr 2, 2025: Vulkan (Issues specific to the Vulkan backend), ggml (changes relating to the ggml tensor library for machine learning)
@0cc4m 0cc4m merged commit 92e3006 into master Apr 2, 2025
44 checks passed
@0cc4m 0cc4m deleted the 0cc4m/vulkan-mmq-dp4a-fix branch April 2, 2025 17:12
@0cc4m (Collaborator, Author) commented Apr 2, 2025

For some reason this change removed a large chunk of the shader's performance gain, and I'm not sure why. It added 8 bytes of register use, which may have crossed an occupancy limit, but that would be very odd. I hope it can be fixed.

Labels: ggml (changes relating to the ggml tensor library for machine learning), Vulkan (Issues specific to the Vulkan backend)
2 participants