clip : improve projector naming #13118

ngxson · 2025-04-25T22:18:00Z

I don't quite like the abstract naming like "resample", "merger", etc. It can be useful if one projector can be reused by various vision models. But unfortunately, that has hardly been the case.

The cumbersome bool has_*_projector pattern is also removed. The only variable being kept is has_llava_projector, because both MLP, MLP_NORM, LDP, LDPV2 are considered variants of llava projector.

Test result:

OK:   llama-mtmd-cli ggml-org/SmolVLM-500M-Instruct-GGUF:Q8_0
OK:   llama-mtmd-cli ggml-org/SmolVLM2-2.2B-Instruct-GGUF:Q4_K_M
OK:   llama-mtmd-cli ggml-org/SmolVLM2-500M-Video-Instruct-GGUF:Q8_0
OK:   llama-mtmd-cli ggml-org/gemma-3-4b-it-GGUF:Q4_K_M
OK:   llama-mtmd-cli guinmoon/MobileVLM-3B-GGUF:Q4_K_M
OK:   llama-mtmd-cli THUDM/glm-edge-v-5b-gguf:Q4_K_M
OK:   llama-mtmd-cli second-state/Llava-v1.5-7B-GGUF:Q2_K
OK:   llama-mtmd-cli cjpais/llava-1.6-mistral-7b-gguf:Q3_K
OK:   llama-mtmd-cli ibm-research/granite-vision-3.2-2b-GGUF:Q4_K_M
OK:   llama-mtmd-cli second-state/MiniCPM-Llama3-V-2_5-GGUF:Q2_K
OK:   llama-mtmd-cli openbmb/MiniCPM-V-2_6-gguf:Q2_K
OK:   llama-mtmd-cli openbmb/MiniCPM-o-2_6-gguf:Q4_0
OK:   llama-qwen2vl-cli bartowski/Qwen2-VL-2B-Instruct-GGUF:Q4_K_M
OK:   llama-mtmd-cli ggml-org/pixtral-12b-GGUF:Q4_K_M

ngxson · 2025-04-25T22:24:05Z

examples/llava/clip-impl.h

+    PROJECTOR_TYPE_MINICPMV,
    PROJECTOR_TYPE_GLM_EDGE,
-    PROJECTOR_TYPE_MERGER,
+    PROJECTOR_TYPE_QWEN2VL,


cc @HimariO , PROJECTOR_TYPE_RESAMPLER is renamed to PROJECTOR_TYPE_QWEN2VL

For qwen2.5, we can add PROJECTOR_TYPE_QWEN25VL. For code paths used by qwenvl, we will need to check ctx->proj_type == PROJECTOR_TYPE_QWEN2VL || ctx->proj_type == PROJECTOR_TYPE_QWEN25VL

But tbh the best way is to have a dedicated builder function for qwenvl, it makes the code much easier to read. I'll make a proposal in the next few days.

* clip : improve projector naming * no more kv has_llava_projector * rm unused kv * rm more unused

ngxson added 3 commits April 26, 2025 00:03

clip : improve projector naming

84a5922

no more kv has_llava_projector

5fa6723

rm unused kv

1e4ce1d

ngxson requested a review from ggerganov April 25, 2025 22:18

github-actions bot added the examples label Apr 25, 2025

rm more unused

0a7ef17

ngxson commented Apr 25, 2025

View reviewed changes

ggerganov approved these changes Apr 26, 2025

View reviewed changes

ngxson merged commit 4753791 into ggml-org:master Apr 26, 2025
48 checks passed

pockers21 pushed a commit to pockers21/llama.cpp that referenced this pull request Apr 28, 2025

clip : improve projector naming (ggml-org#13118)

cc00eb2

* clip : improve projector naming * no more kv has_llava_projector * rm unused kv * rm more unused

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

clip : improve projector naming #13118

clip : improve projector naming #13118

Uh oh!

ngxson commented Apr 25, 2025 •

edited

Loading

Uh oh!

ngxson Apr 25, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

clip : improve projector naming #13118

clip : improve projector naming #13118

Uh oh!

Conversation

ngxson commented Apr 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ngxson Apr 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ngxson commented Apr 25, 2025 •

edited

Loading

ngxson Apr 25, 2025 •

edited

Loading