[Feature] Implement InternVL to llama.cpp #522

Open
James4Ever0 opened this issue Aug 20, 2024 · 22 comments

@James4Ever0

Motivation

Many llama.cpp users are requesting this. Ollama is one of the interfaces to llama.cpp and is quite popular. Implementing InternVL support would significantly accelerate InternVL adoption and recognition.

Related resources

ggml-org/llama.cpp#6803

Additional context

InternVL is based on the LLaMA architecture. The text-only InternLM models have already been ported to Ollama, but the multimodal ones have not.

@G-z-w
Collaborator

G-z-w commented Aug 26, 2024

Thank you for your suggestions. We will gradually push support for various frameworks, and we also welcome contributions from the community.

@czczup closed this as completed Sep 25, 2024
@James4Ever0
Author

Hey @czczup, would you please clarify the reason for closing this issue?

@sammcj

sammcj commented Sep 25, 2024

Any update on internVL support with llama.cpp?

@cloudyuyuyu

Any update on internVL support with llama.cpp?

@G-z-w
Collaborator

G-z-w commented Sep 27, 2024

Thank you for your attention. We are actively progressing on this work, and we also welcome contributions from the community.

@rampageservices

Just curious: why was this issue closed if you are actively progressing on this work?

@czczup reopened this Sep 30, 2024
@rampageservices

rampageservices commented Oct 2, 2024 via email

@cloudyuyuyu

Any update?

@Cartomex-MX

November 12, any update? Thanks in advance.

@BleedingDev

Is there anything we could help with? :) InternVL 2.5 is really important for the future. :)

@linxhome

linxhome commented Jan 6, 2025

Paying close attention to this. Any update, please?

@James4Ever0
Author

James4Ever0 commented Jan 6, 2025

Is it really that hard to do? I volunteer to implement it myself.

Current progress: ggml-org/llama.cpp#9403

Also, ipex-llm has support for InternVL2.

@G-z-w
Collaborator

G-z-w commented Jan 6, 2025

> Is it really that hard to do? I volunteer to implement it myself.

Thank you for your willingness to help! We greatly appreciate your initiative and would be glad to have your contributions. If you need us to provide any content or information, feel free to let us know.

@James4Ever0
Author

James4Ever0 commented Jan 8, 2025

@G-z-w Is this model based on the LLaVA architecture? What are the differences in input, output, internal parameters, and more?

@G-z-w
Collaborator

G-z-w commented Jan 8, 2025

> @G-z-w Is this model based on the LLaVA architecture? What are the differences in input, output, internal parameters, and more?

This model, along with the other InternVL chat models, is similar to the LLaVA framework, with the specific structure shown in the link. The differences lie in dynamic resolution and pixel shuffle.

If convenient, we recommend prioritizing the deployment of the InternVL 2.5 series. The parameter details are in the model card and the blog.
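
For context on the pixel-shuffle step mentioned above, here is a minimal PyTorch sketch of the space-to-depth operation that InternVL-style models apply to the ViT feature map to cut the visual token count. The tensor layout and default 0.5 scale factor follow the public InternVL code, but treat this as an illustrative sketch rather than the exact implementation:

```python
import torch

def pixel_shuffle(x: torch.Tensor, scale_factor: float = 0.5) -> torch.Tensor:
    """Space-to-depth on an (N, H, W, C) ViT feature map: trades spatial
    resolution for channel depth, so the number of visual tokens shrinks
    by a factor of 1 / scale_factor**2 (4x fewer at the default 0.5)."""
    n, h, w, c = x.size()
    # (N, H, W, C) -> (N, H, W*s, C/s)
    x = x.view(n, h, int(w * scale_factor), int(c / scale_factor))
    # (N, H, W*s, C/s) -> (N, W*s, H, C/s)
    x = x.permute(0, 2, 1, 3).contiguous()
    # (N, W*s, H, C/s) -> (N, W*s, H*s, C/(s*s))
    x = x.view(n, int(w * scale_factor), int(h * scale_factor),
               int(c / (scale_factor * scale_factor)))
    return x

# A 32x32 patch grid with 1024 channels becomes 16x16 with 4096 channels,
# i.e. 256 visual tokens instead of 1024:
feats = torch.randn(1, 32, 32, 1024)
print(pixel_shuffle(feats).shape)  # torch.Size([1, 16, 16, 4096])
```

Dynamic resolution then determines how many such tiles (and therefore token groups) an input image produces, which is the other main difference from plain LLaVA.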

@James4Ever0
Author

James4Ever0 commented Jan 14, 2025

> > @G-z-w Is this model based on the LLaVA architecture? What are the differences in input, output, internal parameters, and more?
>
> This model, along with the other InternVL chat models, is similar to the LLaVA framework, with the specific structure shown in the link. The differences lie in dynamic resolution and pixel shuffle.
>
> If convenient, we recommend prioritizing the deployment of the InternVL 2.5 series. The parameter details are in the model card and the blog.

Is the model structure of the v2.5 series identical to the v1.5 series? I can now run v1.5 on llama.cpp: qlylangyu/llama.cpp#1

@G-z-w
Collaborator

G-z-w commented Jan 14, 2025

> > > @G-z-w Is this model based on the LLaVA architecture? What are the differences in input, output, internal parameters, and more?
> >
> > This model, along with the other InternVL chat models, is similar to the LLaVA framework, with the specific structure shown in the link. The differences lie in dynamic resolution and pixel shuffle.
> > If convenient, we recommend prioritizing the deployment of the InternVL 2.5 series. The parameter details are in the model card and the blog.
>
> Is the model structure of the v2.5 series identical to the v1.5 series? I can now run v1.5 on llama.cpp.

Yes, the structure of the v2.5 series is identical to that of the v1.5 series, except that the v2.5 series uses different language models.

@BakingBrains

@James4Ever0 any luck running the v2.5 model?

Thank you

@gryffindor-rr

I tried llama.cpp today; it's still not supported:

```
python3 convert_hf_to_gguf.py model_path
ERROR:hf-to-gguf:Model InternVLChatModel is not supported
```

Any update, by any chance?

@MrZeros

MrZeros commented Mar 3, 2025

Any update?

@James4Ever0
Author

James4Ever0 commented Mar 3, 2025

For anyone about to work with the current code, you can check my latest release here.

The 18 KB archive contains function-level diffs, generated with Universal Ctags and some Python magic, so anyone with entry-level C++ knowledge should be able to merge the changes easily.
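
For anyone curious how such function-level diffs can be produced, a rough sketch follows: Universal Ctags lists each function's line span, and difflib then diffs matching functions between the two file versions. The file names and overall flow here are hypothetical, not the exact script behind the archive:

```python
#!/usr/bin/env python3
"""Sketch: function-level diffs via Universal Ctags + difflib."""
import difflib
import json
import subprocess

def function_spans(path: str) -> dict[str, tuple[int, int]]:
    """Map each C++ function name to its (start, end) line span."""
    # --fields=+ne adds the start ("line") and end ("end") line numbers;
    # --kinds-c++=f restricts the output to function definitions.
    out = subprocess.run(
        ["ctags", "--output-format=json", "--fields=+ne",
         "--kinds-c++=f", path],
        capture_output=True, text=True, check=True).stdout
    spans = {}
    for record in map(json.loads, out.splitlines()):
        if record.get("_type") == "tag" and "end" in record:
            # Note: overloads sharing a name collapse to one entry here.
            spans[record["name"]] = (record["line"], record["end"])
    return spans

def function_diffs(old_path: str, new_path: str) -> str:
    """Unified diff of every function present in both file versions."""
    old_lines = open(old_path).readlines()
    new_lines = open(new_path).readlines()
    old_spans = function_spans(old_path)
    new_spans = function_spans(new_path)
    chunks = []
    for name in sorted(old_spans.keys() & new_spans.keys()):
        (a, b), (c, d) = old_spans[name], new_spans[name]
        chunks.extend(difflib.unified_diff(
            old_lines[a - 1:b], new_lines[c - 1:d],
            fromfile=f"old/{name}", tofile=f"new/{name}"))
    return "".join(chunks)

if __name__ == "__main__":
    # Placeholder file names for two versions of a source file.
    print(function_diffs("clip_old.cpp", "clip_new.cpp"))
```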

@BrandonJull

+1
