Add support for CogVLM model #15002
42113d1 to de22157
I think I've fixed the typecheck and format check workflows that were failing before. Can someone approve the workflows to run again?
You can run |
CISC
left a comment
This is not a complete review as I don't know enough about mtmd, just commenting...
Thanks for the info! That's something I've been wondering about for a while.
de22157 to a571d9a
a571d9a to ac3992d
CISC
left a comment
Further refinement (merge cont+reshape).
After #15662 we can avoid these altogether and just create 3D views.
Merged; please rebase and apply the updated suggestions.
ac3992d to 06a0719
Thanks for the reminder, I've rebased it and removed the extra ggml_cont calls.
Sorry, I missed the notification to review this. I will have a look and push commits to resolve the conflicts.
ngxson
left a comment
I don't have enough VRAM to test the model right now, but I think the code should be good to merge (once CI passes).
Feel free to give it a try even after the PR is merged. If there are bugs, we can make follow-up PRs to fix them.
No idea why the ASAN test failed; it was probably just a random runtime issue. I'm re-running the CI.
It's |
By the way @Tianyue-Zhao, it seems this implementation still uses the legacy llava preprocessing and does not support dynamic resolution. Is this expected?
This addresses the requests for CogVLM in #4387 and #4350.
CogVLM is a fairly popular model that can now be added cleanly thanks to the recent additions to libmtmd.
I've converted a GGUF here: Link to GGUF files
Sample command and output: