forked from ggml-org/llama.cpp
-
Notifications
You must be signed in to change notification settings - Fork 574
Open
Description
Describe the Issue
New multimodal models are supporting not only image captioning (which Kobold implements) but video captioning as well. For examples see Qwen2-VL or Apollo (which is built on Qwen).
Additional Information:
For UI implementation, a simple "Add video" button beside the "Add img" button would suffice - although I believe getting it working with the API is more important. If there is already a way to achieve this with Kobold and I'm mistaken, please let me know!
Thank you for all the hard work! ^_^
LostRuins
Metadata
Metadata
Assignees
Labels
No labels