Skip to content

Commit 544f4f1

Browse files
committed
fix typo
1 parent 36a1abb commit 544f4f1

File tree

2 files changed

+3
-1
lines changed

2 files changed

+3
-1
lines changed

docs/multimodal.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ llama.cpp supports multimodal input via `libmtmd`. Currently, there are 2 tools
66

77
Currently, we support **image** and **audio** input. Audio is highly experimental and may have reduced quality.
88

9-
To enable it, can use use one of the 2 methods below:
9+
To enable it, you can use one of the 2 methods below:
1010

1111
- Use `-hf` option with a supported model (see a list of pre-quantized model below)
1212
- To load a model using `-hf` while disabling multimodal, use `--no-mmproj`

tools/mtmd/clip.cpp

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2206,6 +2206,8 @@ struct clip_model_loader {
22062206
auto & hparams = ctx_clip.vision_model.hparams;
22072207
std::map<std::string, size_t> tensor_offset;
22082208
std::vector<ggml_tensor *> tensors_to_load;
2209+
2210+
// TODO @ngxson : support both audio and video in the future
22092211
const char * prefix = hparams.has_audio ? "a" : "v";
22102212

22112213
// get offsets

0 commit comments

Comments
 (0)