[MM][Doc] Add documentation for configurable mm profiling #26200
Conversation
Code Review
This pull request adds documentation for the new configurable multi-modal profiling options. The documentation is clear and provides a good example. However, a TODO note left in the user-facing documentation describes an important limitation of the feature; it should be rephrased so that users are not confused about memory usage.
- `image`: `{"count": int, "width": int, "height": int}`
- `video`: `{"count": int, "num_frames": int, "width": int, "height": int}`
- `audio`: `{"count": int, "length": int}`
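For illustration, here is a minimal sketch of how these per-modality options might be supplied from Python. It assumes the dicts are accepted through the existing `limit_mm_per_prompt` engine argument and uses an arbitrary multimodal model; check the merged documentation for the exact parameter name and format.

```python
# Hedged sketch: the parameter format and model name are illustrative assumptions.
from vllm import LLM

llm = LLM(
    model="Qwen/Qwen2.5-VL-3B-Instruct",  # any multimodal model
    limit_mm_per_prompt={
        # Profile memory assuming at most one 32-frame 512x512 video per prompt,
        # instead of the model's worst-case defaults.
        "video": {"count": 1, "num_frames": 32, "width": 512, "height": 512},
    },
)
```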
It might be nice to include links to the API docs here?
That way, if this doc ever falls out of sync, the reader can at least see the latest options in the API docs.
For example, for audio you'd use:
[`AudioDummyOptions`][vllm.config.multimodal.AudioDummyOptions]
Documentation preview: https://vllm--26200.org.readthedocs.build/en/26200/
Signed-off-by: wwl2755 <[email protected]>
…to loader * 'loader' of https://github.com/dsxsteven/vllm_splitPR: (778 commits)

- [torchao] Add support for ModuleFqnToConfig using regex (vllm-project#26001)
- Add: Support for multiple hidden layers in Eagle3 (vllm-project#26164)
- Enable `RMSNorm` substitution for Transformers backend (vllm-project#26353)
- [Model] Gemma3: Fix GGUF loading and quantization (vllm-project#26189)
- Bump Flashinfer to v0.4.0 (vllm-project#26326)
- Update Dockerfile and install runai-model-streamer[gcs] package (vllm-project#26464)
- [Core] Relax the LoRA max rank (vllm-project#26461)
- [CI/Build] Fix model nightly tests (vllm-project#26466)
- [Hybrid]: Decouple Kernel Block Size from KV Page Size (vllm-project#24486)
- [Core][KVConnector] Propagate all tokens on resumed preemptions (vllm-project#24926)
- [MM][Doc] Add documentation for configurable mm profiling (vllm-project#26200)
- [Hardware][AMD] Enable FlexAttention backend on ROCm (vllm-project#26439)
- [Bugfix] Incorrect another MM data format in vllm bench throughput (vllm-project#26462)
- [Bugfix] Catch and log invalid token ids in detokenizer #2 (vllm-project#26445)
- [Minor] Change warning->warning_once in preprocess (vllm-project#26455)
- [Bugfix] Set the minimum python version for gpt-oss (vllm-project#26392)
- [Misc] Redact ray runtime env before logging (vllm-project#26302)
- Separate MLAAttention class from Attention (vllm-project#25103)
- [Attention] Register FLASHMLA_SPARSE (vllm-project#26441)
- [Kernels] Modular kernel refactor (vllm-project#24812)
- ...
…ct#26200) Signed-off-by: wwl2755 <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
…ct#26200) Signed-off-by: wwl2755 <[email protected]> Signed-off-by: Dhruvil Bhatt <[email protected]>
…ct#26200) Signed-off-by: wwl2755 <[email protected]>
PR #25631 introduced a configurable multi-modal profiling method, and this PR adds the corresponding usage guidelines to the documentation.
CC: @ywang96 @DarkLight1337 @Isotr0py @hmellor