[V1] Remove input cache client #14864

DarkLight1337 · 2025-03-15T14:03:01Z

All multi-modal models that support V1 have been refactored to use the merged input processor, so we can remove the code path that corresponds to legacy input mapper.

Signed-off-by: DarkLight1337 <[email protected]>

github-actions · 2025-03-15T14:03:10Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: DarkLight1337 <[email protected]>

ywang96 · 2025-03-17T00:17:30Z

Ooops - I deleted a comment by accident - but yes we only need to consider models that are supported on V1.

…client

Signed-off-by: Roger Wang <[email protected]>

ywang96

LGTM! I added a few changes for better code clarity and fixed the CI errors

Signed-off-by: Roger Wang <[email protected]>

Signed-off-by: DarkLight1337 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Signed-off-by: Louis Ulmer <[email protected]>

Signed-off-by: DarkLight1337 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]>

Signed-off-by: DarkLight1337 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Signed-off-by: Mu Huai <[email protected]>

[VLM][V1] Remove input cache client

ef14eb5

Signed-off-by: DarkLight1337 <[email protected]>

DarkLight1337 requested a review from ywang96 March 15, 2025 14:03

DarkLight1337 requested review from WoosukKwon, robertgshaw2-redhat, njhill, comaniac and alexm-redhat as code owners March 15, 2025 14:03

DarkLight1337 mentioned this pull request Mar 15, 2025

[RFC]: Multi-modality Support on vLLM #4194

Open

53 tasks

mergify bot added the v1 label Mar 15, 2025

DarkLight1337 added multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed labels Mar 15, 2025

Add informative error message

2a88876

Signed-off-by: DarkLight1337 <[email protected]>

DarkLight1337 added this to Multi-modality Core Mar 15, 2025

DarkLight1337 moved this to In Progress in Multi-modality Core Mar 15, 2025

DarkLight1337 self-assigned this Mar 15, 2025

DarkLight1337 changed the title ~~[VLM][V1] Remove input cache client~~ [V1] Remove input cache client Mar 15, 2025

DarkLight1337 assigned ywang96 Mar 15, 2025

DarkLight1337 added 2 commits March 16, 2025 14:36

Merge branch 'main' into deprecate-mm-cache-client

7554e6e

Signed-off-by: DarkLight1337 <[email protected]>

Fix

f7cea21

Signed-off-by: DarkLight1337 <[email protected]>

vllm-project deleted a comment from DarkLight1337 Mar 17, 2025

ywang96 added 3 commits March 16, 2025 17:22

Merge remote-tracking branch 'upstream/main' into deprecate-mm-cache-…

a070e0d

…client

rename & cleanup

520d951

Signed-off-by: Roger Wang <[email protected]>

comment

643e4ca

Signed-off-by: Roger Wang <[email protected]>

ywang96 approved these changes Mar 17, 2025

View reviewed changes

update

45c558f

Signed-off-by: Roger Wang <[email protected]>

DarkLight1337 enabled auto-merge (squash) March 17, 2025 05:16

vllm-bot merged commit b539222 into vllm-project:main Mar 17, 2025
29 of 31 checks passed

github-project-automation bot moved this from In Progress to Done in Multi-modality Core Mar 17, 2025

DarkLight1337 deleted the deprecate-mm-cache-client branch March 17, 2025 07:00

DarkLight1337 mentioned this pull request Mar 26, 2025

[RFC]: Merge input processor and input mapper for multi-modal models #10114

Closed

57 tasks

DarkLight1337 mentioned this pull request Apr 8, 2025

[Bugfix] Avoid transferring cached multi-modal items from P0 to P1 #16273

Merged

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025

[V1] Remove input cache client (vllm-project#14864)

c3c3452

Signed-off-by: DarkLight1337 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]>

lgeiger mentioned this pull request Jun 10, 2025

[Misc] Remove unused MultiModalHasher.hash_prompt_mm_data #19422

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[V1] Remove input cache client #14864

[V1] Remove input cache client #14864

Uh oh!

DarkLight1337 commented Mar 15, 2025

Uh oh!

github-actions bot commented Mar 15, 2025

Uh oh!

ywang96 commented Mar 17, 2025

Uh oh!

ywang96 left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[V1] Remove input cache client #14864

[V1] Remove input cache client #14864

Uh oh!

Conversation

DarkLight1337 commented Mar 15, 2025

Uh oh!

github-actions bot commented Mar 15, 2025

Uh oh!

ywang96 commented Mar 17, 2025

Uh oh!

ywang96 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ywang96 left a comment •

edited

Loading