[Bugfix] Fix feature size calculation for LLaVA-NeXT #6982

DarkLight1337 · 2024-07-31T08:00:06Z

This PR primarily updates our code to be consistent with huggingface/transformers#32314.

The feature size calculation has also been updated to no longer depend on the floating-point error in CUDA for some specific image resolutions, e.g.:

# original_height = 183, original_width = 488, height = 24
>>> int(183 * torch.tensor(24).cuda() / 488)
8
>>> int(183 * torch.tensor(24).cpu() / 488)
9

This fixes a bug in which the number of image placeholder tokens becomes incorrect for such edge cases when the model is run on other devices such as CPU.

cc @xwjiang2010

github-actions · 2024-07-31T08:00:17Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

ywang96

LGTM but should we wait for a new transformers release then merge this with a version bump? It looks like that PR hasn't been released yet.

ywang96 · 2024-07-31T08:11:05Z

vllm/model_executor/models/internvl.py

 class InternVLImagePixelInputs(TypedDict):
    type: Literal["pixel_values"]
-    data: BatchedTensors
+    data: Union[torch.Tensor, List[torch.Tensor]]


Why do we no longer use BatchedTensors?

I updated the definition in #6836 but forgot to change the model files.

DarkLight1337 · 2024-07-31T08:19:27Z

LGTM but should we wait for a new transformers release then merge this with a version bump? It looks like that PR hasn't been released yet.

It's an internal change so it shouldn't break anything. I think we can do this in parallel with transformers development.

ywang96 · 2024-07-31T08:20:49Z

LGTM but should we wait for a new transformers release then merge this with a version bump? It looks like that PR hasn't been released yet.

It's an internal change so it shouldn't break anything. I think we can do this in parallel with transformers development.

I'm only worried about the model result consistency, but I guess since we're developing on the main branch it shouldn't be a big deal.

DarkLight1337 · 2024-07-31T08:25:03Z

LGTM but should we wait for a new transformers release then merge this with a version bump? It looks like that PR hasn't been released yet.

It's an internal change so it shouldn't break anything. I think we can do this in parallel with transformers development.

I'm only worried about the model result consistency, but I guess since we're developing on the main branch it shouldn't be a big deal.

Yeah, either way there will be a period of inconsistency as we can't time our release to be at the same time as transformers.

xwjiang2010

LGTM, thanks for the fix!

Signed-off-by: Alvant <[email protected]>

Signed-off-by: LeiWang1999 <[email protected]>

DarkLight1337 added 3 commits July 31, 2024 07:47

Fix wrong types

2b45671

Port over huggingface/transformers#32314

906263b

Make feature size calculation device-agnostic

d6a8440

DarkLight1337 requested a review from ywang96 July 31, 2024 08:01

ywang96 approved these changes Jul 31, 2024

View reviewed changes

DarkLight1337 added 2 commits July 31, 2024 08:16

Directly test specific sizes on the model

07d4e98

Format

a62b8f3

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 31, 2024

xwjiang2010 approved these changes Jul 31, 2024

View reviewed changes

DarkLight1337 merged commit daed30c into vllm-project:main Jul 31, 2024

DarkLight1337 deleted the llava-next-anyres-shapes branch July 31, 2024 15:46

dtrifiro mentioned this pull request Aug 5, 2024

Sync with [email protected] opendatahub-io/vllm#120

Closed

DarkLight1337 mentioned this pull request Aug 21, 2024

[Bug]: Mismatch in the number of image tokens and placeholders during batch inference #7669

Closed

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Bugfix] Fix feature size calculation for LLaVA-NeXT (vllm-project#6982)

0238c09

Signed-off-by: Alvant <[email protected]>

LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025

[Bugfix] Fix feature size calculation for LLaVA-NeXT (vllm-project#6982)

a429823

Signed-off-by: LeiWang1999 <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] Fix feature size calculation for LLaVA-NeXT #6982

[Bugfix] Fix feature size calculation for LLaVA-NeXT #6982

Uh oh!

DarkLight1337 commented Jul 31, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Jul 31, 2024

Uh oh!

ywang96 left a comment •

edited

Loading

Uh oh!

ywang96 Jul 31, 2024

Uh oh!

DarkLight1337 Jul 31, 2024

Uh oh!

DarkLight1337 commented Jul 31, 2024

Uh oh!

ywang96 commented Jul 31, 2024

Uh oh!

DarkLight1337 commented Jul 31, 2024 •

edited

Loading

Uh oh!

xwjiang2010 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Bugfix] Fix feature size calculation for LLaVA-NeXT #6982

[Bugfix] Fix feature size calculation for LLaVA-NeXT #6982

Uh oh!

Conversation

DarkLight1337 commented Jul 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 31, 2024

Uh oh!

ywang96 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ywang96 Jul 31, 2024

Choose a reason for hiding this comment

Uh oh!

DarkLight1337 Jul 31, 2024

Choose a reason for hiding this comment

Uh oh!

DarkLight1337 commented Jul 31, 2024

Uh oh!

ywang96 commented Jul 31, 2024

Uh oh!

DarkLight1337 commented Jul 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xwjiang2010 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DarkLight1337 commented Jul 31, 2024 •

edited

Loading

ywang96 left a comment •

edited

Loading

DarkLight1337 commented Jul 31, 2024 •

edited

Loading