[Bugfix] Fix Mistral3 spatial merge error #17270

mgoin · 2025-04-27T21:51:23Z

We were not patching spatial_merge_size into the vision config in both of the places that need it. As a result, the dummy inputs were built with the right spatial_merge_size, but actual inference ran without it, so the shapes mismatched during inference.
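To make the mismatch concrete, here is a minimal sketch of the token-count arithmetic. It assumes that merging collapses each spatial_merge_size x spatial_merge_size block of patches into one token. The helper name `num_image_tokens`, the 448x448 image, and the patch size of 14 are illustrative assumptions, not code from vLLM.

```python
def num_image_tokens(height: int, width: int, patch_size: int,
                     spatial_merge_size: int = 1) -> int:
    """Count image tokens after merging s x s patch blocks (hypothetical helper)."""
    rows = height // patch_size // spatial_merge_size
    cols = width // patch_size // spatial_merge_size
    return rows * cols


# Path where the merge size was patched in from the top-level config.
expected = num_image_tokens(448, 448, patch_size=14, spatial_merge_size=2)

# Path where the patch was missing, so the default of 1 is used.
actual = num_image_tokens(448, 448, patch_size=14)

print(expected, actual)  # 256 1024
```

If one code path sizes its buffers from the first count and the other produces the second, the tensors cannot line up.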

Here is where it is already being applied in vision.py:

vllm/vllm/model_executor/models/vision.py

Lines 63 to 67 in 20e489e

    
    if isinstance(vision_config, PixtralVisionConfig):
        # Need to sneak in spatial_merge_size for Mistral3
        vision_config.spatial_merge_size = getattr(hf_config,
                                                   "spatial_merge_size", 1)
        return PixtralHFEncoderInfo(vision_config)
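One way to keep both code paths consistent is to route the patch through a single shared helper. The sketch below uses `SimpleNamespace` objects as stand-ins for the real config classes; `patch_spatial_merge_size`, `mistral3_like`, and `pixtral_like` are hypothetical names for illustration and do not reflect how this PR is actually implemented.

```python
from types import SimpleNamespace


def patch_spatial_merge_size(hf_config, vision_config):
    """Copy spatial_merge_size from the top-level config onto the vision
    config, defaulting to 1 when it is absent (hypothetical helper)."""
    vision_config.spatial_merge_size = getattr(hf_config,
                                               "spatial_merge_size", 1)
    return vision_config


# Stand-in for a Mistral3-style config: merge size lives on the top level only.
mistral3_like = SimpleNamespace(
    spatial_merge_size=2,
    vision_config=SimpleNamespace(patch_size=14),
)

# Stand-in for a plain Pixtral-style config: no merge size anywhere.
pixtral_like = SimpleNamespace(vision_config=SimpleNamespace(patch_size=14))

for cfg in (mistral3_like, pixtral_like):
    patched = patch_spatial_merge_size(cfg, cfg.vision_config)
    print(patched.spatial_merge_size)  # 2, then 1
```

Calling the same helper wherever the vision config is consumed means the dummy-input path and the inference path cannot disagree on the merge size.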

With this PR, the example passes:

python ~/code/vllm/examples/offline_inference/vision_language.py -m mistral3 

Processed prompts: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:03<00:00,  1.23it/s, est. speed input: 2559.91 toks/s, output: 78.61 toks/s]
--------------------------------------------------
The image depicts a vibrant and picturesque scene featuring the iconic Tokyo Skytree, a tall broadcasting and observation tower located in Tokyo, Japan. The Skytree is framed by cherry blossom trees in full bloom, creating a beautiful, natural archway of pink flowers. The sky is clear and bright blue, enhancing
--------------------------------------------------
The image depicts a picturesque scene featuring cherry blossom trees in full bloom. The blossoms are predominantly pink and create a beautiful, almost tunnel-like effect with their branches framing the image. In the background, the iconic Tokyo Skytree, a tall broadcasting and observation tower located in Tokyo, Japan, is visible.
--------------------------------------------------
The image depicts a picturesque scene featuring the iconic Tokyo Skytree, a tall broadcasting and observation tower located in Tokyo, Japan. The tower is framed by a beautiful canopy of cherry blossom (sakura) trees in full bloom. The cherry blossoms are in various shades of pink, creating a vibrant and colorful
--------------------------------------------------
This image features a picturesque scene of cherry blossom trees in full bloom with their vibrant pink flowers. The trees frame the image, creating a natural archway that leads the viewer's eye towards the central focus of the image: the iconic Tokyo Skytree, a tall broadcasting and observation tower located in Tokyo, Japan
--------------------------------------------------

Signed-off-by: mgoin <[email protected]>

github-actions · 2025-04-27T21:51:31Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

DarkLight1337 · 2025-04-28T02:18:32Z

Thanks for debugging and fixing!


Fix Mistral3 spatial merge error

07650f2

Signed-off-by: mgoin <[email protected]>

simon-mo added this to the v0.8.5 milestone Apr 27, 2025

mgoin added the ready (ONLY add when PR is ready to merge/full CI is needed) and bug (Something isn't working) labels Apr 28, 2025

DarkLight1337 approved these changes Apr 28, 2025


DarkLight1337 enabled auto-merge (squash) April 28, 2025 02:18

vllm-bot merged commit cb3f2d8 into vllm-project:main Apr 28, 2025
49 of 66 checks passed

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025

[Bugfix] Fix Mistral3 spatial merge error (vllm-project#17270)

2d81d57

Signed-off-by: mgoin <[email protected]>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[Bugfix] Fix Mistral3 spatial merge error (vllm-project#17270)

8c20072

Signed-off-by: mgoin <[email protected]>

adobrzyn pushed a commit to HabanaAI/vllm-fork that referenced this pull request Apr 30, 2025

[Bugfix] Fix Mistral3 spatial merge error (vllm-project#17270)

8dc5351

Signed-off-by: mgoin <[email protected]> Signed-off-by: Agata Dobrzyniewicz <[email protected]>

lk-chen mentioned this pull request May 1, 2025

[Bug][V1] 'PixtralVisionConfig' object has no attribute 'spatial_merge_size' in 0.8.5 #17565

Closed

1 task

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Bugfix] Fix Mistral3 spatial merge error (vllm-project#17270)

0296ede

Signed-off-by: mgoin <[email protected]> Signed-off-by: Mu Huai <[email protected]>

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged

zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025

[Bugfix] Fix Mistral3 spatial merge error (vllm-project#17270)

23f37d3

Signed-off-by: mgoin <[email protected]> Signed-off-by: Yuqi Zhang <[email protected]>
