Conversation

@Jun-Howie (Collaborator) commented Sep 29, 2025

What's New

  1. Model Support

    • Added support for the Qwen3-VL Instruct and Thinking models.
  2. Quantized Models

    • Provided additional FP8 and AWQ quantized models.
    • Fully tested and supported.
  3. Tool Calling

    • Added support for tool_call, consistent with the Qwen3 series (a request sketch follows this list).
  4. Frontend UI

    • Introduced the --enable-expert-parallel parameter.
    • Aligned with vllm/core.py: with expert parallelism enabled, the model's routed experts are divided evenly, so each GPU is allocated the same number of experts.
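For reference, a minimal tool-calling request sketch against a vLLM OpenAI-compatible deployment. The get_weather tool and prompt are made-up examples, and depending on the vLLM version, automatic tool choice may additionally require launching with --enable-auto-tool-choice and --tool-call-parser hermes, which are not shown in the serve commands below:

# Hypothetical tool-calling request; the tools schema follows the
# standard OpenAI chat-completions format that vLLM accepts.
curl http://localhost:8000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "./Qwen3-VL-235B-A22B-Instruct-AWQ",
        "messages": [{"role": "user", "content": "What is the weather in Beijing?"}],
        "tools": [{
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Get the current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"]
                }
            }
        }]
    }'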

Dependencies (vLLM Backend)

pip install git+https://github.com/huggingface/transformers
pip install qwen-vl-utils==0.0.14

# Stable release:
# pip install 'vllm>0.10.2'
# If the stable release does not work, install the nightly build:
uv pip install -U vllm \
    --torch-backend=auto \
    --extra-index-url https://wheels.vllm.ai/nightly
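
A quick sanity check after installing (a minimal sketch: at the time of this PR, Qwen3-VL support lives on the transformers main branch, so the transformers version should report a dev build):

# Print the installed versions to confirm both packages import cleanly
python -c "import transformers, vllm; print(transformers.__version__, vllm.__version__)"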

Usage Examples

2 × A100-80G (AWQ)

vllm serve \
    ./Qwen3-VL-235B-A22B-Instruct-AWQ \
    --enable-expert-parallel \
    --max-model-len 32768 \
    --tensor-parallel-size 2
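
Once the server is up, a minimal vision request sketch (the image URL and prompt are placeholders; the payload uses the standard OpenAI-compatible image_url content format, and the model field assumes the default served model name, i.e. the path passed to vllm serve):

curl http://localhost:8000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "./Qwen3-VL-235B-A22B-Instruct-AWQ",
        "messages": [{
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": "https://example.com/demo.jpg"}},
                {"type": "text", "text": "Describe this image."}
            ]
        }],
        "max_tokens": 512
    }'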

4 × A100-80G (FP8)

vllm serve \
    ./Qwen3-VL-235B-A22B-Instruct-FP8 \
    --enable-expert-parallel \
    --max-model-len 32768 \
    --tensor-parallel-size 4
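
Either deployment can be verified the same way; the vLLM OpenAI-compatible server listens on port 8000 by default and exposes a model listing endpoint:

# Confirm the server is up and check the served model name
curl http://localhost:8000/v1/models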

Test Results

[test result screenshots]

@XprobeBot XprobeBot added the gpu label Sep 29, 2025
@XprobeBot XprobeBot added this to the v1.x milestone Sep 29, 2025
@qinxuye qinxuye changed the title from "Support Qwen3-VL" to "FEAT: [model] Support Qwen3-VL" Sep 29, 2025
@qinxuye (Contributor) left a comment:
LGTM

@qinxuye qinxuye merged commit bc3b42c into xorbitsai:main Sep 30, 2025
4 of 13 checks passed
@Jun-Howie Jun-Howie deleted the Qwen3-VL branch October 9, 2025 09:33