[0.7.3] optimize Qwen2.5 vl vit #623

zouyida2002 · 2025-04-22T13:05:15Z

What this PR does / why we need it?

optimize Qwen2 5 vl vit with pta

Does this PR introduce any user-facing change?

no

How was this patch tested?

we've tested on benchmark and it proves to be equal.

Signed-off-by: zouyida <[email protected]>

wangxiyuan · 2025-04-23T08:44:13Z

I'm fine with this change, this is for qwen2.5 vl performance improvement. @ganyi1996ppo please double check it. Thanks.

wangxiyuan · 2025-04-23T08:45:29Z

vllm_ascend/ops/rotary_embedding.py

+
+
 RotaryEmbedding.forward_oot = rope_forward_oot
+MRotaryEmbedding.forward = mrope_forward


basically, we don't want to change anything in vllm except foward_oot. For this kind of change, we should ask vllm to satisfy our requirement. Let's do it in the future. Thanks.

ganyi1996ppo · 2025-04-23T08:57:48Z

vllm_ascend/models/qwen2_5_vl.py

+
+        q, k, v = (rearrange(x, "s b ... -> b s ...").contiguous()
+                   for x in (q, k, v))
+        q = torch_npu.npu_rotary_mul(q, cos, sin)


Do we have any custom op here? Qwen2.5 VL, what's the shape of these q, k, cos and sin?

The function's functionality is as follows:

x1, x2 = torch.chunk(q, 2, -1) x_new = torch.cat((-x2, x1), dim=-1) output = cos * x + sin * x_new

I cann't find any custom op that meets my expectations.

Looks exactly what normal rotary embedding do.....

We can merge this PR first and optimize at next PR.

ok, thanks for your advice, I will optimize it soon.

Yikun · 2025-04-25T00:24:25Z

This should also be merged to main before v0.8.4.rc2.

zouyida2002 added 2 commits April 22, 2025 20:53

optimize qwen2_5_vl vit

e446366

Signed-off-by: zouyida <[email protected]>

optimize qwen2_5_vl vit

3665dfe

Signed-off-by: zouyida <[email protected]>

github-actions bot added the module:ops label Apr 22, 2025

zouyida2002 added 3 commits April 22, 2025 21:07

optimize qwen2_5_vl vit

8e629fd

Signed-off-by: zouyida <[email protected]>

optimize qwen2_5_vl vit

872a38f

Signed-off-by: zouyida <[email protected]>

optimize qwen2_5_vl vit

c61f387

Signed-off-by: zouyida <[email protected]>

wangxiyuan changed the title ~~optimize Qwen2.5 vl vit~~ [0.7.3] optimize Qwen2.5 vl vit Apr 23, 2025

optimize qwen2_5_vl vit

c4573ed

Signed-off-by: zouyida <[email protected]>

wangxiyuan approved these changes Apr 23, 2025

View reviewed changes

ganyi1996ppo reviewed Apr 23, 2025

View reviewed changes

ganyi1996ppo merged commit 1e56aae into vllm-project:v0.7.3-dev Apr 23, 2025
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[0.7.3] optimize Qwen2.5 vl vit #623

[0.7.3] optimize Qwen2.5 vl vit #623

Uh oh!

zouyida2002 commented Apr 22, 2025

Uh oh!

wangxiyuan commented Apr 23, 2025

Uh oh!

wangxiyuan Apr 23, 2025

Uh oh!

ganyi1996ppo Apr 23, 2025

Uh oh!

zouyida2052 Apr 23, 2025

Uh oh!

ganyi1996ppo Apr 23, 2025

Uh oh!

ganyi1996ppo Apr 23, 2025

Uh oh!

zouyida2002 Apr 23, 2025

Uh oh!

Uh oh!

Yikun commented Apr 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants



		RotaryEmbedding.forward_oot = rope_forward_oot
		MRotaryEmbedding.forward = mrope_forward

[0.7.3] optimize Qwen2.5 vl vit #623

[0.7.3] optimize Qwen2.5 vl vit #623

Uh oh!

Conversation

zouyida2002 commented Apr 22, 2025

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

wangxiyuan commented Apr 23, 2025

Uh oh!

wangxiyuan Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

ganyi1996ppo Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

zouyida2052 Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

ganyi1996ppo Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

ganyi1996ppo Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

zouyida2002 Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Yikun commented Apr 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants