Conversation

@ji-huazhong (Collaborator) commented Feb 12, 2025

What this PR does / why we need it?

In open-r1, the rank 0 process creates an LLM instance and loads the model onto `npu:7`. We need to force the output tensor to be created on the same device as the query tensor, rather than on the process-default device.
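
As a rough illustration (not the actual patch), the fix amounts to allocating the output with the query tensor's device instead of relying on the default device. The function name and shapes below are hypothetical:

```python
import torch

def make_attn_output(query: torch.Tensor, num_heads: int, head_size: int) -> torch.Tensor:
    # The bug: allocating without an explicit device lands the output on the
    # process-default device (e.g. npu:0), while `query` may live on npu:7,
    # causing a device-mismatch error in the attention path.
    # The fix: create the output on the same device (and dtype) as the query.
    return torch.empty(
        query.shape[0], num_heads, head_size,
        dtype=query.dtype,
        device=query.device,  # force same device as the query tensor
    )
```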

Does this PR introduce any user-facing change?

No

How was this patch tested?

Tested against the main branch.

@wangxiyuan merged commit 49f3cb3 into vllm-project:v0.7.1-release on Feb 13, 2025 (1 check passed)
@Yikun (Collaborator) commented Feb 13, 2025

Looks like we should also fix this on main; would you mind cherry-picking this? @ji-huazhong

@ji-huazhong (Collaborator, Author) commented Feb 13, 2025

This issue has been addressed on main; see #25. @Yikun

@ji-huazhong deleted the fix branch on February 17, 2025 at 07:14
Angazenn pushed a commit to Angazenn/vllm-ascend that referenced this pull request on Feb 21, 2025 (vllm-project#54)
ZhengWG pushed a commit to ZhengWG/vllm-ascend that referenced this pull request on Jun 18, 2025: "move get_init_expert_map to forward_before"
offline893 pushed a commit to offline893/vllm-ascend that referenced this pull request on Sep 9, 2025: "move get_init_expert_map to forward_before"