
Conversation

jeejeelee
Collaborator

@jeejeelee commented Feb 10, 2025

FIX #12967

Signed-off-by: Jee Jee Li <[email protected]>
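For context, a minimal reproduction sketch of the linked bug, assuming the failure mode its title describes (a Triton error when LoRA is enabled and the current CUDA device is not cuda:0); the model name and device index here are illustrative, not taken from the report:

```python
# Hedged reproduction sketch for #12967; model name and device index are
# illustrative. The failure appeared when LoRA was enabled and the
# process's current CUDA device was not cuda:0.
import torch
from vllm import LLM

torch.cuda.set_device(1)  # make cuda:1 the current device instead of cuda:0

# With enable_lora=True, initializing on a non-default device raised a
# Triton error from LoRA's Triton kernels.
llm = LLM(model="meta-llama/Llama-2-7b-hf", enable_lora=True)
```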

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small but essential subset of tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Jee Jee Li <[email protected]>
@jeejeelee requested a review from mgoin February 10, 2025 15:10
Signed-off-by: Jee Jee Li <[email protected]>
Member

@tlrmchlsmth left a comment

It seems we will need to do this for all Triton kernels. @fabianlim ran into this problem on Bamba as well (it uses Triton kernels for the Mamba mixer).

If we have to do this everywhere, it will be easy to miss a spot. Is it possible to do this in the GPUModelRunner instead?
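For illustration, a minimal sketch of the per-kernel workaround being discussed, assuming the fix is to pin the current CUDA device before each Triton launch; the helper name is hypothetical, not vLLM's actual code:

```python
# Triton launches kernels on the *current* CUDA device rather than the
# device of the tensors passed in, so pin the current device to the input
# tensor's device around each launch. `launch_on_tensor_device` is a
# hypothetical helper for illustration.
import torch

def launch_on_tensor_device(launch_fn, x: torch.Tensor, *args, **kwargs):
    # torch.cuda.device temporarily switches the current CUDA device and
    # restores the previous one on exit.
    with torch.cuda.device(x.device):
        return launch_fn(x, *args, **kwargs)
```

Repeating this wrapper at every Triton call site is exactly the "easy to miss a spot" concern raised above.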

@jeejeelee
Collaborator Author

> It seems we will need to do this for all Triton kernels. @fabianlim ran into this problem on Bamba as well (it uses Triton kernels for the Mamba mixer).
>
> If we have to do this everywhere, it will be easy to miss a spot. Is it possible to do this in the GPUModelRunner instead?

Makes sense, I will handle it ASAP.
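For reference, a sketch of the runner-level alternative suggested above, assuming the idea is to set the current CUDA device once at initialization so every later Triton launch targets it by default; the real GPUModelRunner takes more constructor arguments, so this only shows where the call would go:

```python
# Hypothetical, simplified GPUModelRunner showing the runner-level fix:
# one process-wide set_device call replaces per-kernel device pinning.
import torch

class GPUModelRunner:
    def __init__(self, device: torch.device):
        self.device = device
        if self.device.type == "cuda":
            # Make `device` the current CUDA device for this process so
            # Triton kernels launched later run on it by default.
            torch.cuda.set_device(self.device)
```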

@jeejeelee closed this Feb 11, 2025
@jeejeelee
Collaborator Author

#13027 solves this issue in a better way, so I'm closing this one.

@jeejeelee deleted the fix-triton-device branch February 11, 2025 03:13


Development

Successfully merging this pull request may close these issues.

[Bug]: Triton error when initializing LLM(...) when enable_lora=True and cuda device != cuda:0

2 participants