[BugFix] Fix async scheduling + request preemption #26385

njhill · 2025-10-07T22:43:49Z

Ensure model runner is refreshed with all request token ids following preemption, so that correct input ids are used.

vllm/v1/core/sched/scheduler.py

This is an attempt at a minimally invasive quick fix. I'm working on a better fix which will also address the penalties sampling parameter incompatibility. Signed-off-by: Nick Hill <[email protected]>

vllm/v1/worker/gpu_model_runner.py

njhill · 2025-10-08T14:56:16Z

@WoosukKwon so I think we can piggy-back on #24926 for the scheduler side of this? Then the remaining changes here would be even smaller...

mergify · 2025-10-09T09:39:44Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @njhill.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

# Conflicts: # vllm/v1/core/sched/output.py

Signed-off-by: Nick Hill <[email protected]>

njhill · 2025-10-09T17:24:25Z

@WoosukKwon the change becomes even smaller after rebasing on #24926. It includes another simplification I noticed that I've opened a separate PR for: #26508.

I will aim to add a test next.

WoosukKwon

LGTM. Thanks so much for the fix! It'd be nice if we can have a unit test to prevent this happening again.

…c-preempt # Conflicts: # vllm/v1/worker/gpu_model_runner.py

Signed-off-by: Nick Hill <[email protected]>

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Dhruvil Bhatt <[email protected]>

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: bbartels <[email protected]>

mergify bot added the v1 label Oct 7, 2025

njhill commented Oct 7, 2025

View reviewed changes

vllm/v1/core/sched/scheduler.py Outdated Show resolved Hide resolved

[BugFix] Fix async scheduling + request preemption

4ce9ae4

This is an attempt at a minimally invasive quick fix. I'm working on a better fix which will also address the penalties sampling parameter incompatibility. Signed-off-by: Nick Hill <[email protected]>

njhill force-pushed the fix-async-preempt branch from 24457e0 to 4ce9ae4 Compare October 8, 2025 03:48

WoosukKwon reviewed Oct 8, 2025

View reviewed changes

vllm/v1/worker/gpu_model_runner.py Outdated Show resolved Hide resolved

WoosukKwon reviewed Oct 8, 2025

View reviewed changes

vllm/v1/worker/gpu_model_runner.py Outdated Show resolved Hide resolved

WoosukKwon mentioned this pull request Oct 8, 2025

[Core][KVConnector] Propagate all tokens on resumed preemptions #24926

Merged

5 tasks

mergify bot added the needs-rebase label Oct 9, 2025

njhill added 2 commits October 9, 2025 08:25

Merge remote-tracking branch 'origin/main' into fix-async-preempt

1477474

# Conflicts: # vllm/v1/core/sched/output.py

rebased/simplified

a8511cd

Signed-off-by: Nick Hill <[email protected]>

mergify bot removed the needs-rebase label Oct 9, 2025

remove added assert

0c1568a

Signed-off-by: Nick Hill <[email protected]>

WoosukKwon approved these changes Oct 9, 2025

View reviewed changes

njhill added 5 commits October 9, 2025 20:11

Merge remote-tracking branch 'refs/remotes/origin/main' into fix-asyn…

a0364b2

…c-preempt # Conflicts: # vllm/v1/worker/gpu_model_runner.py

fix and move check

195c3f6

Signed-off-by: Nick Hill <[email protected]>

add e2e test

ca2dc06

Signed-off-by: Nick Hill <[email protected]>

Merge remote-tracking branch 'origin/main' into fix-async-preempt

1f1be17

move test

4df3589

Signed-off-by: Nick Hill <[email protected]>

njhill marked this pull request as ready for review October 10, 2025 16:29

njhill requested review from alexm-redhat, comaniac, robertgshaw2-redhat and ywang96 as code owners October 10, 2025 16:29

fix test imports after move

c3eb64b

Signed-off-by: Nick Hill <[email protected]>

njhill force-pushed the fix-async-preempt branch from bee9aa2 to c3eb64b Compare October 10, 2025 16:57

njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 10, 2025

njhill enabled auto-merge (squash) October 10, 2025 19:31

njhill merged commit 949cb01 into vllm-project:main Oct 10, 2025
49 checks passed

njhill deleted the fix-async-preempt branch October 10, 2025 20:37

Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025

[BugFix] Fix async scheduling + request preemption (vllm-project#26385)

5e4a0a1

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: Dhruvil Bhatt <[email protected]>

bbartels pushed a commit to bbartels/vllm that referenced this pull request Oct 16, 2025

[BugFix] Fix async scheduling + request preemption (vllm-project#26385)

425eb45

Signed-off-by: Nick Hill <[email protected]> Signed-off-by: bbartels <[email protected]>

Ronald1995 mentioned this pull request Oct 16, 2025

[Core] Async Scheduling X Spec Decoding Compatibility #24799

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[BugFix] Fix async scheduling + request preemption #26385

[BugFix] Fix async scheduling + request preemption #26385

Uh oh!

njhill commented Oct 7, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

njhill commented Oct 8, 2025

Uh oh!

mergify bot commented Oct 9, 2025

Uh oh!

njhill commented Oct 9, 2025

Uh oh!

WoosukKwon left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[BugFix] Fix async scheduling + request preemption #26385

[BugFix] Fix async scheduling + request preemption #26385

Uh oh!

Conversation

njhill commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

njhill commented Oct 8, 2025

Uh oh!

mergify bot commented Oct 9, 2025

Uh oh!

njhill commented Oct 9, 2025

Uh oh!

WoosukKwon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

njhill commented Oct 7, 2025 •

edited

Loading