@njhill (Member) commented Oct 9, 2025

This is a minimally invasive fix that makes the current async scheduling implementation compatible with the built-in sampling parameters, specifically penalties and bad_words.
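For context, here is a minimal sketch of why penalties and bad_words depend on the request's output-token history. It is not the vLLM implementation. The function names (`apply_penalties`, `apply_bad_words`) and the plain-list logits are hypothetical and chosen only for illustration. The penalty formulas follow the commonly used presence/frequency definitions. The sketch assumes that a scheduler which prepares the next step before the previous step's sampled tokens are recorded could otherwise apply these adjustments to a stale history.

```python
from collections import Counter


def apply_penalties(logits, output_token_ids,
                    presence_penalty=0.0, frequency_penalty=0.0):
    """Hypothetical helper: return a copy of logits with presence and
    frequency penalties applied, based on the tokens generated so far."""
    counts = Counter(output_token_ids)
    penalized = list(logits)
    for token_id, count in counts.items():
        # Frequency penalty scales with how often the token was generated.
        penalized[token_id] -= frequency_penalty * count
        # Presence penalty is applied once if the token appeared at all.
        penalized[token_id] -= presence_penalty
    return penalized


def apply_bad_words(logits, output_token_ids, bad_words_token_ids):
    """Hypothetical helper: mask the final token of every bad-word
    sequence whose prefix matches the tail of the output history."""
    masked = list(logits)
    for bad in bad_words_token_ids:
        prefix, last = bad[:-1], bad[-1]
        if len(prefix) > len(output_token_ids):
            continue
        # A single-token bad word has an empty prefix and always applies.
        if not prefix or output_token_ids[-len(prefix):] == prefix:
            masked[last] = float("-inf")
    return masked


logits = [1.0, 2.0, 3.0, 4.0]
full_history = [2, 2, 3]   # all tokens sampled so far
stale_history = [2]        # history missing the two most recent tokens

fresh = apply_penalties(logits, full_history,
                        presence_penalty=0.5, frequency_penalty=0.25)
stale = apply_penalties(logits, stale_history,
                        presence_penalty=0.5, frequency_penalty=0.25)
masked = apply_bad_words(logits, [2, 2], [[2, 3], [1]])
```

With the full history, tokens 2 and 3 are both penalized. With the stale history, token 3 is left untouched and token 2 is under-penalized. That is the kind of correctness gap a fix in this area has to rule out.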

We plan to significantly refactor this but would like to first fix existing correctness issues so that async scheduling can be enabled by default.

@mergify mergify bot added the v1 label Oct 9, 2025
@njhill njhill force-pushed the async-sched-penalties branch from 9803a4b to 24a709f Compare October 9, 2025 15:32
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
@njhill njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 10, 2025
@njhill (Member, Author) commented Oct 10, 2025

@WoosukKwon this one is ready. The e2e test also covers the combination of these parameters with async scheduling and preemption.

@WoosukKwon WoosukKwon enabled auto-merge (squash) October 10, 2025 23:09
@WoosukKwon WoosukKwon merged commit 5bc26c4 into vllm-project:main Oct 10, 2025
48 checks passed
@njhill njhill deleted the async-sched-penalties branch October 10, 2025 23:52
Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025
bbartels pushed a commit to bbartels/vllm that referenced this pull request Oct 16, 2025
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
