[Structured Output][Refactor] Move `apply_grammar_bitmask()` method from `ModelRunner` to structured output utils #21999

shen-shanshan · 2025-07-31T08:17:46Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Purpose

Currently, the feature structured output is closely coupled with the model_runner, and we need to implement duplicate apply_grammar_bitmask() method in different model_runner of each platform, e.g., gpu_model_runner, npu_model_runner. Once there are changes have made in this method, we need to update the method in all kinds of model_runner to sync these changes.

Thus, maybe it's better to move these structured output related code in model_runner to the structured_output module to make it clearer and more extensible.

Test Plan

Test Result

(Optional) Documentation Update

gemini-code-assist

Code Review

This pull request removes a redundant null check for grammar_bitmask. This is a good code cleanup that improves maintainability by removing unnecessary code. The change is straightforward and relies on the caller performing the null check, as stated in the PR description.

shen-shanshan · 2025-07-31T08:19:45Z

CC: @russellb

github-actions · 2025-07-31T09:17:07Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

russellb

The StructuredOutputManager isn't an object used by the model runner right now. How about just making it a utility function somewhere instead of hanging it off of that class? Maybe here https://github.com/vllm-project/vllm/blob/main/vllm/v1/structured_output/utils.py

shen-shanshan · 2025-08-05T12:49:31Z

The StructuredOutputManager isn't an object used by the model runner right now. How about just making it a utility function somewhere instead of hanging it off of that class? Maybe here https://github.com/vllm-project/vllm/blob/main/vllm/v1/structured_output/utils.py

Thanks for your suggestion! I will modify it later~

mergify · 2025-08-07T06:13:59Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @shen-shanshan.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

shen-shanshan · 2025-08-18T02:00:19Z

I'm good with this in general, but it will need to be updated one more time.

It's in conflict with main. Before fixing it though, there are a few bug fix PRs that I'd like to merge first. Once they go in, if you can incorporate them into your change, I think we'll be good.

[FIXBUG] Correctly Apply Grammar Bitmask in Mixed Batches #22896

Upgrade xgrammar to 0.1.23 #22988

[Structured Outputs] [Bug] Fix misalignment in apply_grammar_bitmask causing unintended masking and NaN logits #22963

OK, no problem. After these fix PR have been merged, I will update this soon.

shen-shanshan · 2025-09-04T02:36:44Z

@russellb Hello, all the fix PRs you mentioned have already been merged, and I have rebased on the latest code. 😃

russellb

lgtm, thanks!

shen-shanshan · 2025-09-09T01:51:22Z

@russellb The CI run failed due to #24366, and I have rebased once again. Now the CI has all passed.

benchislett

LGTM

mergify · 2025-09-11T07:36:23Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @shen-shanshan.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

…tput utils Signed-off-by: shen-shanshan <[email protected]>

Signed-off-by: shen-shanshan <[email protected]>

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]>

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]> Signed-off-by: charlifu <[email protected]>

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]>

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

shen-shanshan requested review from WoosukKwon, alexm-redhat, comaniac, njhill, robertgshaw2-redhat and ywang96 as code owners July 31, 2025 08:17

mergify bot added the v1 label Jul 31, 2025

gemini-code-assist bot reviewed Jul 31, 2025

View reviewed changes

shen-shanshan changed the title ~~[Misc] Remove redundant check for bitmask in model_runner~~ [Structured Output] Remove redundant check for bitmask in model_runner Jul 31, 2025

shen-shanshan force-pushed the so branch from d0831e8 to 86a8dc5 Compare August 1, 2025 01:58

shen-shanshan changed the title ~~[Structured Output] Remove redundant check for bitmask in model_runner~~ [Structured Output] Replace if check for bitmask in model runner to assert Aug 1, 2025

shen-shanshan requested review from aarnphm, mgoin and russellb as code owners August 1, 2025 03:24

mergify bot added the structured-output label Aug 1, 2025

github-project-automation bot added this to Structured Output Aug 1, 2025

shen-shanshan changed the title ~~[Structured Output] Replace if check for bitmask in model runner to assert~~ [Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to StructuredOutputManager Aug 1, 2025

shen-shanshan changed the title ~~[Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to StructuredOutputManager~~ [Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to StructuredOutputManager Aug 1, 2025

shen-shanshan force-pushed the so branch from 07f9c09 to 5da78eb Compare August 4, 2025 03:17

russellb reviewed Aug 5, 2025

View reviewed changes

mergify bot added the needs-rebase label Aug 7, 2025

shen-shanshan force-pushed the so branch from 5da78eb to 6d63ac3 Compare August 7, 2025 09:16

mergify bot removed the needs-rebase label Aug 7, 2025

shen-shanshan force-pushed the so branch from 6d63ac3 to a924d17 Compare August 8, 2025 02:36

shen-shanshan mentioned this pull request Jul 17, 2025

[Feature]: Add Support for Guided Decoding (Structured Output) vllm-project/vllm-ascend#177

Closed

20 tasks

shen-shanshan changed the title ~~[Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to StructuredOutputManager~~ [Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to structured output utils Aug 8, 2025

shen-shanshan mentioned this pull request Aug 25, 2025

[Structured Output] Replace apply_grammar_bitmask() method with that in vllm to avoid maintenance vllm-project/vllm-ascend#2524

Merged

shen-shanshan force-pushed the so branch from c909e3c to 1d534e1 Compare September 3, 2025 07:42

mergify bot removed the needs-rebase label Sep 3, 2025

russellb approved these changes Sep 4, 2025

View reviewed changes

russellb enabled auto-merge (squash) September 4, 2025 23:31

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 4, 2025

auto-merge was automatically disabled September 8, 2025 01:45
Head branch was pushed to by a user without write access

shen-shanshan force-pushed the so branch from fb41dd5 to f3620d6 Compare September 8, 2025 01:45

shen-shanshan requested a review from benchislett as a code owner September 8, 2025 01:45

benchislett approved these changes Sep 9, 2025

View reviewed changes

mergify bot added the needs-rebase label Sep 11, 2025

shen-shanshan added 3 commits September 15, 2025 01:51

Move apply_grammar_bitmask() method from ModelRunner to structured ou…

8a8927b

…tput utils Signed-off-by: shen-shanshan <[email protected]>

rebase

ec7b892

Signed-off-by: shen-shanshan <[email protected]>

rebase

2a93b93

Signed-off-by: shen-shanshan <[email protected]>

shen-shanshan force-pushed the so branch from 285e1e0 to 2a93b93 Compare September 15, 2025 01:58

mergify bot removed the needs-rebase label Sep 15, 2025

DarkLight1337 merged commit 470484a into vllm-project:main Sep 18, 2025
42 checks passed

github-project-automation bot moved this to Done in Structured Output Sep 18, 2025

debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025

[Structured Output][Refactor] Move apply_grammar_bitmask() method f…

a408948

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Structured Output][Refactor] Move apply_grammar_bitmask() method f…

a643633

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Structured Output][Refactor] Move apply_grammar_bitmask() method f…

3e164f8

…rom `ModelRunner` to structured output utils (vllm-project#21999) Signed-off-by: shen-shanshan <[email protected]>

Uh oh!

[Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to structured output utils #21999

[Structured Output][Refactor] Move apply_grammar_bitmask() method from ModelRunner to structured output utils #21999

Uh oh!

Conversation

shen-shanshan commented Jul 31, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Essential Elements of an Effective PR Description Checklist

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

shen-shanshan commented Jul 31, 2025

Uh oh!

github-actions bot commented Jul 31, 2025

Uh oh!

russellb left a comment

Choose a reason for hiding this comment

Uh oh!

shen-shanshan commented Aug 5, 2025

Uh oh!

mergify bot commented Aug 7, 2025

Uh oh!

shen-shanshan commented Aug 18, 2025

Uh oh!

shen-shanshan commented Sep 4, 2025

Uh oh!

russellb left a comment

Choose a reason for hiding this comment

Uh oh!

shen-shanshan commented Sep 9, 2025

Uh oh!

benchislett left a comment

Choose a reason for hiding this comment

Uh oh!

mergify bot commented Sep 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Structured Output][Refactor] Move `apply_grammar_bitmask()` method from `ModelRunner` to structured output utils #21999

[Structured Output][Refactor] Move `apply_grammar_bitmask()` method from `ModelRunner` to structured output utils #21999

shen-shanshan commented Jul 31, 2025 •

edited by github-actions bot

Loading