
Conversation

@ywang96 ywang96 commented Sep 18, 2025

Purpose

For some reason we have two flags to specify the backend/endpoint type, and this can cause confusion since we're going to deprecate the benchmark scripts. This PR cleans them up.

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Roger Wang <[email protected]>
@mergify mergify bot added the performance Performance-related issues label Sep 18, 2025
@gemini-code-assist gemini-code-assist bot left a comment
Code Review

This pull request aims to clean up redundant CLI flags by deprecating --backend in favor of --endpoint-type. The changes are in the right direction, but I've found a critical issue that will cause the script to crash, and another issue related to the deprecation logic that could lead to silent misconfigurations. Please see my detailed comments for suggestions on how to fix these.

ywang96 commented Sep 18, 2025

/gemini review

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request aims to clean up CLI flags in vllm bench serve by deprecating --backend in favor of --endpoint-type. The changes are in the right direction, replacing usages of the old flag and adding deprecation warnings. However, I've found a critical bug in the deprecation logic that causes the user-provided --endpoint-type to be incorrectly ignored. My review includes a comment with a detailed explanation and a code suggestion to fix this issue, ensuring that flag precedence is handled correctly while maintaining backward compatibility for the default behavior.

Signed-off-by: Roger Wang <[email protected]>
ywang96 commented Sep 18, 2025

/gemini review

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request does a good job of cleaning up the command-line flags for vllm bench serve by deprecating --backend in favor of --endpoint-type. The changes are consistent and improve clarity.

I have one suggestion to make the deprecation handling more robust. Currently, if both flags are provided, --backend unconditionally overrides --endpoint-type, which could be surprising. I've proposed a change to raise an error in case of conflicting values to avoid ambiguity and prevent users from running benchmarks against an unintended endpoint.
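The precedence rule the review asks for can be sketched as below. This is a hypothetical illustration, not the PR's actual code (and note the PR later flipped which flag is kept): the deprecated flag wins only when the two flags don't conflict, and conflicting values raise an error instead of silently overriding.

```python
import argparse
import warnings

def resolve_endpoint_type(args: argparse.Namespace) -> str:
    """Resolve a deprecated alias flag against its replacement (sketch)."""
    if args.backend is not None:
        warnings.warn(
            "'--backend' is deprecated; use '--endpoint-type' instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        # Raise on conflicting values instead of silently overriding,
        # so users never benchmark an unintended endpoint.
        if args.endpoint_type is not None and args.endpoint_type != args.backend:
            raise ValueError(
                f"Conflicting values: --backend={args.backend!r} vs "
                f"--endpoint-type={args.endpoint_type!r}"
            )
        return args.backend
    # Fall back to a default when neither flag was given ("openai" is a
    # stand-in default for this sketch).
    return args.endpoint_type if args.endpoint_type is not None else "openai"


parser = argparse.ArgumentParser()
parser.add_argument("--backend", default=None)
parser.add_argument("--endpoint-type", dest="endpoint_type", default=None)
print(resolve_endpoint_type(parser.parse_args(["--backend", "vllm"])))  # vllm
```

With this shape, `--backend x --endpoint-type x` still works (same value, no ambiguity), while mismatched values fail fast.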

@ywang96 ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 18, 2025
@ywang96 ywang96 enabled auto-merge (squash) September 18, 2025 05:56
@ywang96 ywang96 requested a review from hmellor as a code owner September 18, 2025 06:27
@mergify mergify bot added the documentation Improvements or additions to documentation label Sep 18, 2025
ywang96 commented Sep 18, 2025

On second thought, it seems that the --backend flag is used more often throughout vLLM, so I'll keep that one instead.

Signed-off-by: Roger Wang <[email protected]>
  current_dt = datetime.now().strftime("%Y%m%d-%H%M%S")
  result_json["date"] = current_dt
- result_json["endpoint_type"] = args.endpoint_type
+ result_json["endpoint_type"] = args.backend
@ywang96 ywang96 Sep 18, 2025
The "endpoint_type" key here is not modified on purpose.

Collaborator
Maybe you want to dump it to "backend" too. I guess this result is only used by @huydhn?

Member Author
Yeah, I'm not sure if this will break PyTorch's dashboard, but let me also dump it to "backend" too.
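The compatibility move discussed here — keep the legacy JSON key so existing dashboard consumers don't break, and write the new key alongside it — can be sketched like this (hypothetical stand-in values; not the PR's actual code):

```python
import json
from datetime import datetime

# Sketch: emit both the legacy "endpoint_type" key (for existing consumers
# such as the PyTorch benchmark dashboard) and the new "backend" key.
backend = "openai-chat"  # stand-in for args.backend

result_json = {}
result_json["date"] = datetime.now().strftime("%Y%m%d-%H%M%S")
result_json["endpoint_type"] = backend  # legacy key, kept for compatibility
result_json["backend"] = backend        # new key
print(json.dumps(result_json))
```

Writing both keys lets downstream readers migrate to "backend" on their own schedule before the legacy key is dropped.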

ywang96 commented Sep 18, 2025

/gemini review

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request cleans up the command-line flags for vllm bench serve by deprecating --endpoint-type in favor of --backend. The changes are applied across documentation, tests, and the benchmark implementation. My review identifies a critical issue in the deprecation logic that could cause user-provided --endpoint-type values to be ignored. I've also suggested a refactoring to improve code clarity by renaming a variable to align with its new purpose.

type=str,
default=None,
choices=list(ASYNC_REQUEST_FUNCS.keys()),
help="'--endpoint-type' is deprecated and will be removed in v0.11.0. "
@yeqcharlotte yeqcharlotte Sep 18, 2025
nit: you could throw a warning with a customized action.
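The "customized action" idea can be sketched as an argparse `Action` subclass that warns only when the deprecated flag is actually passed (hypothetical sketch, not the code the PR ended up with):

```python
import argparse
import warnings

class DeprecatedStoreAction(argparse.Action):
    """Store action that emits a DeprecationWarning on use of the flag."""

    def __call__(self, parser, namespace, values, option_string=None):
        warnings.warn(
            f"{option_string} is deprecated and will be removed in a "
            "future release; use '--backend' instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        setattr(namespace, self.dest, values)


parser = argparse.ArgumentParser()
parser.add_argument("--backend", default="openai")
parser.add_argument(
    "--endpoint-type",
    dest="endpoint_type",
    default=None,
    action=DeprecatedStoreAction,
)
args = parser.parse_args(["--endpoint-type", "openai-chat"])
print(args.endpoint_type)  # openai-chat
```

The advantage over putting the notice in `help=` is that users who never touch the deprecated flag see nothing. (Python 3.13 later added a built-in `deprecated=True` parameter to `add_argument` for this exact pattern.)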

@ywang96 ywang96 Sep 18, 2025
See if you're okay with the current version

@yeqcharlotte yeqcharlotte left a comment
Other than the warning, LGTM.

@yeqcharlotte yeqcharlotte self-assigned this Sep 18, 2025
Signed-off-by: Roger Wang <[email protected]>
@ywang96 ywang96 merged commit 21da733 into vllm-project:main Sep 18, 2025
41 checks passed
debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
sducouedic pushed a commit to sducouedic/vllm that referenced this pull request Oct 16, 2025
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

Labels

documentation Improvements or additions to documentation performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed


3 participants