[Doc]Add benchmark scripts #74

Potabk · 2025-02-17T09:01:42Z

What this PR does / why we need it?

This PR adds benchmark scripts for NPU so that developers can easily run performance tests on their own machines with one line of code.

Does this PR introduce any user-facing change?

How was this patch tested?

Yikun

So far I have only reviewed tutorials.md and benchmark_latency.py.

The open question is whether we should copy the vLLM benchmark scripts here or just use them directly.

Yikun · 2025-02-27T15:57:47Z

docs/source/tutorials.md

 INFO 02-19 17:37:35 metrics.py:453] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 1.9 tokens/s, Running: 0 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 0.0%, CPU KV cache usage: 0.0%.
+```
+
+## Performance Benchmark


This should go in the developer guide.

Yikun · 2025-02-27T16:05:29Z

benchmarks/backend_request_func.py

@@ -0,0 +1,193 @@
+# SPDX-License-Identifier: Apache-2.0


#
# Copyright (c) 2025 Huawei Technologies Co., Ltd. All Rights Reserved.
# This file is a part of the vllm-ascend project.
# Adapted from vllm-project/vllm/benchmarks/backend_request_func.py
# Copyright 2023 The vLLM team.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#

Yikun · 2025-02-27T16:06:36Z

benchmarks/backend_request_func.py

+            **kwargs,
+        )
+
+ASYNC_REQUEST_FUNCS = {


Please add a note describing the differences from vLLM.
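For readers unfamiliar with the hunk above: `ASYNC_REQUEST_FUNCS` opens a dictionary, which suggests a dispatch table keyed by backend name. The sketch below illustrates that pattern only. The names `REQUEST_FUNCS`, `fake_backend_request` and `send` are hypothetical and are not taken from this PR or from vLLM, and the "backend" just echoes its input instead of making an HTTP call.

```python
import asyncio
from typing import Awaitable, Callable, Dict


async def fake_backend_request(prompt: str) -> str:
    """Hypothetical stand-in for a real backend call; it only echoes the prompt."""
    await asyncio.sleep(0)  # yield to the event loop, as a real network call would
    return f"echo: {prompt}"


# Hypothetical dispatch table: backend name -> async request function.
REQUEST_FUNCS: Dict[str, Callable[[str], Awaitable[str]]] = {
    "fake": fake_backend_request,
}


async def send(backend: str, prompt: str) -> str:
    """Look up the request function for `backend` and await it."""
    try:
        request_func = REQUEST_FUNCS[backend]
    except KeyError:
        raise ValueError(f"Unknown backend: {backend!r}") from None
    return await request_func(prompt)


if __name__ == "__main__":
    print(asyncio.run(send("fake", "hello")))
```

With a table like this, a note on differences from vLLM could simply list which keys were added, removed, or changed.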

Yikun · 2025-02-27T16:06:49Z

benchmarks/benchmark_latency.py

@@ -0,0 +1,152 @@
+# SPDX-License-Identifier: Apache-2.0


Potabk · 2025-03-21T07:09:50Z

Please note that we copied the benchmark code from the vLLM benchmarks. This is only a stopgap measure. In the future, we will push the community to complete the benchmark CLI, after which we will run benchmarks like this:

vllm bench <options> --parameters ...

for details see *13993

wangxiyuan · 2025-03-21T07:41:16Z

benchmarks/README.md

+### Introduction
+This document outlines the benchmarking process for vllm-ascend, designed to evaluate its performance under various workloads. The primary goal is to help developers assess whether their pull requests improve or degrade vllm-ascend's performance.To maintain consistency with the vllm community, we have reused the vllm community [benchmark](https://github.com/vllm-project/vllm/tree/main/benchmarks) script.
+### Overview
+**Benchmarking Coverage**: We measure latency, throughput, and fixed-QPS serving on the Atlas800I A2 (see [quick_start](./quick_start.md) to learn more supported devices list), with different models(coming soon).


The link ./quick_start.md returns a 404.

fixed at 98cfc5d
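The README excerpt above names latency and throughput as the quantities being measured. As a rough illustration of how such numbers can be derived from per-request timings, here is a minimal sketch. It is not the code from this PR: `time_iterations`, `summarize` and the `run_once` callable are hypothetical names, and the throughput figure assumes requests run one after another rather than concurrently.

```python
import math
import time
from typing import Callable, Dict, List, Tuple

Sample = Tuple[float, int]  # (elapsed seconds, tokens generated)


def time_iterations(run_once: Callable[[], int], num_iters: int) -> List[Sample]:
    """Call `run_once` repeatedly and record wall-clock time and token count."""
    samples: List[Sample] = []
    for _ in range(num_iters):
        start = time.perf_counter()
        tokens = run_once()  # hypothetical workload; returns tokens generated
        samples.append((time.perf_counter() - start, tokens))
    return samples


def summarize(samples: List[Sample]) -> Dict[str, float]:
    """Compute mean/percentile latency and sequential token throughput."""
    latencies = sorted(elapsed for elapsed, _ in samples)
    total_time = sum(latencies)
    total_tokens = sum(tokens for _, tokens in samples)

    def percentile(p: int) -> float:
        # Nearest-rank percentile: smallest value with at least p% of samples at or below it.
        rank = max(1, math.ceil(p * len(latencies) / 100))
        return latencies[rank - 1]

    return {
        "mean_latency_s": total_time / len(latencies),
        "p50_latency_s": percentile(50),
        "p99_latency_s": percentile(99),
        "tokens_per_s": total_tokens / total_time,
    }


if __name__ == "__main__":
    # Stand-in workload: sleeps briefly and pretends 16 tokens were generated.
    def dummy_generate() -> int:
        time.sleep(0.01)
        return 16

    print(summarize(time_iterations(dummy_generate, num_iters=5)))
```

Comparing the same summary before and after a change is one way to check whether a pull request improves or degrades performance, which is the goal the README states.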

wangxiyuan · 2025-03-21T07:41:40Z

benchmarks/README.md

@@ -0,0 +1,54 @@
+### Introduction


fixed at 98cfc5d


fixed


Potabk changed the title ~~[Misc]dd benchmark scripts~~ [Misc]Add benchmark scripts Feb 17, 2025

Potabk changed the title ~~[Misc]Add benchmark scripts~~ [Misc][WIP]Add benchmark scripts Feb 17, 2025

Potabk force-pushed the benchmarks branch 2 times, most recently from 759243e to 1493751 Compare February 26, 2025 07:40

Potabk changed the title ~~[Misc][WIP]Add benchmark scripts~~ [Doc]Add benchmark scripts Feb 26, 2025

Potabk force-pushed the benchmarks branch from 76e7ee3 to 7ca9d2f Compare February 26, 2025 08:32

Yikun previously requested changes Feb 27, 2025


Potabk force-pushed the benchmarks branch from 4d13e93 to 9f7b9f9 Compare March 21, 2025 06:40

github-actions bot added the documentation Improvements or additions to documentation label Mar 21, 2025

wangxiyuan requested changes Mar 21, 2025


Potabk added 7 commits March 21, 2025 15:46

fix conflict

d6534ed

Signed-off-by: wangli <[email protected]>

fix isort

e2f2987

Signed-off-by: wangli <[email protected]>

fix ci

d3f29b1

Signed-off-by: wangli <[email protected]>

fix shellcheck

b2aba1e

Signed-off-by: wangli <[email protected]>

refactor script

d9cd73c

Signed-off-by: wangli <[email protected]>

fix doc

6ea791c

Signed-off-by: wangli <[email protected]>

fix doc title level

98cfc5d

Signed-off-by: wangli <[email protected]>

Potabk force-pushed the benchmarks branch from c4b745a to 98cfc5d Compare March 21, 2025 07:51

wangxiyuan approved these changes Mar 21, 2025


wangxiyuan merged commit 9a175ca into vllm-project:main Mar 21, 2025
5 checks passed

Potabk deleted the benchmarks branch April 1, 2025 08:55

Skywalker-EP pushed a commit to Skywalker-EP/vllm-ascend that referenced this pull request Jul 24, 2025

Merge pull request vllm-project#74 from raindaywhu/dev_whq_eplb

53728f3

running time reduction forward_before and forward_end

offline893 pushed a commit to offline893/vllm-ascend that referenced this pull request Sep 9, 2025

Merge pull request vllm-project#74 from raindaywhu/dev_whq_eplb

1169dfc

running time reduction forward_before and forward_end
