Skip to content

Commit 98cfc5d

Browse files
committed
fix doc title level
Signed-off-by: wangli <[email protected]>
1 parent 6ea791c commit 98cfc5d

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

benchmarks/README.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
1-
### Introduction
1+
# Introduction
22
This document outlines the benchmarking process for vllm-ascend, designed to evaluate its performance under various workloads. The primary goal is to help developers assess whether their pull requests improve or degrade vllm-ascend's performance.To maintain consistency with the vllm community, we have reused the vllm community [benchmark](https://github.com/vllm-project/vllm/tree/main/benchmarks) script.
3-
### Overview
4-
**Benchmarking Coverage**: We measure latency, throughput, and fixed-QPS serving on the Atlas800I A2 (see [quick_start](./quick_start.md) to learn more supported devices list), with different models(coming soon).
3+
# Overview
4+
**Benchmarking Coverage**: We measure latency, throughput, and fixed-QPS serving on the Atlas800I A2 (see [quick_start](../docs/source/quick_start.md) to learn more supported devices list), with different models(coming soon).
55
- Latency tests
66
- Input length: 32 tokens.
77
- Output length: 128 tokens.
@@ -26,8 +26,8 @@ This document outlines the benchmarking process for vllm-ascend, designed to eva
2626
**Benchmarking Duration**: about 800senond for single model.
2727

2828

29-
### Quick Use
30-
#### Prerequisites
29+
# Quick Use
30+
## Prerequisites
3131
Before running the benchmarks, ensure the following:
3232
- vllm and vllm-ascend are installed and properly set up in an NPU environment, as these scripts are specifically designed for NPU devices.
3333
- Install necessary dependencies for benchmarks:
@@ -37,7 +37,7 @@ Before running the benchmarks, ensure the following:
3737
3838
- Models and datasets are cached locally to accelerate execution. Modify the paths in the JSON files located in benchmarks/tests accordingly. feel free to add your own models and parameters in the JSON to run your customized benchmarks.
3939
40-
#### Run benchmarks
40+
## Run benchmarks
4141
The provided scripts automatically execute performance tests for serving, throughput, and latency. To start the benchmarking process, run command in the vllm-ascend root directory:
4242
```
4343
bash benchmarks/scripts/run-performance-benchmarks.sh

0 commit comments

Comments
 (0)