Skip to content

Commit 2c3b192

Browse files
committed
server: bench: init
1 parent 9269c11 commit 2c3b192

File tree

2 files changed

+9
-11
lines changed

2 files changed

+9
-11
lines changed

.github/workflows/bench.yml

Lines changed: 7 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -106,7 +106,7 @@ jobs:
106106
--branch ${{ github.head_ref || github.ref_name }} \
107107
--commit ${{ github.sha }} \
108108
--scenario script.js \
109-
--duration 1m \
109+
--duration 10m \
110110
--hf-repo ggml-org/models \
111111
--hf-file phi-2/ggml-model-q4_0.gguf \
112112
--model-path-prefix /models \
@@ -120,8 +120,6 @@ jobs:
120120
--max-tokens 2048
121121
122122
cat results.github.env >> $GITHUB_ENV
123-
124-
cat results.github.env
125123
126124
- name: Commit status
127125
uses: Sibz/github-status-action@v1
@@ -151,17 +149,17 @@ jobs:
151149
with:
152150
message-id: bench-${{ github.job }}-${{ env.RUNNER_LABEL }}
153151
message: |
154-
📈 **llama.cpp** server benchmark for _${{ github.job }}_ on **${{ env.RUNNER_LABEL }}**: **${{ env.BENCH_ITERATIONS}} iterations** 🚀
152+
📈 **llama.cpp server** benchmark for _${{ github.job }}_ on _${{ env.RUNNER_LABEL }}_: **${{ env.BENCH_ITERATIONS}} iterations** 🚀
155153
<p align="center">
156-
<img width="80%" height="80%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[0] }}" alt="prompt_tokens_seconds" />
157-
<img width="80%" height="80%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[1] }}" alt="predicted_tokens_seconds"/>
154+
<img width="100%" height="100%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[0] }}" alt="prompt_tokens_seconds" />
155+
<img width="100%" height="100%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[1] }}" alt="predicted_tokens_seconds"/>
158156
</p>
159157
<details>
160158
<summary>Details</summary>
161159
<p align="center">
162-
<img width="80%" height="80%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[2] }}" alt="kv_cache_usage_ratio" />
163-
<img width="80%" height="80%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[3] }}" alt="requests_processing"/>
164-
<img width="80%" height="80%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[4] }}" alt="requests_deferred"/>
160+
<img width="100%" height="100%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[2] }}" alt="kv_cache_usage_ratio" />
161+
<img width="100%" height="100%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[3] }}" alt="requests_processing"/>
162+
<img width="100%" height="100%" src="${{ fromJSON(steps.imgur_step.outputs.imgur_urls)[4] }}" alt="requests_deferred"/>
165163
</p>
166164
</detail>
167165

examples/server/bench/bench.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -122,8 +122,8 @@ def main(args_in: list[str] | None = None) -> None:
122122
plt.xticks(rotation=0, fontsize=14, horizontalalignment='center', alpha=.7)
123123
plt.yticks(fontsize=12, alpha=.7)
124124

125-
plt.title(f"llama.cpp {args.name} on {args.runner_label} {iterations} iterations\n"
126-
f"duration={args.duration}",
125+
plt.title(f"llama.cpp {args.name} on {args.runner_label}\n"
126+
f"duration={args.duration} {iterations} iterations",
127127
fontsize=14, wrap=True)
128128
plt.grid(axis='both', alpha=.3)
129129
plt.ylabel(f"llamacpp:{metric}", fontsize=22)

0 commit comments

Comments
 (0)