Skip to content

Commit e36e8b8

Browse files
divakar-amdrasmith
authored andcommitted
[Misc] sort torch profiler table by kernel timing (vllm-project#11813)
1 parent d4d766b commit e36e8b8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

benchmarks/benchmark_latency.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@ def run_to_completion(profile_dir: Optional[str] = None):
5252
llm.generate(dummy_prompts,
5353
sampling_params=sampling_params,
5454
use_tqdm=False)
55-
print(p.key_averages())
55+
print(p.key_averages().table(sort_by="self_cuda_time_total"))
5656
else:
5757
start_time = time.perf_counter()
5858
llm.generate(dummy_prompts,

0 commit comments

Comments
 (0)