Commit 4d29e91

[Misc] sort torch profiler table by kernel timing (#11813)
1 parent 91445c7 commit 4d29e91

1 file changed: +1 -1 lines changed

benchmarks/benchmark_latency.py

Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ def run_to_completion(profile_dir: Optional[str] = None):
                 llm.generate(dummy_prompts,
                              sampling_params=sampling_params,
                              use_tqdm=False)
-            print(p.key_averages())
+            print(p.key_averages().table(sort_by="self_cuda_time_total"))
         else:
             start_time = time.perf_counter()
             llm.generate(dummy_prompts,
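
The change above can be tried outside the benchmark with a minimal sketch like the following. It assumes PyTorch is installed. `pick_sort_key` and `profile_matmul` are hypothetical helpers that are not part of benchmarks/benchmark_latency.py, and the fallback to `self_cpu_time_total` on machines without a GPU is an assumption of this sketch, not something the commit does.

```python
def pick_sort_key(cuda_available: bool) -> str:
    """Choose the profiler column to sort by.

    Assumption of this sketch: fall back to CPU self time when no GPU
    is present. The commit itself always uses "self_cuda_time_total".
    """
    return "self_cuda_time_total" if cuda_available else "self_cpu_time_total"


def profile_matmul(size: int = 256, row_limit: int = 10) -> str:
    """Profile one matrix multiply and return the sorted summary table."""
    # Imported lazily so pick_sort_key stays usable without PyTorch.
    import torch
    from torch.profiler import ProfilerActivity, profile

    use_cuda = torch.cuda.is_available()
    activities = [ProfilerActivity.CPU]
    if use_cuda:
        activities.append(ProfilerActivity.CUDA)

    x = torch.randn(size, size, device="cuda" if use_cuda else "cpu")

    with profile(activities=activities) as p:
        torch.matmul(x, x)
        if use_cuda:
            # Wait for the kernel to finish so it is captured in the trace.
            torch.cuda.synchronize()

    # Same call shape as the commit: aggregate events, then sort the table.
    return p.key_averages().table(sort_by=pick_sort_key(use_cuda),
                                  row_limit=row_limit)
```

Calling `print(profile_matmul())` prints the aggregated table with the rows that spent the most self time at the top, which is the ordering the commit adds to the benchmark's profiler output.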
