Feel free to dive in, but I wanted to call out that there will be some restructuring along these lines for the output standardization, in case there is duplicate work. I will share something out a bit later.
Let's also make sure that the output JSON stores all of the metadata we may want to know (a sketch of what this could look like follows the list):
- Model_name
- Quantized (None, INT4, INT8, FP8)
- Hardware
- Inference Scenario
- vllm version
- vllm-config file (we need the file itself)
- GuideLLM results:
  - Tokens per Second
  - Time to First Token (TTFT)
  - Inter-token Latency (ITL)
  - End-to-End Request Latency (e2e_latency)
  - Requests Per Second (RPS) Profiles/Sweeps
  - Cost to generate a million output tokens (Internal) - Future
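A minimal sketch of what such a metadata block could look like; the field names, nesting under a `guidellm_results` key, and all values are placeholders for illustration, not a final schema:

```json
{
  "model_name": "meta-llama/Llama-3.1-8B-Instruct",
  "quantized": "FP8",
  "hardware": "1xA100-80GB",
  "inference_scenario": "chat",
  "vllm_version": "0.6.3",
  "vllm_config_file": "configs/vllm-config.yaml",
  "guidellm_results": {
    "tokens_per_second": 1250.4,
    "ttft_ms": 182.5,
    "itl_ms": 11.3,
    "e2e_latency_ms": 2140.8,
    "rps_profiles": ["synchronous", "throughput", "sweep"]
  }
}
```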
There are a lot of minor improvements that can be made to the JSON output of guidellm (a combined sketch of the cleaned-up shape follows). For example:

- `"decode_times": { "data": [] }` -> `"decode_times": []`
- `"request_latency_percentiles": [ 1, 2, ... ]` -> `"request_latency_percentiles": { "p01": 1, "p05": 5, ... }`