Description
🚀 The feature, motivation and pitch
PR #10980 added parallel sampling support. Currently, the v1 engine's parallel sampling capability does not support metrics: a request with n>1 returns a request output with null/None in the metrics field.
The desired behavior is for parallel sampling request outputs to include metrics; this may require defining how we want metrics to appear for parallel sampling requests in v1.
Alternatives
Currently a user must account for parallel sampling request outputs having no metrics.
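For illustration, a minimal sketch of the kind of guard a caller needs today. It assumes only that the request output exposes a `metrics` attribute that may be `None`; the helper name `metrics_or_default` and the stand-in objects are hypothetical and not part of vLLM.

```python
from types import SimpleNamespace
from typing import Any


def metrics_or_default(request_output: Any, default: Any = None) -> Any:
    """Return request_output.metrics, or `default` when it is missing or None.

    Hypothetical helper: it relies only on the output having an optional
    `metrics` attribute, as described in this issue.
    """
    metrics = getattr(request_output, "metrics", None)
    return default if metrics is None else metrics


# Stand-in objects for illustration (not real vLLM request outputs).
single_sample = SimpleNamespace(metrics={"arrival_time": 1.0})
parallel_sample = SimpleNamespace(metrics=None)  # what n>1 yields today

single_metrics = metrics_or_default(single_sample)
parallel_metrics = metrics_or_default(parallel_sample, default={})
```

Every consumer of the metrics field needs a check like this until parallel sampling outputs carry metrics.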
Additional context
I was responsible for #10980; metrics support was challenging because each child request passes through the engine separately and accrues its own metrics. It was unclear how to synthesize one parent request metrics data structure from n child request metrics data structures, in part because I was not clear on the spec for how metrics are supposed to appear for parallel sampling requests.
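To make the open question concrete, here is one possible aggregation policy, sketched with hypothetical types. `ChildMetrics`, `ParentMetrics`, their fields, and `merge_child_metrics` are assumptions made for this example, not vLLM's actual metrics structures, and the policy (earliest arrival, earliest first token, latest finish, summed token counts) is only one of several reasonable specs.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class ChildMetrics:
    """Hypothetical per-child metrics; field names are assumptions."""
    arrival_time: float
    first_token_time: Optional[float]
    finished_time: Optional[float]
    num_generation_tokens: int


@dataclass
class ParentMetrics:
    """Hypothetical aggregate metrics for the parent (n>1) request."""
    arrival_time: float
    first_token_time: Optional[float]
    finished_time: Optional[float]
    num_generation_tokens: int


def merge_child_metrics(children: Sequence[ChildMetrics]) -> ParentMetrics:
    """Combine n child metrics into one parent record (assumed policy)."""
    if not children:
        raise ValueError("expected at least one child request")

    first_tokens = [c.first_token_time for c in children
                    if c.first_token_time is not None]
    finish_times = [c.finished_time for c in children
                    if c.finished_time is not None]
    # The parent is finished only once every child has finished.
    all_finished = len(finish_times) == len(children)

    return ParentMetrics(
        arrival_time=min(c.arrival_time for c in children),
        first_token_time=min(first_tokens) if first_tokens else None,
        finished_time=max(finish_times) if all_finished else None,
        num_generation_tokens=sum(c.num_generation_tokens for c in children),
    )


# Example with n=2 children of the same parent request.
parent = merge_child_metrics([
    ChildMetrics(arrival_time=10.0, first_token_time=10.5,
                 finished_time=12.0, num_generation_tokens=16),
    ChildMetrics(arrival_time=10.0, first_token_time=10.7,
                 finished_time=13.0, num_generation_tokens=24),
])
```

Under this policy the parent's time-to-first-token reflects the fastest child and its end-to-end latency reflects the slowest. Whether that is the desired semantics, or whether per-child metrics should be exposed instead, is the part of the spec that still needs to be decided.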
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.