Skip to content

[Feature]: [V1] Parallel sampling should support metrics #13792

@afeldman-nm

Description

@afeldman-nm

🚀 The feature, motivation and pitch

PR #10980 added parallel sampling support. Currently, the v1 engine's parallel sampling capability does not support metrics: a request with n>1 returns a request output with null/None in the metrics field.

The desired behavior is for parallel sampling request outputs to include metrics; this may require defining how we want metrics to appear for parallel sampling requests in v1.

Alternatives

Currently a user must account for parallel sampling request outputs having no metrics.

Additional context

I was responsible for #10980 ; metrics support was challenging because each child request passes through the engine separately & accrues its own metrics. It was unclear how to synthesize one parent request metrics data structure from n child request metrics data structures, in part because I was not clear on the spec for how metrics are supposed to appear for parallel sampling requests.

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Metadata

Metadata

Assignees

No one assigned

    Labels

    feature requestNew feature or requestunstaleRecieved activity after being labelled stale

    Type

    No type

    Projects

    Status

    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions