Skip to content

server: metrics endpoint #5850

Closed
Closed
@phymbert

Description

@phymbert

Issues

  • Server /metrics endpoint share the same task event as /health: TASK_TYPE_METRICS. It means metrics are reset on both calls.
  • the Process-Start-Time-Unix http response header is not set.
  • metrics llamacpp:prompt_tokens_seconds and llamacpp:predicted_tokens_seconds are per slots, while the server actually process llamacpp:prompt_tokens_seconds * n_slots

Proposal

  • Add a data params in TASK_TYPE_METRICS to reset the metric bucket only in /metrics
  • Add llamacpp:prompt_tokens_seconds_total and llamacpp:predicted_tokens_seconds_total

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions