Skip to content

Measure p90 latency and failure rate of HUD PR and commit pages #556

@huydhn

Description

@huydhn

Pitch

After all recent Rockset and GitHub outages, I think we should start tracking some KPI for important developer facing HUD pages like PR and commit pages. Here are some observations:

  • It takes a noticeable amount of time to load these pages fully, from few seconds to half a minute. Having the metric helps us balance between adding more features and performance, especially if we want to push for HUD adoption among developers (everyone loves pages that load faster)

Screen Shot 2022-08-18 at 14 53 22

  • These page returns error sometime when its upstream dependencies fail, probably GitHub API but I'm not entirely sure. So it makes sense to start thinking about availability

Solutions

TBD. May be vercel already has these metrics somewhere that we can just tap in

cc @janeyx99 @ZainRizvi @pytorch/pytorch-dev-infra

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions