[telemetry] Fix telemetry causing high DB CPU load

Currently we're executing an [expensive query](https://github.com/gitpod-io/gitpod/blob/29df192bd0463473390716567fad2985d696f43b/components/gitpod-db/src/typeorm/workspace-db-impl.ts#L337-L344) more often than we expect. This caused 3 incidents over the course of the last 2,5 weeks. We still are not sure why it's triggered this often (there are no traces/logs), but still we can improve the situation by:

 1. DB:
   1. improve the query itself: our hypothesis is that the ORM generates a query like this `SELECT COUNT(1) FROM (SELECT ... JOIN...)` where the subquery is the reason for the slowness (MySQL tries to materialize the table). Try writing direct SQL ala `SELECT COUNT(1) FROM d_b_workspace_instance AS wsi JOIN d_b_workspace AS ws ON ws.id = wsi.workspaceId WHERE ws.type = 'regular'. Test this against a failover prod DB.
   2. double-check we have an index on `workspace.type` _in both prod DBs_
 2. API: use different API/HTTP calls/requests for "config" and "telemetry data": e.g., don't execute the queries in case we are not sending the result anyway
 3. better observability: add tracing to the [HTTP endpoint](https://github.com/gitpod-io/gitpod/blob/3be4e0b7a56a1249ff00c015efd28a5ce715c161/components/server/src/installation-admin/installation-admin-controller.ts#L17-L20)

/cc @corneliusludmann 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[telemetry] Fix telemetry causing high DB CPU load #8638

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[telemetry] Fix telemetry causing high DB CPU load #8638

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions