
Pipeline: Annotation Service (grafana) Clean up various latency metrics #121

Open · gfr10598 opened this issue Dec 2, 2018 · 3 comments
Labels: 2019 on-call (Issues resulting from or relevant to on-call responsibilities), Sprint 4, Story

gfr10598 commented Dec 2, 2018

The latency metrics are horribly confusing. Some come from etl metrics, and some from annotator metrics, so "service" means different things in different panels.

Some of the metrics show ridiculous values, on the order of tens or hundreds of minutes, so there are clearly some errors, most likely in the queries.

gfr10598 commented Dec 2, 2018

It might help to break down by test_type; the etl metric already has this field.
The outlier seems to be the annotator's internal measurement. It is often many minutes, even though the etl timeout is 2 seconds.
So one thing that would help is to give request handling in the annotator an appropriate context.

gfr10598 commented Dec 6, 2018

The annotator currently running is 20181108t112031. There does not seem to be a corresponding Travis build, though. Unfortunately, annotation-service does not yet implement a useful status page, either.
The logs show a huge number of 499 responses with 7 second latency. The latency for successful requests is on the order of 0.7 to 1.4 seconds. For batch requests, we probably should increase the timeout beyond the current 2 seconds.

ALSO: The code does not specify the quantiles for the summary, so we are only getting the defaults (0.5, 0.9, 0.99). We should update this as well.
The median looks much more sensible than the average, suggesting that there are some very long duration outliers.

@kokosta added the Sprint 4 label and removed the review/triage (Team should review and assign priority) label Dec 10, 2018