Identify health check to run in the Resource Health BB #106

j08lue · 2024-10-15T07:40:18Z

The Resource Health BB is establishing patterns for fetching health checks and trace data (OpenTelemetry) from building blocks.

How far are we from being able to provide any of these via an API or similar?

j08lue · 2024-10-15T07:42:42Z

FYI @dovydas-an @tilowiklundSensmetry

j08lue · 2024-11-26T19:48:49Z

Kubernetes already runs a couple of health checks every 15 seconds and raises an error after 3 consecutive failures:

https://github.com/developmentseed/eoapi-k8s/blob/817ec9df82815f04d25faae9ca86fed985695e1b/helm-chart/eoapi/templates/services/deployment.yaml#L38-L49

The health checks it makes hit routes such as /healthz on our FastAPI runtimes, which are basic pings (i.e. they do not test whether, say, a database connection exists).
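To illustrate the "3 consecutive failures" rule described above, here is a minimal sketch of the counting logic. It is not the Kubernetes implementation; the function name `first_unhealthy_probe` is hypothetical, and the threshold of 3 is taken from the comment above.

```python
def first_unhealthy_probe(results, failure_threshold=3):
    """Return the 1-based index of the probe that trips the threshold, or None.

    `results` is a sequence of booleans, one per probe (True = probe passed).
    A single success resets the count of consecutive failures.
    """
    consecutive_failures = 0
    for index, passed in enumerate(results, start=1):
        consecutive_failures = 0 if passed else consecutive_failures + 1
        if consecutive_failures >= failure_threshold:
            return index
    return None
```

With one probe every 15 seconds, a service that fails intermittently but recovers within two probes would never be flagged under this rule.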

That means we could use the aggregated metrics in Kubernetes as an overall eoAPI health check, e.g. by checking that a minimum number of pods are alive. This could possibly go through the Grafana API on the support service, currently at https://eoapisupport.develop.eoepca.org/.
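A rough sketch of the "minimum number of pods alive" idea, assuming the Deployment status has already been fetched (from the Kubernetes API or elsewhere) as a dictionary. `readyReplicas` is a field of the Kubernetes Deployment status and may be absent when no replica is ready; the function names and the deployment names in the usage note are hypothetical.

```python
def deployment_is_healthy(status, min_ready=1):
    """Return True if the Deployment status reports at least `min_ready` ready pods."""
    # `readyReplicas` can be missing from the status when it is zero.
    ready = status.get("readyReplicas") or 0
    return ready >= min_ready


def overall_health(deployments, min_ready=1):
    """Map each deployment name to a boolean health outcome."""
    return {
        name: deployment_is_healthy(status, min_ready)
        for name, status in deployments.items()
    }
```

For example, `overall_health({"raster": {"replicas": 2, "readyReplicas": 2}, "vector": {"replicas": 1}})` would report the first deployment as healthy and the second as unhealthy. How the status is retrieved is left open here.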

Activate Grafana dashboard on Data Access building block #76

j08lue · 2024-12-11T11:52:06Z

We received a more detailed request from @dovydas-an:

We (the developers of the Resource Health BB) are reaching out to obtain the information needed to set up exemplary health checks.

At the moment, we are considering a simple health check that would "ping" your BB, receive a response and, depending on the response, produce a health check outcome. In order to be able to set up such a check, we would need to know:

URL of the endpoint

Expected response code that would correspond to "OK" health check outcome (e.g. 200)

If authentication is needed, authentication credentials and how they should be added to the request (header)
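The check described in this request could look roughly like the following sketch, which uses only the Python standard library. It is an illustration, not the Resource Health BB's actual implementation; the function name `ping` and its parameters are assumptions.

```python
import urllib.error
import urllib.request


def ping(url, expected_status=200, headers=None, timeout=5.0):
    """Return True if a GET request to `url` answers with `expected_status`.

    `headers` is an optional dict, e.g. for an Authorization header
    if the endpoint requires authentication.
    """
    request = urllib.request.Request(url, headers=headers or {}, method="GET")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            status = response.status
    except urllib.error.HTTPError as error:
        # 4xx/5xx responses are raised as exceptions but still carry a status code.
        status = error.code
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, etc.
        return False
    return status == expected_status


# Usage (hypothetical URL and token):
# ping("https://example.org/healthz", 200, {"Authorization": "Bearer <token>"})
```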

j08lue · 2024-12-11T11:57:22Z

@dovydas-an, the Data Access BB consists of several services, each of which should have a health endpoint of the kind you requested.

Raster API: https://eoapi.develop.eoepca.org/raster/healthz, 200, no auth
Vector API: https://eoapi.develop.eoepca.org/vector/healthz, 200, no auth
STAC API: TBD, maybe this would need to be added to our EOEPCA runtime, @pantierra?
Coverages API: TBD @jankovicgd

As I understand it, all of these endpoints are already being probed on a regular basis by Kubernetes. Maybe there is a way to ask Kubernetes for a status, instead of also pinging these endpoints directly? Perhaps @ividito can advise?

If we can rely on Kubernetes for liveness checks and the building blocks internally do their due diligence to make sure their services are up, I think that would be preferable, no?

pantierra · 2024-12-13T20:34:56Z

STAC API has one here: https://eoapi.develop.eoepca.org/stac/_mgmt/ping, 200, no auth
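Putting the endpoints named so far in this thread together, a per-service check could be sketched as below. The URLs and expected codes are the ones given above (the Coverages API is still TBD and therefore left out); the `run_checks` and `summarize` helpers and the `probe` callable are hypothetical. `probe` stands in for whatever HTTP client is used and is expected to return the response status code, or None if the service is unreachable.

```python
# Endpoint URL and expected "OK" status per service, as listed in this thread.
ENDPOINTS = {
    "raster": ("https://eoapi.develop.eoepca.org/raster/healthz", 200),
    "vector": ("https://eoapi.develop.eoepca.org/vector/healthz", 200),
    "stac": ("https://eoapi.develop.eoepca.org/stac/_mgmt/ping", 200),
}


def run_checks(endpoints, probe):
    """Call `probe(url)` for each service and compare with the expected status."""
    return {
        name: probe(url) == expected
        for name, (url, expected) in endpoints.items()
    }


def summarize(results):
    """Collapse per-service results into a single outcome for the building block."""
    return "OK" if results and all(results.values()) else "FAIL"
```

Keeping the HTTP call behind `probe` means the aggregation can be exercised without network access, and the same structure would work if the status came from Kubernetes instead of direct pings.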

j08lue mentioned this issue Oct 15, 2024

Maintain Helm charts for eoAPI EOEPCA/resource-discovery#87

Open

5 tasks

j08lue mentioned this issue Nov 1, 2024

Requirements Analysis and Architectural Design - Coordinator Development Seed (Q3) #99

Open

2 tasks

j08lue added this to the Q3 milestone Nov 26, 2024

j08lue assigned j08lue and unassigned j08lue Nov 26, 2024

j08lue added the DevSeed label Nov 28, 2024

j08lue self-assigned this Nov 28, 2024

j08lue mentioned this issue Dec 3, 2024

Plan 2.0.0-beta.2 release by 31st January 2025 #113

Open

5 tasks

kalxas mentioned this issue Dec 13, 2024

Identify health check to run in the Resource Health BB EOEPCA/resource-discovery#104

Open
