-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Identify health check to run in the Resource Health BB #106
Comments
Kubernetes already runs a couple of health checks every 15 seconds and throws an error on 3 consecutive fails: The health checks it makes are That means we could leverage the aggregated metrics in Kubernetes as an overall eoAPI health check, e.g. checking that a minimum number of pods are alive, possibly via the Grafana API on the support service, currently at https://eoapisupport.develop.eoepca.org/. |
We received a more detailed request from @dovydas-an:
|
@dovydas-an, the Data Access BB consists of several services, each of which should have a health endpoint like you requested.
As I understand it, all of these endpoints are already being probed on a regular basis by Kubernetes. Maybe there is a way to ask Kubernetes for a status, instead of also pinging these endpoints directly? Perhaps @ividito can advice? If we can rely on Kubernetes for liveliness checks and the building blocks internally do their due diligence to make sure their services are up, I think that would be preferable, no? |
STAC API has one here: https://eoapi.develop.eoepca.org/stac/_mgmt/ping, 200, no auth |
The Resource Health BB is establishing patterns for fetching health checks and trace data (OpenTelemetry) from building blocks.
How far are we from being able to provide any of these via APIs or so?
The text was updated successfully, but these errors were encountered: