Skip to content

Latest commit

 

History

History
54 lines (46 loc) · 7.17 KB

COMPONENTS.md

File metadata and controls

54 lines (46 loc) · 7.17 KB

Components

GPU components

General Hardware components

  • cpu: Tracks the combined usage of all CPUs (not per-CPU).
  • disk: Tracks the disk usage of all the mount points specified in the configuration.
  • memory: Tracks the memory usage of the host.
  • network-latency: Tracks global network connectivity statistics.
  • power-supply: Tracks the power supply/usage on the host.
  • pci: Tracks the PCI devices and their Access Control Services (ACS) status.

System components

  • info: Provides static information about the host (e.g., labels, IDs).
  • os: Queries the host OS information (e.g., kernel version).
  • systemd: Tracks the systemd state and unit files.
  • dmesg: Scans and watches dmesg outputs for errors,, as specified in the configuration (e.g., regex match NVIDIA GPU errors).
  • file-descriptor: Tracks the number of file descriptors used on the host.
  • kernel-module: Tracks the kernel modules loaded on the host.

Misc. components

  • containerd-pod: Tracks the current pods from the containerd CRI.
  • k8s-pod: Tracks the current pods from the kubelet read-only port.
  • docker-container: Tracks the current containers from the docker runtime.
  • tailscale: Tracks the tailscale state (e.g., version) if available.
  • file: Returns healthy if and only if all the specified files exist.
  • library: Returns healthy if and only if all the specified libraries exist.