mirror of
https://github.com/kevinveenbirkenbach/computer-playbook.git
synced 2025-08-16 08:56:46 +02:00
27 lines
1.4 KiB
Markdown
27 lines
1.4 KiB
Markdown
# Docker Container Health Check
|
|
|
|
## Description
|
|
|
|
This role monitors the health status of Docker containers on the system. It detects containers that are either **unhealthy** or have **exited with a non-zero code**, and triggers alerts if issues are found.
|
|
|
|
## Overview
|
|
|
|
The role installs a health check script along with a `systemd` service and timer to run these checks at scheduled intervals.
|
|
If unhealthy or failed containers are detected, the configured failure notifier (via `sys-alm-compose`) is triggered.
|
|
|
|
## Purpose
|
|
|
|
The primary purpose of this role is to ensure that Docker-based services remain operational. By automatically monitoring container health, it enables administrators to react quickly to failures, reducing downtime and preventing unnoticed service degradation.
|
|
|
|
## Features
|
|
|
|
- **Automated Health Checks:** Detects containers in `unhealthy` state or exited with non-zero exit codes.
|
|
- **Systemd Integration:** Installs a one-shot service and timer to run health checks on a schedule.
|
|
- **Alerting Support:** Works with the [`sys-alm-compose`](../sys-alm-compose/README.md) role for failure notifications.
|
|
- **Configurable Script Location:** Controlled via the `path_administrator_scripts` variable.
|
|
|
|
## Further Resources
|
|
|
|
- [Docker Health Checks Documentation](https://docs.docker.com/engine/reference/run/#healthcheck)
|
|
- [Systemd Timers Documentation](https://www.freedesktop.org/software/systemd/man/systemd.timer.html)
|