mirror of
https://github.com/kevinveenbirkenbach/computer-playbook.git
synced 2025-08-31 07:48:04 +02:00
Refactor systemctl services and categories due to alarm bugs
This commit restructures systemctl service definitions and category mappings. Motivation: Alarm-related bugs revealed inconsistencies in service and role handling. Preparation step: lays the groundwork for fixing the alarm issues by aligning categories, roles, and service templates.
26
roles/sys-ctl-hlth-docker-container/README.md
Normal file
@@ -0,0 +1,26 @@
# Docker Container Health Check

## Description

This role monitors the health status of Docker containers on the system. It detects containers that are either **unhealthy** or have **exited with a non-zero code**, and triggers alerts if issues are found.

## Overview

The role installs a health check script along with a `systemd` service and timer to run these checks at scheduled intervals.
If unhealthy or failed containers are detected, the configured failure notifier (via `sys-ctl-alm-compose`) is triggered.
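
The role's actual script is not shown in this README, so the following is only a minimal Python sketch of the detection step described above. It assumes the status text printed by `docker ps -a` (for example `Up 5 minutes (unhealthy)` or `Exited (137) 1 hour ago`). The function names `find_problem_containers`, `list_container_status`, and `main` are hypothetical. How a failed run reaches `sys-ctl-alm-compose` (for instance through a systemd `OnFailure=` hook) is likewise an assumption.

```python
import re
import subprocess
import sys

# Status text docker prints for stopped containers, e.g. "Exited (137) 2 hours ago".
EXITED = re.compile(r"^Exited \((\d+)\)")


def find_problem_containers(ps_lines):
    """Return (name, reason) pairs for unhealthy or failed containers.

    Each input line is expected to look like "<name>\t<status>".
    """
    problems = []
    for line in ps_lines:
        if not line.strip():
            continue  # skip blank lines in the CLI output
        name, _, status = line.partition("\t")
        exited = EXITED.match(status)
        if "(unhealthy)" in status:
            problems.append((name, "unhealthy"))
        elif exited and int(exited.group(1)) != 0:
            problems.append((name, f"exited with code {exited.group(1)}"))
    return problems


def list_container_status():
    """Ask the Docker CLI for the name and status text of every container."""
    result = subprocess.run(
        ["docker", "ps", "-a", "--format", "{{.Names}}\t{{.Status}}"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.splitlines()


def main():
    """Report problem containers on stderr and return a process exit code."""
    problems = find_problem_containers(list_container_status())
    for name, reason in problems:
        print(f"{name}: {reason}", file=sys.stderr)
    # Assumption: a non-zero exit marks the one-shot service run as failed,
    # which is what would let a failure notifier react.
    return 1 if problems else 0
```

If installed as a script, the entry point would be `sys.exit(main())`. Keeping the parsing in `find_problem_containers` separate from the Docker call means the detection rules can be checked against sample status lines without a running Docker daemon.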

## Purpose

The primary purpose of this role is to ensure that Docker-based services remain operational. By automatically monitoring container health, it enables administrators to react quickly to failures, reducing downtime and preventing unnoticed service degradation.

## Features

- **Automated Health Checks:** Detects containers that are in the `unhealthy` state or that exited with a non-zero exit code.
- **Systemd Integration:** Installs a one-shot service and timer to run health checks on a schedule.
- **Alerting Support:** Works with the [`sys-ctl-alm-compose`](../sys-ctl-alm-compose/README.md) role for failure notifications.
- **Configurable Script Location:** Controlled via the `PATH_ADMINISTRATOR_SCRIPTS` variable.

## Further Resources

- [Docker Health Checks Documentation](https://docs.docker.com/engine/reference/run/#healthcheck)
- [Systemd Timers Documentation](https://www.freedesktop.org/software/systemd/man/systemd.timer.html)