Prometheus collects metrics, Grafana provides dashboards. Together the most popular monitoring stack.
Prometheus Config
global:
scrape_interval: 15s
scrape_configs:
- job_name: node
static_configs:
- targets: ["web1:9100", "db1:9100"]
Node Exporter
[Unit]
Description=Node Exporter
[Service]
ExecStart=/usr/local/bin/node_exporter
Restart=always
[Install]
WantedBy=multi-user.target
Alert Rules
groups:
- name: server_alerts
rules:
- alert: HighCPU
expr: 100 - (avg by(instance)(rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80
for: 5m
- alert: DiskSpaceLow
expr: (node_filesystem_avail_bytes / node_filesystem_size_bytes) * 100 < 15
for: 10m
Master monitoring with our Monitoring eBook collection.