Grafana Dashboard

A ready-made Grafana dashboard is included. It has 9 rows that together cover all metric categories.

Auto-Provisioning via Helm

The dashboard auto-provisions via the Grafana sidecar when deployed with Helm:

grafana:
  dashboard:
    enabled: true   # default
    label: "1"      # matches Grafana sidecar default

The Helm chart creates a ConfigMap with the dashboard JSON, labeled for automatic pickup by the Grafana sidecar.
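
For orientation, the ConfigMap the chart renders might look roughly like the sketch below. The ConfigMap name, the namespace, and the label key `grafana_dashboard` are assumptions (that key is the one commonly watched by the Grafana sidecar); only the label value "1" comes from the values shown above. Inspect the rendered chart output for the actual manifest.

```yaml
# Illustrative sketch only; not the chart's actual template output.
apiVersion: v1
kind: ConfigMap
metadata:
  name: kpod-metrics-dashboard      # hypothetical name
  namespace: monitoring             # hypothetical namespace
  labels:
    # Label key is an assumption; the value mirrors grafana.dashboard.label.
    grafana_dashboard: "1"
data:
  # Placeholder body; the real dashboard JSON would be embedded here.
  kpod-metrics-dashboard.json: |
    { "title": "kpod-metrics", "panels": [] }
```

If the sidecar in your cluster is configured with a different label value, set grafana.dashboard.label to match.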

Manual Import

For non-Helm setups, import grafana/kpod-metrics-dashboard.json directly via the Grafana UI:

1. Open Grafana
2. Go to Dashboards > Import
3. Upload grafana/kpod-metrics-dashboard.json
4. Select your Prometheus data source

Dashboard Rows

The dashboard includes 9 rows:

1. Overview: Pod count, collection cycle duration, collector health
2. CPU: Run queue latency (p50/p90/p99), context switches
3. Network: TCP bytes, connections, retransmits, RTT
4. Memory: OOM kills, page faults, cgroup usage
5. Syscalls: Per-syscall count, errors, latency
6. Disk I/O: Block I/O latency, read/write throughput
7. Filesystem: Capacity, usage, available space
8. Interrupts: Hardware/software IRQ latency
9. Topology: Service dependency Node Graph (requires Infinity plugin)
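
The Topology row depends on the Infinity data source plugin being installed in Grafana. A minimal sketch of one way to install it, assuming Grafana itself is deployed with the Grafana Helm chart (an assumption; this page does not say how Grafana is installed). The plugin ID is the one listed in the Grafana plugin catalog.

```yaml
# values.yaml for the Grafana Helm chart (assumed deployment method).
# When Grafana is a subchart (for example in kube-prometheus-stack),
# this block would typically sit under the top-level `grafana:` key.
plugins:
  - yesoreyeram-infinity-datasource   # Infinity data source plugin ID
```

After installation, an Infinity data source still has to be configured for the Node Graph panel to query.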

Recording Rules

When Prometheus Operator is enabled, 17 recording rules are provisioned for precomputed p50/p90/p99 aggregations. The dashboard uses these recording rules for efficient rendering.
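
To illustrate what a precomputed percentile looks like, here is a sketch of a single recording rule in a Prometheus Operator PrometheusRule resource. The resource name, group name, rule name, and the metric `kpod_cpu_runqueue_latency_seconds_bucket` are all hypothetical and chosen for illustration; the 17 rules actually shipped may be named and structured differently.

```yaml
# Illustrative sketch only; metric and rule names are hypothetical.
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: kpod-metrics-recording-rules   # hypothetical name
spec:
  groups:
    - name: kpod-metrics.latency       # hypothetical group name
      rules:
        # Precompute the p99 run queue latency per pod from histogram buckets.
        - record: kpod:cpu_runqueue_latency_seconds:p99   # hypothetical rule name
          expr: |
            histogram_quantile(
              0.99,
              sum by (le, namespace, pod) (
                rate(kpod_cpu_runqueue_latency_seconds_bucket[5m])
              )
            )
```

A dashboard panel can then query the recorded series by name instead of evaluating histogram_quantile over raw buckets on every refresh, which is where the rendering efficiency comes from.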