Grafana Dashboard
A ready-made Grafana dashboard is included. Its 9 rows cover all metric categories.
Auto-Provisioning via Helm
When deployed with Helm, the dashboard is provisioned automatically: the chart creates a ConfigMap containing the dashboard JSON, labeled so that the Grafana sidecar picks it up.
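As a rough illustration of the kind of ConfigMap described above, the sketch below shows a manifest that a Grafana dashboard sidecar would typically discover. The ConfigMap name, the label value, and the placeholder JSON are assumptions for illustration, not taken from the chart. `grafana_dashboard` is the default label key watched by the Grafana Helm chart's dashboard sidecar, but your installation may be configured with a different key or value.

```yaml
# Illustrative sketch only; not the manifest rendered by the kpod-metrics chart.
apiVersion: v1
kind: ConfigMap
metadata:
  name: kpod-metrics-dashboard      # hypothetical name
  labels:
    # Default label key for the Grafana chart's dashboard sidecar.
    # The expected value depends on your Grafana installation (assumed "1" here).
    grafana_dashboard: "1"
data:
  # The real file holds the full dashboard JSON; this body is a placeholder.
  kpod-metrics-dashboard.json: |
    {
      "title": "kpod-metrics",
      "panels": []
    }
```

If the dashboard does not appear in Grafana, a reasonable first check is whether the label on the ConfigMap matches the label the sidecar is configured to watch.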
Manual Import
For non-Helm setups, import grafana/kpod-metrics-dashboard.json directly via the Grafana UI:
- Open Grafana
- Go to Dashboards > Import
- Upload grafana/kpod-metrics-dashboard.json
- Select your Prometheus data source
Dashboard Rows
The dashboard includes 9 rows:
- Overview — Pod count, collection cycle duration, collector health
- CPU — Run queue latency (p50/p90/p99), context switches
- Network — TCP bytes, connections, retransmits, RTT
- Memory — OOM kills, page faults, cgroup usage
- Syscalls — Per-syscall count, errors, latency
- Disk I/O — Block I/O latency, read/write throughput
- Filesystem — Capacity, usage, available space
- Interrupts — Hardware/software IRQ latency
- Topology — Service dependency Node Graph (requires Infinity plugin)
Recording Rules
When Prometheus Operator is enabled, 17 recording rules are provisioned for precomputed p50/p90/p99 aggregations. The dashboard uses these recording rules for efficient rendering.
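To make the idea of a precomputed percentile concrete, here is a minimal sketch of one recording rule expressed as a Prometheus Operator `PrometheusRule` resource. The resource name, group name, rule name, metric name (`kpod_cpu_runqueue_latency_seconds_bucket`), grouping labels, and the 5m window are all hypothetical, so the 17 rules shipped with the project may look different.

```yaml
# Illustrative sketch only; names and the expression are assumptions.
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: kpod-metrics-recording-rules          # hypothetical name
spec:
  groups:
    - name: kpod-metrics.recording            # hypothetical group name
      rules:
        # Precompute the p99 so dashboard panels query a single cheap series
        # instead of evaluating histogram_quantile on every refresh.
        - record: kpod:cpu_runqueue_latency_seconds:p99   # hypothetical rule name
          expr: |
            histogram_quantile(
              0.99,
              sum by (le, namespace, pod) (
                rate(kpod_cpu_runqueue_latency_seconds_bucket[5m])
              )
            )
```

A dashboard panel would then query the recorded series by name rather than repeating the full expression, which is where the rendering savings come from.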