Cluster Capacity Snapshot
Quantitative node-capacity signals and scheduling headroom captured directly from the Kubernetes bundle.
Operator
Infrastructure
Stabilize the cluster and namespace capacity first: investigate aks-system-000001 MemoryPressure, add memory capacity if needed, and resolve the FailedScheduling condition affecting payments-api.
Contain the exposed payments surface: verify whether payments-public must be LoadBalancer; if not, make it internal, and add namespace NetworkPolicies to default-deny traffic with explicit allows.
Harden and fix payments-api before the next rollout: correct the image pull failure and CrashLoopBackOff, then move to a dedicated least-privilege service account with token automount disabled, add probes, requests/limits, seccomp RuntimeDefault, and a read-only root filesystem.
Quantitative node-capacity signals and scheduling headroom captured directly from the Kubernetes bundle.
Autoscaling, disruption, quota, and namespace-default coverage that changes how operators should interpret capacity signals.
Short, operator-oriented callouts for scheduling, rollout, and failing-workload evidence.
A compact operator view of severity and signal distribution before you drop into detailed findings.
Filter the findings table by signal or severity while keeping the current visible count in view.
Detailed review
Expanded explanation for operators who want the model summary after reviewing the findings table.