Cluster Capacity Snapshot
Quantitative node-capacity signals and scheduling headroom captured directly from the Kubernetes bundle.
Operator
Infrastructure
Stabilize the payments namespace by fixing cluster capacity first: remediate MemoryPressure on node aks-system-000001 (adding memory capacity if needed), then resolve the payments-api scheduling failures and CrashLoopBackOff.
Investigate and correct the payments-api image deployment failure by validating the ghcr.io/acme/payments:bad image tag and registry access, then redeploy a known-good version.
Reduce exposure and blast radius for the payments workload by confirming the public LoadBalancer is intended, then add default-deny NetworkPolicies and move payments-api to a dedicated least-privilege service account with token automount disabled if possible.
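The isolation step in the last recommendation can be sketched as two small manifests: a default-deny NetworkPolicy for the payments namespace and a ServiceAccount with token automount disabled. This is a minimal illustration, not the bundle's actual configuration. The namespace name `payments` is taken from the findings, while the policy name `default-deny-all` and the service account name `payments-api` are assumptions chosen for the example.

```python
import json

NAMESPACE = "payments"            # namespace named in the findings
SERVICE_ACCOUNT = "payments-api"  # hypothetical name, not taken from the bundle


def default_deny_policy(namespace: str) -> dict:
    """NetworkPolicy that selects every pod and allows no ingress or egress."""
    return {
        "apiVersion": "networking.k8s.io/v1",
        "kind": "NetworkPolicy",
        "metadata": {"name": "default-deny-all", "namespace": namespace},
        "spec": {
            "podSelector": {},  # an empty selector matches all pods in the namespace
            "policyTypes": ["Ingress", "Egress"],  # no rules listed, so both are denied
        },
    }


def locked_down_service_account(name: str, namespace: str) -> dict:
    """ServiceAccount whose API token is not mounted into pods by default."""
    return {
        "apiVersion": "v1",
        "kind": "ServiceAccount",
        "metadata": {"name": name, "namespace": namespace},
        "automountServiceAccountToken": False,
    }


def as_list(items: list) -> dict:
    """Wrap several objects in a v1 List so they can be applied together."""
    return {"apiVersion": "v1", "kind": "List", "items": items}


manifests = as_list([
    default_deny_policy(NAMESPACE),
    locked_down_service_account(SERVICE_ACCOUNT, NAMESPACE),
])

if __name__ == "__main__":
    # JSON is emitted because kubectl accepts it as well as YAML.
    print(json.dumps(manifests, indent=2))
```

If the script were saved as, say, `isolate_payments.py` (a hypothetical filename), its output could be piped to `kubectl apply -f -`. Note that a default-deny egress policy also blocks DNS lookups, so explicit allow policies for DNS and for the traffic payments-api legitimately needs would have to follow. The Deployment would also need its `serviceAccountName` pointed at the new account, which this sketch does not do.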
Autoscaling, disruption, quota, and namespace-default coverage that changes how operators should interpret capacity signals.
Short, operator-oriented callouts for scheduling, rollout, and failing-workload evidence.
A compact operator view of severity and signal distribution before you drop into detailed findings.
Filter the findings table by signal or severity while keeping the current visible count in view.
Detailed review
Expanded explanation for operators who want the model summary after reviewing the findings table.