SAL
Prod
RelEng
Firehose
Other
2026-07-02 -
production
16:52
<ryankemper>
[ml-serve-eqiad] Cleared out 1302 failed (Evicted) pods: `kubectl -n llm delete pods --field-selector=status.phase=Failed`, freeing calico-kube-controllers from OOM crashloop (evictions were caused by disk pressure)
[production]