SAL
Prod
RelEng
Firehose
Other
2023-09-04 -
production
08:00
<elukey>
restart kubelet on ml-serve1002 to check if stale prometheus metrics are the cause of the stop_container alert
[production]