| 2023-06-30 |
    
  | 16:26 | <jayme@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mathoid: apply | [production] | 
            
  | 16:25 | <jayme@deploy1002> | helmfile [codfw] START helmfile.d/services/mathoid: apply | [production] | 
            
  | 16:25 | <jayme@deploy1002> | helmfile [staging] DONE helmfile.d/services/mathoid: apply | [production] | 
            
  | 16:25 | <jayme@deploy1002> | helmfile [staging] START helmfile.d/services/mathoid: apply | [production] | 
            
  | 16:09 | <aikochou@deploy1002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . | [production] | 
            
  | 15:50 | <jhancock@cumin2002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1149.eqiad.wmnet with OS bullseye | [production] | 
            
  | 15:35 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . | [production] | 
            
  | 15:35 | <elukey@deploy1002> | helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . | [production] | 
            
  | 15:21 | <isaranto@deploy1002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . | [production] | 
            
  | 14:43 | <jiji@cumin1001> | conftool action : get; selector: service=kube-apiserver | [production] | 
            
  | 14:42 | <sbassett> | Deployed updated mitigation for T337593 | [production] | 
            
  | 14:30 | <jhancock@cumin2002> | START - Cookbook sre.hosts.reimage for host an-worker1149.eqiad.wmnet with OS bullseye | [production] | 
            
  | 14:14 | <isaranto@deploy1002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . | [production] | 
            
  | 13:23 | <jayme@deploy1002> | helmfile [staging] DONE helmfile.d/services/mathoid: apply | [production] | 
            
  | 13:23 | <jayme@deploy1002> | helmfile [staging] START helmfile.d/services/mathoid: apply | [production] | 
            
  | 12:39 | <kharlan@deploy1002> | helmfile [eqiad] START helmfile.d/services/ipoid: apply | [production] | 
            
  | 12:30 | <bking@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2021.codfw.wmnet with OS bullseye | [production] | 
            
  | 12:22 | <jbond@cumin1001> | END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 12:20 | <jiji@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubestagemaster2002.codfw.wmnet with OS bullseye | [production] | 
            
  | 12:17 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 12:17 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 12:16 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 12:10 | <kharlan@deploy1002> | helmfile [staging] DONE helmfile.d/services/ipoid: apply | [production] | 
            
  | 12:09 | <kharlan@deploy1002> | helmfile [staging] START helmfile.d/services/ipoid: apply | [production] | 
            
  | 12:03 | <kharlan@deploy1002> | helmfile [staging] START helmfile.d/services/ipoid: apply | [production] | 
            
  | 11:59 | <jiji@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1002.eqiad.wmnet with OS bullseye | [production] | 
            
  | 11:54 | <jiji@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster2002.codfw.wmnet with reason: host reimage | [production] | 
            
  | 11:51 | <jiji@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster2002.codfw.wmnet with reason: host reimage | [production] | 
            
  | 11:39 | <jiji@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1002.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 11:38 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:36 | <jiji@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1002.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 11:31 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:28 | <jiji@cumin1001> | START - Cookbook sre.hosts.reimage for host kubestagemaster2002.codfw.wmnet with OS bullseye | [production] | 
            
  | 11:28 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:28 | <jiji@cumin1001> | START - Cookbook sre.hosts.reimage for host kubestagemaster1002.eqiad.wmnet with OS bullseye | [production] | 
            
  | 11:23 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:23 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:22 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:15 | <jayme> | published image docker-registry.discovery.wmnet/envoy:1.18.3-2-s3 and docker-registry.discovery.wmnet/envoy-future:1.23.10-1-s1 - T300324 | [production] | 
            
  | 11:14 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:14 | <jayme> | imported envoyproxy 1.23.10 to component/envoy-future in buster-wikimedia - T300324 | [production] | 
            
  | 11:05 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:05 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:05 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:05 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 11:04 | <jbond@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 10:45 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['sretest1003'] | [production] | 
            
  | 10:24 | <elukey@deploy1002> | helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . | [production] | 
            
  | 10:22 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . | [production] | 
            
  | 10:20 | <elukey@deploy1002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . | [production] |