| 2024-10-21
      
      ยง | 
    
  | 09:29 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2037.codfw.wmnet | [production] | 
            
  | 09:29 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ping2004.codfw.wmnet | [production] | 
            
  | 09:27 | <jayme@cumin1002> | START - Cookbook sre.hosts.reimage for host kubestagemaster1005.eqiad.wmnet with OS bookworm | [production] | 
            
  | 09:27 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1004.eqiad.wmnet with OS bookworm | [production] | 
            
  | 09:27 | <dcausse@deploy2002> | dcausse: Backport for [[gerrit:1081402|Fix phan issue with getCounter returning NullMetric|CounterMetric]], [[gerrit:1081396|Do not pass null to DataSender::sendWeightedTagsUpdate $tagWeights (T376715)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 09:26 | <klausman@cumin1002> | START - Cookbook sre.hosts.reboot-single for host ml-serve1010.eqiad.wmnet | [production] | 
            
  | 09:24 | <klausman@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1009.eqiad.wmnet | [production] | 
            
  | 09:22 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2037.codfw.wmnet | [production] | 
            
  | 09:19 | <klausman@cumin1002> | START - Cookbook sre.hosts.reboot-single for host ml-serve1009.eqiad.wmnet | [production] | 
            
  | 09:18 | <klausman@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-lab1002.eqiad.wmnet | [production] | 
            
  | 09:18 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2038.codfw.wmnet | [production] | 
            
  | 09:16 | <dcausse@deploy2002> | Started scap sync-world: Backport for [[gerrit:1081402|Fix phan issue with getCounter returning NullMetric|CounterMetric]], [[gerrit:1081396|Do not pass null to DataSender::sendWeightedTagsUpdate $tagWeights (T376715)]] | [production] | 
            
  | 09:12 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2038.codfw.wmnet | [production] | 
            
  | 09:12 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2037.codfw.wmnet | [production] | 
            
  | 09:11 | <klausman@cumin1002> | START - Cookbook sre.hosts.reboot-single for host ml-lab1002.eqiad.wmnet | [production] | 
            
  | 09:11 | <elukey@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:11 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:10 | <elukey@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:10 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:09 | <elukey@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:09 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART | [production] | 
            
  | 09:07 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2037.codfw.wmnet | [production] | 
            
  | 09:06 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1004.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 09:04 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2044.codfw.wmnet | [production] | 
            
  | 09:04 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox-dev2003.codfw.wmnet | [production] | 
            
  | 09:03 | <klausman@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-lab1001.eqiad.wmnet | [production] | 
            
  | 09:02 | <jayme@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1004.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 09:00 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host netbox-dev2003.codfw.wmnet | [production] | 
            
  | 08:58 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2044.codfw.wmnet | [production] | 
            
  | 08:57 | <klausman@cumin1002> | START - Cookbook sre.hosts.reboot-single for host ml-lab1001.eqiad.wmnet | [production] | 
            
  | 08:55 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2043.codfw.wmnet | [production] | 
            
  | 08:55 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2039.codfw.wmnet | [production] | 
            
  | 08:53 | <andrewtavis-wmde@deploy2002> | Finished deploy [airflow-dags/wmde@d176c47]: (no justification provided) (duration: 00m 11s) | [production] | 
            
  | 08:53 | <andrewtavis-wmde@deploy2002> | Started deploy [airflow-dags/wmde@d176c47]: (no justification provided) | [production] | 
            
  | 08:50 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2043.codfw.wmnet | [production] | 
            
  | 08:50 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2039.codfw.wmnet | [production] | 
            
  | 08:48 | <jayme@cumin1002> | START - Cookbook sre.hosts.reimage for host kubestagemaster1004.eqiad.wmnet with OS bookworm | [production] | 
            
  | 08:47 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1003.eqiad.wmnet with OS bookworm | [production] | 
            
  | 08:46 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2040.codfw.wmnet | [production] | 
            
  | 08:46 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2041.codfw.wmnet | [production] | 
            
  | 08:44 | <jnuche@deploy2002> | Installing scap version "4.114.0" for 210 hosts | [production] | 
            
  | 08:41 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2041.codfw.wmnet | [production] | 
            
  | 08:41 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2040.codfw.wmnet | [production] | 
            
  | 08:26 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1003.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 08:23 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 08:23 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 08:23 | <jayme@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1003.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 08:22 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 08:21 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 08:09 | <jayme@cumin1002> | START - Cookbook sre.hosts.reimage for host kubestagemaster1003.eqiad.wmnet with OS bookworm | [production] |