| 2024-05-28
      
      ยง | 
    
  | 19:24 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) | [admin] | 
            
  | 19:24 | <herron> | ganeti1027:~$ sudo gnt-instance reboot grafana1002 | [production] | 
            
  | 19:24 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 19:22 | <logmsgbot> | @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 19:22 | <logmsgbot> | @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 19:22 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) | [admin] | 
            
  | 19:21 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 19:21 | <andrew@cloudcumin1001> | END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) | [admin] | 
            
  | 19:20 | <logmsgbot> | @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 19:20 | <logmsgbot> | @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 19:19 | <jclark@cumin1002> | START - Cookbook sre.hosts.reimage for host kafka-main1010.eqiad.wmnet with OS bullseye | [production] | 
            
  | 19:19 | <logmsgbot> | @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 19:18 | <logmsgbot> | @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 19:17 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 19:16 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) | [admin] | 
            
  | 19:14 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 19:01 | <bking@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host elastic1056.eqiad.wmnet | [production] | 
            
  | 19:00 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1211 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P63476 and previous config saved to /var/cache/conftool/dbconfig/20240528-190021-root.json | [production] | 
            
  | 18:59 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) | [admin] | 
            
  | 18:57 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 18:54 | <bking@cumin2002> | START - Cookbook sre.hosts.reboot-single for host elastic1056.eqiad.wmnet | [production] | 
            
  | 18:54 | <logmsgbot> | @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:54 | <logmsgbot> | @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:53 | <bking@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on elastic1056.eqiad.wmnet with reason: rebooting after abnormally high load | [production] | 
            
  | 18:53 | <bking@cumin2002> | START - Cookbook sre.hosts.downtime for 1:00:00 on elastic1056.eqiad.wmnet with reason: rebooting after abnormally high load | [production] | 
            
  | 18:51 | <logmsgbot> | @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:51 | <logmsgbot> | @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:50 | <jclark@cumin1002> | START - Cookbook sre.hosts.reimage for host kafka-main1009.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:48 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) | [admin] | 
            
  | 18:47 | <jclark@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 18:47 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 18:45 | <dancy@deploy1002> | rebuilt and synchronized wikiversions files: group0 wikis to 1.43.0-wmf.7  refs T361401 | [production] | 
            
  | 18:45 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1211 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P63475 and previous config saved to /var/cache/conftool/dbconfig/20240528-184515-root.json | [production] | 
            
  | 18:41 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db1174 (T364299)', diff saved to https://phabricator.wikimedia.org/P63474 and previous config saved to /var/cache/conftool/dbconfig/20240528-184110-marostegui.json | [production] | 
            
  | 18:41 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 18:40 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 18:37 | <jclark@cumin1002> | START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 18:36 | <jclark@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 18:35 | <jclark@cumin1002> | START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 18:30 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1211 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P63473 and previous config saved to /var/cache/conftool/dbconfig/20240528-183009-root.json | [production] | 
            
  | 18:21 | <jclark@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 18:20 | <dancy@deploy1002> | sync-world aborted: Backport for [[gerrit:1036725|Remove the php symlink (T359643)]] (duration: 00m 30s) | [production] | 
            
  | 18:19 | <dancy@deploy1002> | Started scap: Backport for [[gerrit:1036725|Remove the php symlink (T359643)]] | [production] | 
            
  | 18:19 | <jclark@cumin1002> | START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 18:17 | <dancy@deploy1002> | sync-world aborted: Backport for [[gerrit:1036725|Remove the php symlink (T359643)]] (duration: 01m 00s) | [production] | 
            
  | 18:16 | <dancy@deploy1002> | Started scap: Backport for [[gerrit:1036725|Remove the php symlink (T359643)]] | [production] | 
            
  | 18:16 | <jclark@cumin1002> | START - Cookbook sre.hosts.reimage for host kafka-main1010.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:15 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1211 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P63472 and previous config saved to /var/cache/conftool/dbconfig/20240528-181503-root.json | [production] | 
            
  | 17:59 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1211 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P63471 and previous config saved to /var/cache/conftool/dbconfig/20240528-175954-root.json | [production] | 
            
  | 17:58 | <cdanis@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mw-api-int: sync | [production] |