| 2022-09-13 | 
  | 09:20 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Promote db2112 to s1 primary T317614', diff saved to https://phabricator.wikimedia.org/P34579 and previous config saved to /var/cache/conftool/dbconfig/20220913-092032-root.json | [production] | 
            
  | 09:19 | <marostegui> | Starting s1 codfw failover from db2103 to db2112 - T317614 | [production] | 
            
  | 09:13 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P34578 and previous config saved to /var/cache/conftool/dbconfig/20220913-091320-ladsgroup.json | [production] | 
            
  | 09:11 | <volans@cumin1001> | END (PASS) - Cookbook sre.network.cf (exit_code=0) | [production] | 
            
  | 09:11 | <volans@cumin1001> | START - Cookbook sre.network.cf | [production] | 
            
  | 09:02 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cr1-codfw,cr1-codfw IPv6,re0.cr1-codfw.mgmt with reason: router upgrade | [production] | 
            
  | 09:02 | <cmooney@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on cr1-codfw,cr1-codfw IPv6,re0.cr1-codfw.mgmt with reason: router upgrade | [production] | 
            
  | 08:58 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P34577 and previous config saved to /var/cache/conftool/dbconfig/20220913-085814-ladsgroup.json | [production] | 
            
  | 08:56 | <topranks> | Flipping primary routing engine to RE1 on cr1-codfw (disruptive) as part of upgrade. | [production] | 
            
  | 08:54 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Set db2112 with weight 0 T317614', diff saved to https://phabricator.wikimedia.org/P34576 and previous config saved to /var/cache/conftool/dbconfig/20220913-085456-marostegui.json | [production] | 
            
  | 08:54 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 37 hosts with reason: Primary switchover s1 T317614 | [production] | 
            
  | 08:54 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 1:00:00 on 37 hosts with reason: Primary switchover s1 T317614 | [production] | 
            
  | 08:46 | <topranks> | Disabled LVS/PyBal peerings on cr1-codfw in advance of upgrade to router. | [production] | 
            
  | 08:46 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1100.eqiad.wmnet | [production] | 
            
  | 08:43 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2150 (T314041)', diff saved to https://phabricator.wikimedia.org/P34575 and previous config saved to /var/cache/conftool/dbconfig/20220913-084307-ladsgroup.json | [production] | 
            
  | 08:39 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-worker1100.eqiad.wmnet | [production] | 
            
  | 08:36 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1099.eqiad.wmnet | [production] | 
            
  | 08:27 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-worker1099.eqiad.wmnet | [production] | 
            
  | 08:27 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.network.cf (exit_code=0) | [production] | 
            
  | 08:27 | <cmooney@cumin1001> | START - Cookbook sre.network.cf | [production] | 
            
  | 08:17 | <moritzm> | roll-restarting apache/FPM on mw canaries to pick up zlib security updates | [production] | 
            
  | 08:15 | <topranks> | de-pooling codfw ahead of core router upgrades at the site | [production] | 
            
  | 07:24 | <mwdebug-deploy@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 07:24 | <mwdebug-deploy@deploy1002> | helmfile [codfw] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 07:24 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 07:19 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 07:18 | <jhuneidi@deploy1002> | Finished scap: testwikis wikis to 1.39.0-wmf.28  refs T314190 (duration: 04m 29s) | [production] | 
            
  | 07:14 | <jhuneidi@deploy1002> | Started scap: testwikis wikis to 1.39.0-wmf.28  refs T314190 | [production] | 
            
  | 07:11 | <jhuneidi@deploy1002> | deploy-promote aborted:  (duration: 00m 09s) | [production] | 
            
  | 06:55 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 06:55 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 06:54 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1170:3317 (T314041)', diff saved to https://phabricator.wikimedia.org/P34574 and previous config saved to /var/cache/conftool/dbconfig/20220913-065457-ladsgroup.json | [production] | 
            
  | 06:39 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1170:3317', diff saved to https://phabricator.wikimedia.org/P34573 and previous config saved to /var/cache/conftool/dbconfig/20220913-063951-ladsgroup.json | [production] | 
            
  | 06:39 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depooling db2109 (T314041)', diff saved to https://phabricator.wikimedia.org/P34572 and previous config saved to /var/cache/conftool/dbconfig/20220913-063908-ladsgroup.json | [production] | 
            
  | 06:39 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2109.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 06:38 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2109.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 06:38 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1102.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 06:38 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1102.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 06:24 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1170:3317', diff saved to https://phabricator.wikimedia.org/P34571 and previous config saved to /var/cache/conftool/dbconfig/20220913-062444-ladsgroup.json | [production] | 
            
  | 06:09 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1170:3317 (T314041)', diff saved to https://phabricator.wikimedia.org/P34570 and previous config saved to /var/cache/conftool/dbconfig/20220913-060938-ladsgroup.json | [production] | 
            
  | 04:58 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depooling db2150 (T314041)', diff saved to https://phabricator.wikimedia.org/P34569 and previous config saved to /var/cache/conftool/dbconfig/20220913-045832-ladsgroup.json | [production] | 
            
  | 04:58 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 04:58 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 04:58 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2122 (T314041)', diff saved to https://phabricator.wikimedia.org/P34568 and previous config saved to /var/cache/conftool/dbconfig/20220913-045811-ladsgroup.json | [production] | 
            
  | 04:43 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P34567 and previous config saved to /var/cache/conftool/dbconfig/20220913-044304-ladsgroup.json | [production] | 
            
  | 04:27 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P34566 and previous config saved to /var/cache/conftool/dbconfig/20220913-042758-ladsgroup.json | [production] | 
            
  | 04:12 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2122 (T314041)', diff saved to https://phabricator.wikimedia.org/P34565 and previous config saved to /var/cache/conftool/dbconfig/20220913-041251-ladsgroup.json | [production] | 
            
  | 04:08 | <mwdebug-deploy@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 04:01 | <mwdebug-deploy@deploy1002> | helmfile [codfw] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 04:01 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply | [production] |