1101-1150 of 10000 results (43ms)
2022-03-30 ยง
07:41 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1160', diff saved to https://phabricator.wikimedia.org/P23745 and previous config saved to /var/cache/conftool/dbconfig/20220330-074118-marostegui.json [production]
07:39 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve-ctrl2002.codfw.wmnet [production]
07:33 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve-ctrl2002.codfw.wmnet [production]
07:33 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores1001.eqiad.wmnet [production]
07:33 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve-ctrl2001.codfw.wmnet [production]
07:33 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve-ctrl1002.eqiad.wmnet [production]
07:31 <moritzm> updating libapache2-mod-auth-cas on bullseye hosts [production]
07:27 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve-ctrl2001.codfw.wmnet [production]
07:26 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve-ctrl1002.eqiad.wmnet [production]
07:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1160 (T298557)', diff saved to https://phabricator.wikimedia.org/P23744 and previous config saved to /var/cache/conftool/dbconfig/20220330-072613-marostegui.json [production]
07:24 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve-ctrl1001.eqiad.wmnet [production]
07:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1127 (T298565)', diff saved to https://phabricator.wikimedia.org/P23743 and previous config saved to /var/cache/conftool/dbconfig/20220330-072045-ladsgroup.json [production]
07:20 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1127.eqiad.wmnet with reason: Maintenance [production]
07:20 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1127.eqiad.wmnet with reason: Maintenance [production]
07:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1174 (T298565)', diff saved to https://phabricator.wikimedia.org/P23742 and previous config saved to /var/cache/conftool/dbconfig/20220330-072037-ladsgroup.json [production]
07:16 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
07:16 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1179 (T297189)', diff saved to https://phabricator.wikimedia.org/P23741 and previous config saved to /var/cache/conftool/dbconfig/20220330-071650-marostegui.json [production]
07:16 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1179.eqiad.wmnet with reason: Maintenance [production]
07:16 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db1179.eqiad.wmnet with reason: Maintenance [production]
07:16 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
07:16 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
07:15 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve-ctrl1001.eqiad.wmnet [production]
07:15 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
07:14 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores2009.codfw.wmnet [production]
07:10 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores2009.codfw.wmnet [production]
07:10 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores2008.codfw.wmnet [production]
07:10 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
07:09 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
07:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
07:08 <taavi> UTC morning deploys done [production]
07:08 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
07:08 <taavi@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:775012|Enable Realtime Preview on testwiki (T302506)]] (duration: 00m 56s) [production]
07:06 <elukey> restart rsyslog on ml-serve1002 [production]
07:06 <marostegui@cumin1001> dbctl commit (dc=all): 'db1131 (re)pooling @ 100%: After schema change', diff saved to https://phabricator.wikimedia.org/P23740 and previous config saved to /var/cache/conftool/dbconfig/20220330-070604-root.json [production]
07:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P23739 and previous config saved to /var/cache/conftool/dbconfig/20220330-070532-ladsgroup.json [production]
07:03 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores2008.codfw.wmnet [production]
06:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1129 (T298565)', diff saved to https://phabricator.wikimedia.org/P23738 and previous config saved to /var/cache/conftool/dbconfig/20220330-065822-ladsgroup.json [production]
06:58 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1129.eqiad.wmnet with reason: Maintenance [production]
06:58 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1129.eqiad.wmnet with reason: Maintenance [production]
06:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3312 (T298565)', diff saved to https://phabricator.wikimedia.org/P23737 and previous config saved to /var/cache/conftool/dbconfig/20220330-065814-ladsgroup.json [production]
06:54 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores2007.codfw.wmnet [production]
06:51 <marostegui@cumin1001> dbctl commit (dc=all): 'db1131 (re)pooling @ 75%: After schema change', diff saved to https://phabricator.wikimedia.org/P23736 and previous config saved to /var/cache/conftool/dbconfig/20220330-065100-root.json [production]
06:50 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P23735 and previous config saved to /var/cache/conftool/dbconfig/20220330-065027-ladsgroup.json [production]
06:49 <jayme> updated scap to 4.5.0 on all hosts - T304134 [production]
06:48 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores2007.codfw.wmnet [production]
06:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3312', diff saved to https://phabricator.wikimedia.org/P23734 and previous config saved to /var/cache/conftool/dbconfig/20220330-064309-ladsgroup.json [production]
06:42 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1001.eqiad.wmnet [production]
06:40 <marostegui@cumin1001> dbctl commit (dc=all): 'db1179 (re)pooling @ 100%: After downgrade', diff saved to https://phabricator.wikimedia.org/P23733 and previous config saved to /var/cache/conftool/dbconfig/20220330-064037-root.json [production]
06:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores2006.codfw.wmnet [production]
06:39 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1145.eqiad.wmnet with reason: Maintenance [production]