2022-08-04
ยง
|
15:12 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be[2058,2064].codfw.wmnet |
[production] |
15:12 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for ms-be[2058,2064].codfw.wmnet |
[production] |
15:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depool hosts for PDU maint (T310145)', diff saved to https://phabricator.wikimedia.org/P32284 and previous config saved to /var/cache/conftool/dbconfig/20220804-151121-ladsgroup.json |
[production] |
15:09 |
<godog> |
poweroff logstash2002 - T310145 |
[production] |
15:07 |
<_joe_> |
pwoering down mc203{0,1} |
[production] |
15:07 |
<root@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on logstash2002.codfw.wmnet with reason: pdu |
[production] |
15:06 |
<root@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on logstash2002.codfw.wmnet with reason: pdu |
[production] |
15:05 |
<btullis@cumin1001> |
START - Cookbook sre.aqs.roll-restart for AQS aqs cluster: Roll restart of all AQS's nodejs daemons. |
[production] |
14:58 |
<jelto> |
power off mc20[30-31] |
[production] |
14:56 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mc[2030-2031].codfw.wmnet with reason: PDU swap |
[production] |
14:56 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on mc[2030-2031].codfw.wmnet with reason: PDU swap |
[production] |
14:56 |
<XioNoX> |
draining codfw-ulsfo link - T310310 |
[production] |
14:36 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=maps2009.codfw.wmnet |
[production] |
14:35 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=maps2007.codfw.wmnet |
[production] |
14:35 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=restbase2025.codfw.wmnet |
[production] |
14:35 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=restbase2020.codfw.wmnet |
[production] |
14:35 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=restbase2016.codfw.wmnet |
[production] |
14:32 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs2011.codfw.wmnet with reason: T310145 |
[production] |
14:31 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs2011.codfw.wmnet with reason: T310145 |
[production] |
14:25 |
<jelto> |
power off gitlab-runner2003 |
[production] |
14:25 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:30:00 on gitlab-runner2003.codfw.wmnet with reason: PDU swap |
[production] |
14:25 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs2001.codfw.wmnet with reason: T310145 |
[production] |
14:24 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs2001.codfw.wmnet with reason: T310145 |
[production] |
14:24 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:30:00 on gitlab-runner2003.codfw.wmnet with reason: PDU swap |
[production] |
14:23 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2032.codfw.wmnet with reason: T310145 |
[production] |
14:22 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2032.codfw.wmnet with reason: T310145 |
[production] |
14:22 |
<root@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on logstash2035.codfw.wmnet with reason: pdu |
[production] |
14:22 |
<godog> |
poweroff logstash2035 - T310145 |
[production] |
14:22 |
<root@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on logstash2035.codfw.wmnet with reason: pdu |
[production] |
14:21 |
<Emperor> |
shutdown ms-be20[58,64].codfw.wmnet for PDU swap T310145 |
[production] |
14:20 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
14:19 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
14:19 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
14:18 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
14:14 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
14:13 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/CommonSettings-labs.php: Config: [[gerrit:820454|Remove unused $wgMathUseRestBase (T274436)]] (duration: 03m 01s) |
[production] |
14:13 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
14:12 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
14:12 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
14:11 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
14:06 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
14:05 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/CommonSettings-labs.php: Config: [[gerrit:820254|CommonSettings-labs: Fix usage of $wgSFSValidateIPListLocationMD5]] (duration: 02m 51s) |
[production] |
14:05 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
14:05 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
14:05 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2033.codfw.wmnet with reason: T310145 |
[production] |
14:04 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2033.codfw.wmnet with reason: T310145 |
[production] |
14:04 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
13:59 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/wikitech.php: Config: [[gerrit:820255|wikitech: Remove old LDAP config vars]] (duration: 02m 54s) |
[production] |
13:59 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:58 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ms-be[2058,2064].codfw.wmnet with reason: PDU work |
[production] |