2023-08-09
§
|
08:32 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2098.codfw.wmnet with reason: Maintenance |
[production] |
08:32 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance |
[production] |
08:32 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2098.codfw.wmnet with reason: Maintenance |
[production] |
07:58 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1003.eqiad.wmnet |
[production] |
07:52 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-be1003.eqiad.wmnet |
[production] |
07:12 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:946852|testwiki: Enable Section Translation for 7 Wikipedias (T343211)]] (duration: 09m 58s) |
[production] |
07:05 |
<kartik@deploy1002> |
kartik: Continuing with sync |
[production] |
07:03 |
<kartik@deploy1002> |
kartik: Backport for [[gerrit:946852|testwiki: Enable Section Translation for 7 Wikipedias (T343211)]] synced to the testservers mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) |
[production] |
07:02 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:946852|testwiki: Enable Section Translation for 7 Wikipedias (T343211)]] |
[production] |
06:52 |
<root@cumin2002> |
END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jkieserman out of all services on: 33 hosts |
[production] |
06:51 |
<root@cumin2002> |
START - Cookbook sre.idm.logout Logging Jkieserman out of all services on: 33 hosts |
[production] |
06:51 |
<root@cumin2002> |
END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jkieserman out of all services on: 716 hosts |
[production] |
06:51 |
<root@cumin2002> |
START - Cookbook sre.idm.logout Logging Jkieserman out of all services on: 716 hosts |
[production] |
06:47 |
<root@cumin2002> |
END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jkieserman out of all services on: 1309 hosts |
[production] |
06:46 |
<root@cumin2002> |
START - Cookbook sre.idm.logout Logging Jkieserman out of all services on: 1309 hosts |
[production] |
06:46 |
<root@cumin2002> |
END (FAIL) - Cookbook sre.idm.logout (exit_code=99) Logging Jmads out of all services on: 1309 hosts |
[production] |
06:46 |
<root@cumin2002> |
START - Cookbook sre.idm.logout Logging Jmads out of all services on: 1309 hosts |
[production] |
06:18 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1146:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50222 and previous config saved to /var/cache/conftool/dbconfig/20230809-061826-ladsgroup.json |
[production] |
06:18 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance |
[production] |
06:18 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance |
[production] |
01:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2137:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50219 and previous config saved to /var/cache/conftool/dbconfig/20230809-013145-ladsgroup.json |
[production] |
01:31 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2137.codfw.wmnet with reason: Maintenance |
[production] |
01:31 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2137.codfw.wmnet with reason: Maintenance |
[production] |
01:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2136 (T342617)', diff saved to https://phabricator.wikimedia.org/P50218 and previous config saved to /var/cache/conftool/dbconfig/20230809-013124-ladsgroup.json |
[production] |
01:16 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P50217 and previous config saved to /var/cache/conftool/dbconfig/20230809-011618-ladsgroup.json |
[production] |
01:01 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P50216 and previous config saved to /var/cache/conftool/dbconfig/20230809-010112-ladsgroup.json |
[production] |
00:46 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2136 (T342617)', diff saved to https://phabricator.wikimedia.org/P50215 and previous config saved to /var/cache/conftool/dbconfig/20230809-004605-ladsgroup.json |
[production] |
00:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
00:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
00:38 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1144:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50214 and previous config saved to /var/cache/conftool/dbconfig/20230809-003817-ladsgroup.json |
[production] |
00:23 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1144:3314', diff saved to https://phabricator.wikimedia.org/P50213 and previous config saved to /var/cache/conftool/dbconfig/20230809-002310-ladsgroup.json |
[production] |
00:08 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1144:3314', diff saved to https://phabricator.wikimedia.org/P50212 and previous config saved to /var/cache/conftool/dbconfig/20230809-000804-ladsgroup.json |
[production] |
2023-08-08
§
|
23:52 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1144:3314 (T342617)', diff saved to https://phabricator.wikimedia.org/P50211 and previous config saved to /var/cache/conftool/dbconfig/20230808-235258-ladsgroup.json |
[production] |
22:33 |
<urbanecm> |
mwmaint1002: stop persistRevisionThreadItems.php frwiki instance because of T343859 (cc T315510) |
[production] |
22:04 |
<bking@deploy1002> |
Finished deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 (duration: 00m 17s) |
[production] |
22:03 |
<bking@deploy1002> |
Started deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 |
[production] |
21:57 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
21:46 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
21:46 |
<bking@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wcqs1003.eqiad.wmnet with OS bullseye |
[production] |
21:22 |
<brett> |
Exported varnish-modules 0.15.0-4 for bookworm-wikimedia (T342154) |
[production] |
21:18 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage |
[production] |
21:15 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wcqs1003.eqiad.wmnet with reason: host reimage |
[production] |
21:06 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 |
[production] |
21:06 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.debug for Netbox circuit ID 108 |
[production] |
21:04 |
<bking@cumin1001> |
conftool action : set/pooled=no; selector: name=wcqs1003.eqiad.wmnet,service=wcqs |
[production] |
21:02 |
<bking@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=wcqs,name=eqiad |
[production] |
21:02 |
<bking@cumin1001> |
START - Cookbook sre.hosts.reimage for host wcqs1003.eqiad.wmnet with OS bullseye |
[production] |
20:58 |
<bking@deploy1002> |
Finished deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 (duration: 00m 17s) |
[production] |
20:58 |
<bking@deploy1002> |
Started deploy [wdqs/wdqs@f1a6177] (wcqs): f1a6177 |
[production] |
20:57 |
<bking@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wcqs1002.eqiad.wmnet with OS bullseye |
[production] |