2021-02-11
|
23:23 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mwdebug2002.codfw.wmnet with reason: OS upgrade |
[production] |
23:20 |
<mutante> |
reimaging mwdebug2002 - stretch -> buster |
[production] |
22:57 |
<Urbanecm> |
Run scap pull at mwmaint1002 |
[production] |
22:53 |
<mutante> |
powercycling crashed mwmaint1002 |
[production] |
22:53 |
<Urbanecm> |
Deploy security patch for T274514 |
[production] |
22:11 |
<legoktm@deploy1001> |
Synchronized php-1.36.0-wmf.30/extensions/GlobalWatchlist: GlobalWatchlist backports (duration: 01m 11s) |
[production] |
22:05 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1332.eqiad.wmnet with reason: REIMAGE |
[production] |
22:03 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1331.eqiad.wmnet with reason: REIMAGE |
[production] |
22:03 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1332.eqiad.wmnet with reason: REIMAGE |
[production] |
22:01 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1330.eqiad.wmnet with reason: REIMAGE |
[production] |
22:01 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1331.eqiad.wmnet with reason: REIMAGE |
[production] |
21:59 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1329.eqiad.wmnet with reason: REIMAGE |
[production] |
21:59 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1330.eqiad.wmnet with reason: REIMAGE |
[production] |
21:57 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1329.eqiad.wmnet with reason: REIMAGE |
[production] |
21:55 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1354.eqiad.wmnet |
[production] |
21:50 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1354.eqiad.wmnet |
[production] |
21:50 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1355.eqiad.wmnet |
[production] |
21:45 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1355.eqiad.wmnet |
[production] |
21:44 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1359.eqiad.wmnet |
[production] |
21:40 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1359.eqiad.wmnet |
[production] |
21:36 |
<mutante> |
mw1355, mw1359 - power cycling |
[production] |
21:23 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1354.eqiad.wmnet with reason: REIMAGE |
[production] |
21:21 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1354.eqiad.wmnet with reason: REIMAGE |
[production] |
21:20 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1360.eqiad.wmnet |
[production] |
21:12 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1360.eqiad.wmnet |
[production] |
21:05 |
<mutante> |
mw1360 - powercycling |
[production] |
21:01 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1364.eqiad.wmnet |
[production] |
20:59 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1364.eqiad.wmnet |
[production] |
20:52 |
<mutante> |
mw1364 - powercycled |
[production] |
20:44 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1355.eqiad.wmnet with reason: REIMAGE |
[production] |
20:42 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1355.eqiad.wmnet with reason: REIMAGE |
[production] |
20:31 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1359.eqiad.wmnet with reason: REIMAGE |
[production] |
20:29 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1359.eqiad.wmnet with reason: REIMAGE |
[production] |
20:26 |
<twentyafterfour> |
new train blocker preventing deploy of 1.36.0-wmf.30 to all wikis. T274589 blocks T271344 |
[production] |
20:24 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1365.eqiad.wmnet |
[production] |
20:23 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1365.eqiad.wmnet |
[production] |
20:15 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1360.eqiad.wmnet with reason: REIMAGE |
[production] |
20:13 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1360.eqiad.wmnet with reason: REIMAGE |
[production] |
20:11 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1361.eqiad.wmnet |
[production] |
20:09 |
<mutante> |
mw1365 - powercycle - reboot issue |
[production] |
20:08 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1361.eqiad.wmnet |
[production] |
20:02 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1364.eqiad.wmnet with reason: REIMAGE |
[production] |
20:00 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1364.eqiad.wmnet with reason: REIMAGE |
[production] |
19:55 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1362.eqiad.wmnet |
[production] |
19:54 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1362.eqiad.wmnet |
[production] |
19:42 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1368.eqiad.wmnet |
[production] |
19:41 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1361.eqiad.wmnet with reason: REIMAGE |
[production] |
19:40 |
<mutante> |
mw1368 - had the reboot via IPMI issue, did DRAC reset and repeated wmf-autoreimage, issue did not happen again |
[production] |
19:40 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1368.eqiad.wmnet |
[production] |
19:39 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1361.eqiad.wmnet with reason: REIMAGE |
[production] |