8801-8850 of 10000 results (42ms)
2021-02-11 ยง
23:23 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mwdebug2002.codfw.wmnet with reason: OS upgrade [production]
23:20 <mutante> reimaging mwdebug2002 - stretch -> buster [production]
22:57 <Urbanecm> Run scap pull at mwmaint1002 [production]
22:53 <mutante> powercycling crashed mwmaint1002 [production]
22:53 <Urbanecm> Deploy security patch for T274514 [production]
22:11 <legoktm@deploy1001> Synchronized php-1.36.0-wmf.30/extensions/GlobalWatchlist: GlobalWatchlist backports (duration: 01m 11s) [production]
22:05 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1332.eqiad.wmnet with reason: REIMAGE [production]
22:03 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1331.eqiad.wmnet with reason: REIMAGE [production]
22:03 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1332.eqiad.wmnet with reason: REIMAGE [production]
22:01 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1330.eqiad.wmnet with reason: REIMAGE [production]
22:01 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1331.eqiad.wmnet with reason: REIMAGE [production]
21:59 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1329.eqiad.wmnet with reason: REIMAGE [production]
21:59 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1330.eqiad.wmnet with reason: REIMAGE [production]
21:57 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1329.eqiad.wmnet with reason: REIMAGE [production]
21:55 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1354.eqiad.wmnet [production]
21:50 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1354.eqiad.wmnet [production]
21:50 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1355.eqiad.wmnet [production]
21:45 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1355.eqiad.wmnet [production]
21:44 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1359.eqiad.wmnet [production]
21:40 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1359.eqiad.wmnet [production]
21:36 <mutante> mw1355, mw1359 - power cycling [production]
21:23 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1354.eqiad.wmnet with reason: REIMAGE [production]
21:21 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1354.eqiad.wmnet with reason: REIMAGE [production]
21:20 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1360.eqiad.wmnet [production]
21:12 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1360.eqiad.wmnet [production]
21:05 <mutante> mw1360 - powercycling [production]
21:01 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1364.eqiad.wmnet [production]
20:59 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1364.eqiad.wmnet [production]
20:52 <mutante> mw1364 - powercycled [production]
20:44 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1355.eqiad.wmnet with reason: REIMAGE [production]
20:42 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1355.eqiad.wmnet with reason: REIMAGE [production]
20:31 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1359.eqiad.wmnet with reason: REIMAGE [production]
20:29 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1359.eqiad.wmnet with reason: REIMAGE [production]
20:26 <twentyafterfour> new train blocker preventing deploy of 1.36.0-wmf.30 to all wikis. T274589 blocks T271344 [production]
20:24 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1365.eqiad.wmnet [production]
20:23 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1365.eqiad.wmnet [production]
20:15 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1360.eqiad.wmnet with reason: REIMAGE [production]
20:13 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1360.eqiad.wmnet with reason: REIMAGE [production]
20:11 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1361.eqiad.wmnet [production]
20:09 <mutante> mw1365 - powercycle - reboot issue [production]
20:08 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1361.eqiad.wmnet [production]
20:02 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1364.eqiad.wmnet with reason: REIMAGE [production]
20:00 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1364.eqiad.wmnet with reason: REIMAGE [production]
19:55 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1362.eqiad.wmnet [production]
19:54 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1362.eqiad.wmnet [production]
19:42 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1368.eqiad.wmnet [production]
19:41 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1361.eqiad.wmnet with reason: REIMAGE [production]
19:40 <mutante> mw1368 - had the reboot via IPMI issue, did DRAC reset and repeated wmf-autoreimage, issue did not happen again [production]
19:40 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1368.eqiad.wmnet [production]
19:39 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1361.eqiad.wmnet with reason: REIMAGE [production]