2021-01-14
ยง
|
23:07 |
<jforrester@deploy1001> |
Synchronized static/images/project-logos/enwiki20.png: T272094 Sync out logo before going live, 1/3 (duration: 01m 02s) |
[production] |
23:02 |
<mutante> |
Happy 20th Birthday Wikipedia - https://20.wikipedia.org - https://gerrit.wikimedia.org/r/656268 |
[production] |
22:55 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2236.codfw.wmnet |
[production] |
22:38 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2270.codfw.wmnet |
[production] |
22:38 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2268.codfw.wmnet |
[production] |
22:38 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2269.codfw.wmnet |
[production] |
22:32 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2269.codfw.wmnet |
[production] |
22:32 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2270.codfw.wmnet |
[production] |
22:32 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2268.codfw.wmnet |
[production] |
22:04 |
<thcipriani> |
restart apache on gerrit1001 |
[production] |
21:57 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2236.codfw.wmnet with reason: REIMAGE |
[production] |
21:55 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2236.codfw.wmnet with reason: REIMAGE |
[production] |
21:53 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2270.codfw.wmnet with reason: REIMAGE |
[production] |
21:51 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2269.codfw.wmnet with reason: REIMAGE |
[production] |
21:50 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2270.codfw.wmnet with reason: REIMAGE |
[production] |
21:49 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2268.codfw.wmnet with reason: REIMAGE |
[production] |
21:48 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2269.codfw.wmnet with reason: REIMAGE |
[production] |
21:47 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2268.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2258.codfw.wmnet |
[production] |
21:23 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2255.codfw.wmnet |
[production] |
21:19 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1003.eqiad.wmnet with reason: REIMAGE |
[production] |
21:18 |
<razzi@cumin1001> |
END (PASS) - Cookbook sre.druid.reboot-workers (exit_code=0) for Druid analytics cluster: Reboot Druid nodes - razzi@cumin1001 |
[production] |
21:18 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1004.eqiad.wmnet with reason: REIMAGE |
[production] |
21:16 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2242.codfw.wmnet |
[production] |
21:16 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2241.codfw.wmnet |
[production] |
21:16 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1002.eqiad.wmnet with reason: REIMAGE |
[production] |
21:15 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1003.eqiad.wmnet with reason: REIMAGE |
[production] |
21:15 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1004.eqiad.wmnet with reason: REIMAGE |
[production] |
21:14 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1002.eqiad.wmnet with reason: REIMAGE |
[production] |
21:12 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2258.codfw.wmnet |
[production] |
21:12 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2255.codfw.wmnet |
[production] |
21:10 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2242.codfw.wmnet |
[production] |
21:10 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2241.codfw.wmnet |
[production] |
20:17 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1001.eqiad.wmnet with reason: REIMAGE |
[production] |
20:17 |
<mutante> |
ACKing all unhandled crit alerts about systemd on clouddb hosts - notifications are disabled but this cleans up Icinga web UI noise - T267090 |
[production] |
20:15 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1001.eqiad.wmnet with reason: REIMAGE |
[production] |
20:05 |
<razzi@cumin1001> |
START - Cookbook sre.druid.reboot-workers for Druid analytics cluster: Reboot Druid nodes - razzi@cumin1001 |
[production] |
19:31 |
<urbanecm@deploy1001> |
Synchronized dblists/closed.dblist: d3e274e9b953f5edda07fa3a016b7291a451ceb2: Close lrcwiki (T272041) (duration: 00m 58s) |
[production] |
19:03 |
<mutante> |
mc1024 - attempting to power on via mgmt, went down and power down |
[production] |
18:45 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2258.codfw.wmnet with reason: REIMAGE |
[production] |
18:43 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2255.codfw.wmnet with reason: REIMAGE |
[production] |
18:41 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2242.codfw.wmnet with reason: REIMAGE |
[production] |
18:41 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2258.codfw.wmnet with reason: REIMAGE |
[production] |
18:40 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2255.codfw.wmnet with reason: REIMAGE |
[production] |
18:39 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2241.codfw.wmnet with reason: REIMAGE |
[production] |
18:38 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2242.codfw.wmnet with reason: REIMAGE |
[production] |
18:38 |
<Amir1> |
started mass deletion of lrcwiki (T272041) - https://w.wiki/uPV |
[production] |
18:37 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2241.codfw.wmnet with reason: REIMAGE |
[production] |
18:36 |
<jynus> |
restarting backup1002, backup2002 T271913 |
[production] |
18:05 |
<jynus> |
restarting backup1001, backup2001 T271913 |
[production] |