2021-01-28
§
|
22:20 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2284.codfw.wmnet |
[production] |
22:16 |
<bblack> |
disabling puppet on all eqiad lvs for https://gerrit.wikimedia.org/r/659439 risks |
[production] |
22:03 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2284.codfw.wmnet |
[production] |
22:03 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2285.codfw.wmnet |
[production] |
22:02 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2286.codfw.wmnet |
[production] |
22:02 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2287.codfw.wmnet |
[production] |
21:33 |
<robh@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on db1173.eqiad.wmnet with reason: REIMAGE |
[production] |
21:32 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1172.eqiad.wmnet with reason: REIMAGE |
[production] |
21:30 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: REIMAGE |
[production] |
21:28 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1173.eqiad.wmnet with reason: REIMAGE |
[production] |
21:28 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1172.eqiad.wmnet with reason: REIMAGE |
[production] |
21:28 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1175.eqiad.wmnet with reason: REIMAGE |
[production] |
21:28 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.28 |
[production] |
21:28 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2287.codfw.wmnet with reason: reimaging |
[production] |
21:27 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on mw2287.codfw.wmnet with reason: reimaging |
[production] |
21:27 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2285.codfw.wmnet with reason: reimaging |
[production] |
21:27 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on mw2285.codfw.wmnet with reason: reimaging |
[production] |
21:27 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2284.codfw.wmnet with reason: REIMAGE |
[production] |
21:25 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2286.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<legoktm@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2285.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<legoktm@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2287.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2284.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2285.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2287.codfw.wmnet with reason: REIMAGE |
[production] |
21:23 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2286.codfw.wmnet with reason: REIMAGE |
[production] |
21:19 |
<brennen@deploy1001> |
Synchronized php: group1 wikis to 1.36.0-wmf.28 (duration: 01m 05s) |
[production] |
21:17 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.36.0-wmf.28 |
[production] |
21:15 |
<brennen> |
1.36.0-wmf.28 train status (T271342): blockers resolved, going to group1, to be followed shortly by all wikis |
[production] |
21:11 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.28/extensions/CentralAuth/includes/: Backport: [[gerrit:659362|Revert CentralAuthCreateLocalAccountJob changes in 9f79de4 (T273205)]] (duration: 01m 09s) |
[production] |
20:49 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.28/tests/phpunit/includes/parser/ParserOptionsTest.php: Backport: [[gerrit:659103|Make ParserOptions::isSafeToCache more robust (T273120)]] (duration: 01m 07s) |
[production] |
20:46 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.28/includes/parser/ParserOptions.php: Backport: [[gerrit:659103|Make ParserOptions::isSafeToCache more robust (T273120)]] (duration: 01m 08s) |
[production] |
20:25 |
<bblack> |
lvs1014,lvs1016 - all back to "normal" state |
[production] |
20:24 |
<bblack> |
lvs1014 - restart pybal |
[production] |
20:20 |
<bblack> |
lvs1016 - restart pybal |
[production] |
20:15 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@911731d]: write articletopic and drafttopic to hourly tables (duration: 01m 44s) |
[production] |
20:13 |
<bblack> |
lvs1014,lvs1016 - puppet temporarily disabled for new service config deploy - T271476 |
[production] |
20:13 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2223.codfw.wmnet |
[production] |
20:13 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2247.codfw.wmnet |
[production] |
20:13 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1264.eqiad.wmnet |
[production] |
20:13 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@911731d]: write articletopic and drafttopic to hourly tables |
[production] |
20:13 |
<mutante> |
scap pulling and repooling: mw1264, mw2223, mw2247 |
[production] |
20:11 |
<bstorm@cumin1001> |
conftool action : set/pooled=yes; selector: name=dbproxy1019.eqiad.wmnet |
[production] |
20:10 |
<bstorm@cumin1001> |
conftool action : set/pooled=yes; selector: name=dbproxy1018.eqiad.wmnet |
[production] |
20:01 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2223.codfw.wmnet |
[production] |
20:00 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2247.codfw.wmnet |
[production] |
20:00 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1264.eqiad.wmnet |
[production] |
19:57 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1171.eqiad.wmnet with reason: REIMAGE |
[production] |
19:55 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1171.eqiad.wmnet with reason: REIMAGE |
[production] |
19:53 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@ba1acd6]: airflow: start ores_predictions_daily one day earlier (duration: 01m 09s) |
[production] |
19:52 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@ba1acd6]: airflow: start ores_predictions_daily one day earlier |
[production] |