2021-01-28 §
23:29 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2262.codfw.wmnet with reason: REIMAGE [production]
23:29 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2261.codfw.wmnet with reason: REIMAGE [production]
23:14 <mutante> reimaging jobrunners/videoscallers mw2248,mw2249 [production]
22:43 <brennen@deploy1001> Synchronized php-1.36.0-wmf.27/includes/parser/CacheTime.php: [[gerrit:658688|CacheTime: Extra protection for rollback unserialization (T273007)]] (duration: 00m 57s) [production]
22:41 <bblack> eqiad lvs should be back to normal state now with everything working [production]
22:39 <bblack> lvs1014 - apply https://gerrit.wikimedia.org/r/659439 [production]
22:37 <bblack> lvs1013 - testing https://gerrit.wikimedia.org/r/659439 (expect nop, worked on 1015!) [production]
22:36 <bblack> lvs1015 - testing https://gerrit.wikimedia.org/r/659439 (expect nop) [production]
22:21 <bblack> lvs1016 - trying https://gerrit.wikimedia.org/r/659439 on backup LVS... [production]
22:21 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2287.codfw.wmnet [production]
22:21 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2286.codfw.wmnet [production]
22:20 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2285.codfw.wmnet [production]
22:20 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2284.codfw.wmnet [production]
22:16 <bblack> disabling puppet on all eqiad lvs for https://gerrit.wikimedia.org/r/659439 risks [production]
22:03 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2284.codfw.wmnet [production]
22:03 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2285.codfw.wmnet [production]
22:02 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2286.codfw.wmnet [production]
22:02 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2287.codfw.wmnet [production]
21:33 <robh@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on db1173.eqiad.wmnet with reason: REIMAGE [production]
21:32 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1172.eqiad.wmnet with reason: REIMAGE [production]
21:30 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: REIMAGE [production]
21:28 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1173.eqiad.wmnet with reason: REIMAGE [production]
21:28 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1172.eqiad.wmnet with reason: REIMAGE [production]
21:28 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1175.eqiad.wmnet with reason: REIMAGE [production]
21:28 <brennen@deploy1001> rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.28 [production]
21:28 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2287.codfw.wmnet with reason: reimaging [production]
21:27 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on mw2287.codfw.wmnet with reason: reimaging [production]
21:27 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2285.codfw.wmnet with reason: reimaging [production]
21:27 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on mw2285.codfw.wmnet with reason: reimaging [production]
21:27 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2284.codfw.wmnet with reason: REIMAGE [production]
21:25 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2286.codfw.wmnet with reason: REIMAGE [production]
21:23 <legoktm@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2285.codfw.wmnet with reason: REIMAGE [production]
21:23 <legoktm@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2287.codfw.wmnet with reason: REIMAGE [production]
21:23 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2284.codfw.wmnet with reason: REIMAGE [production]
21:23 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2285.codfw.wmnet with reason: REIMAGE [production]
21:23 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2287.codfw.wmnet with reason: REIMAGE [production]
21:23 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2286.codfw.wmnet with reason: REIMAGE [production]
21:19 <brennen@deploy1001> Synchronized php: group1 wikis to 1.36.0-wmf.28 (duration: 01m 05s) [production]
21:17 <brennen@deploy1001> rebuilt and synchronized wikiversions files: group1 wikis to 1.36.0-wmf.28 [production]
21:15 <brennen> 1.36.0-wmf.28 train status (T271342): blockers resolved, going to group1, to be followed shortly by all wikis [production]
21:11 <brennen@deploy1001> Synchronized php-1.36.0-wmf.28/extensions/CentralAuth/includes/: Backport: [[gerrit:659362|Revert CentralAuthCreateLocalAccountJob changes in 9f79de4 (T273205)]] (duration: 01m 09s) [production]
20:49 <brennen@deploy1001> Synchronized php-1.36.0-wmf.28/tests/phpunit/includes/parser/ParserOptionsTest.php: Backport: [[gerrit:659103|Make ParserOptions::isSafeToCache more robust (T273120)]] (duration: 01m 07s) [production]
20:46 <brennen@deploy1001> Synchronized php-1.36.0-wmf.28/includes/parser/ParserOptions.php: Backport: [[gerrit:659103|Make ParserOptions::isSafeToCache more robust (T273120)]] (duration: 01m 08s) [production]
20:25 <bblack> lvs1014,lvs1016 - all back to "normal" state [production]
20:24 <bblack> lvs1014 - restart pybal [production]
20:20 <bblack> lvs1016 - restart pybal [production]
20:15 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@911731d]: write articletopic and drafttopic to hourly tables (duration: 01m 44s) [production]
20:13 <bblack> lvs1014,lvs1016 - puppet temporarily disabled for new service config deploy - T271476 [production]
20:13 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2223.codfw.wmnet [production]
20:13 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2247.codfw.wmnet [production]