6851-6900 of 10000 results (43ms)
2021-01-29 §
09:32 <marostegui> Expand lvs on db1155-db1175 T258361 [production]
09:31 <vgutierrez> depool cp5006 [production]
08:20 <marostegui> Change buffer pool sizes on clouddb1013,1015,1017,1019 T267090 [production]
07:11 <marostegui> Upgrade pc2007 to 10.4.18 T268457 [production]
06:55 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1078 to clone db1175', diff saved to https://phabricator.wikimedia.org/P14044 and previous config saved to /var/cache/conftool/dbconfig/20210129-065529-marostegui.json [production]
03:35 <marostegui> Reload haproxy1018 [production]
02:42 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2251.codfw.wmnet [production]
02:42 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2252.codfw.wmnet [production]
02:37 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2252.codfw.wmnet [production]
02:37 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2251.codfw.wmnet [production]
02:04 <krinkle@deploy1001> Synchronized wmf-config/profiler.php: If0c71a983772c (duration: 00m 58s) [production]
01:49 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2252.codfw.wmnet with reason: REIMAGE [production]
01:48 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2251.codfw.wmnet with reason: REIMAGE [production]
01:46 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2252.codfw.wmnet with reason: REIMAGE [production]
01:46 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2251.codfw.wmnet with reason: REIMAGE [production]
01:09 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2253.codfw.wmnet [production]
01:07 <mutante> repooled mw2248,mw2249 - jobrunners/videoscalers now on buster [production]
01:06 <mutante> repooled mw2048,mw2049 - jobrunners/videoscalers now on buster [production]
01:06 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2253.codfw.wmnet [production]
01:06 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2249.codfw.wmnet [production]
01:05 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2248.codfw.wmnet [production]
01:03 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2249.codfw.wmnet [production]
01:03 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2248.codfw.wmnet [production]
00:19 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2261.codfw.wmnet [production]
00:14 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2262.codfw.wmnet [production]
00:13 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2283.codfw.wmnet [production]
2021-01-28 §
23:58 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2261.codfw.wmnet [production]
23:58 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2262.codfw.wmnet [production]
23:57 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2283.codfw.wmnet [production]
23:52 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2253.codfw.wmnet with reason: REIMAGE [production]
23:49 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2253.codfw.wmnet with reason: REIMAGE [production]
23:47 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2248.codfw.wmnet with reason: REIMAGE [production]
23:45 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2249.codfw.wmnet with reason: REIMAGE [production]
23:44 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2248.codfw.wmnet with reason: REIMAGE [production]
23:43 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2249.codfw.wmnet with reason: REIMAGE [production]
23:34 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2283.codfw.wmnet with reason: reimaging [production]
23:33 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on mw2283.codfw.wmnet with reason: reimaging [production]
23:33 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2262.codfw.wmnet with reason: REIMAGE [production]
23:31 <legoktm@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2283.codfw.wmnet with reason: REIMAGE [production]
23:31 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2261.codfw.wmnet with reason: REIMAGE [production]
23:29 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2283.codfw.wmnet with reason: REIMAGE [production]
23:29 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2262.codfw.wmnet with reason: REIMAGE [production]
23:29 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2261.codfw.wmnet with reason: REIMAGE [production]
23:14 <mutante> reimaging jobrunners/videoscallers mw2248,mw2249 [production]
22:43 <brennen@deploy1001> Synchronized php-1.36.0-wmf.27/includes/parser/CacheTime.php: [[gerrit:658688|CacheTime: Extra protection for rollback unserialization (T273007)]] (duration: 00m 57s) [production]
22:41 <bblack> eqiad lvs should be back to normal state now with everything working [production]
22:39 <bblack> lvs1014 - apply https://gerrit.wikimedia.org/r/659439 [production]
22:37 <bblack> lvs1013 - testing https://gerrit.wikimedia.org/r/659439 (expect nop, worked on 1015!) [production]
22:36 <bblack> lvs1015 - testing https://gerrit.wikimedia.org/r/659439 (expect nop) [production]
22:21 <bblack> lvs1016 - trying https://gerrit.wikimedia.org/r/659439 on backup LVS... [production]