2021-01-29
§
|
09:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1078 (re)pooling @ 10%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14046 and previous config saved to /var/cache/conftool/dbconfig/20210129-093451-root.json |
[production] |
09:32 |
<marostegui> |
Expand lvs on db1155-db1175 T258361 |
[production] |
09:31 |
<vgutierrez> |
depool cp5006 |
[production] |
08:20 |
<marostegui> |
Change buffer pool sizes on clouddb1013,1015,1017,1019 T267090 |
[production] |
07:11 |
<marostegui> |
Upgrade pc2007 to 10.4.18 T268457 |
[production] |
06:55 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1078 to clone db1175', diff saved to https://phabricator.wikimedia.org/P14044 and previous config saved to /var/cache/conftool/dbconfig/20210129-065529-marostegui.json |
[production] |
03:35 |
<marostegui> |
Reload haproxy1018 |
[production] |
02:42 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2251.codfw.wmnet |
[production] |
02:42 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2252.codfw.wmnet |
[production] |
02:37 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2252.codfw.wmnet |
[production] |
02:37 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2251.codfw.wmnet |
[production] |
02:04 |
<krinkle@deploy1001> |
Synchronized wmf-config/profiler.php: If0c71a983772c (duration: 00m 58s) |
[production] |
01:49 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2252.codfw.wmnet with reason: REIMAGE |
[production] |
01:48 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2251.codfw.wmnet with reason: REIMAGE |
[production] |
01:46 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2252.codfw.wmnet with reason: REIMAGE |
[production] |
01:46 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2251.codfw.wmnet with reason: REIMAGE |
[production] |
01:09 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2253.codfw.wmnet |
[production] |
01:07 |
<mutante> |
repooled mw2248,mw2249 - jobrunners/videoscalers now on buster |
[production] |
01:06 |
<mutante> |
repooled mw2048,mw2049 - jobrunners/videoscalers now on buster |
[production] |
01:06 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2253.codfw.wmnet |
[production] |
01:06 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2249.codfw.wmnet |
[production] |
01:05 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2248.codfw.wmnet |
[production] |
01:03 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2249.codfw.wmnet |
[production] |
01:03 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2248.codfw.wmnet |
[production] |
00:19 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2261.codfw.wmnet |
[production] |
00:14 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2262.codfw.wmnet |
[production] |
00:13 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2283.codfw.wmnet |
[production] |
2021-01-28
§
|
23:58 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2261.codfw.wmnet |
[production] |
23:58 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2262.codfw.wmnet |
[production] |
23:57 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2283.codfw.wmnet |
[production] |
23:52 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2253.codfw.wmnet with reason: REIMAGE |
[production] |
23:49 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2253.codfw.wmnet with reason: REIMAGE |
[production] |
23:47 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2248.codfw.wmnet with reason: REIMAGE |
[production] |
23:45 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2249.codfw.wmnet with reason: REIMAGE |
[production] |
23:44 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2248.codfw.wmnet with reason: REIMAGE |
[production] |
23:43 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2249.codfw.wmnet with reason: REIMAGE |
[production] |
23:34 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2283.codfw.wmnet with reason: reimaging |
[production] |
23:33 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on mw2283.codfw.wmnet with reason: reimaging |
[production] |
23:33 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2262.codfw.wmnet with reason: REIMAGE |
[production] |
23:31 |
<legoktm@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2283.codfw.wmnet with reason: REIMAGE |
[production] |
23:31 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2261.codfw.wmnet with reason: REIMAGE |
[production] |
23:29 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2283.codfw.wmnet with reason: REIMAGE |
[production] |
23:29 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2262.codfw.wmnet with reason: REIMAGE |
[production] |
23:29 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2261.codfw.wmnet with reason: REIMAGE |
[production] |
23:14 |
<mutante> |
reimaging jobrunners/videoscallers mw2248,mw2249 |
[production] |
22:43 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.27/includes/parser/CacheTime.php: [[gerrit:658688|CacheTime: Extra protection for rollback unserialization (T273007)]] (duration: 00m 57s) |
[production] |
22:41 |
<bblack> |
eqiad lvs should be back to normal state now with everything working |
[production] |
22:39 |
<bblack> |
lvs1014 - apply https://gerrit.wikimedia.org/r/659439 |
[production] |
22:37 |
<bblack> |
lvs1013 - testing https://gerrit.wikimedia.org/r/659439 (expect nop, worked on 1015!) |
[production] |
22:36 |
<bblack> |
lvs1015 - testing https://gerrit.wikimedia.org/r/659439 (expect nop) |
[production] |