2021-01-20
§
|
10:51 |
<XioNoX> |
Discard the non-whitelisted 172.16.0.0/12 traffic - T209082 |
[production] |
10:49 |
<arturo> |
merging core router firewall change https://gerrit.wikimedia.org/r/c/operations/homer/public/+/657302 (T209082) |
[admin] |
10:48 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2030.codfw.wmnet |
[production] |
10:46 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2029.codfw.wmnet |
[production] |
10:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1079 (re)pooling @ 25%: After moving wikireplicas to another host', diff saved to https://phabricator.wikimedia.org/P13844 and previous config saved to /var/cache/conftool/dbconfig/20210120-104257-root.json |
[production] |
10:37 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2029.codfw.wmnet |
[production] |
10:35 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2028.codfw.wmnet |
[production] |
10:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1079 to stop replication T272008', diff saved to https://phabricator.wikimedia.org/P13842 and previous config saved to /var/cache/conftool/dbconfig/20210120-103449-marostegui.json |
[production] |
10:26 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2028.codfw.wmnet |
[production] |
10:26 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2027.codfw.wmnet |
[production] |
10:17 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2027.codfw.wmnet |
[production] |
10:16 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2026.codfw.wmnet |
[production] |
10:07 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2026.codfw.wmnet |
[production] |
10:05 |
<dcaro> |
Everything looks ok, created a new vm with a volume in ceph without issues, and on warnings/errors on ceph status, closing (T272303) |
[admin] |
10:05 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2025.codfw.wmnet |
[production] |
09:59 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2025.codfw.wmnet |
[production] |
09:57 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2024.codfw.wmnet |
[production] |
09:55 |
<dcaro> |
Eqiad ceph cluster uprgaded, doing sanity checks (T272303) |
[admin] |
09:49 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2024.codfw.wmnet |
[production] |
09:47 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2023.codfw.wmnet |
[production] |
09:46 |
<dcaro> |
75% of the eqiad cluster upgraded... continuing (T272303) |
[admin] |
09:39 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2023.codfw.wmnet |
[production] |
09:39 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2021.codfw.wmnet |
[production] |
09:37 |
<dcaro> |
25% of the eqiad cluster upgraded... continuing (T272303) |
[admin] |
09:32 |
<moritzm> |
installing cuminunpriv1001 |
[production] |
09:32 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2021.codfw.wmnet |
[production] |
09:31 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2020.codfw.wmnet |
[production] |
09:24 |
<dcaro> |
Mgr daemons upgraded and running, upgrading osd daemons on servers cloudcephosd1*, this make take a bit longer (T272303) |
[admin] |
09:24 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2020.codfw.wmnet |
[production] |
09:22 |
<dcaro> |
Mon daemons upgraded and running, upgrading mgr daemons on servers cloudcephmon1* (T272303) |
[admin] |
09:19 |
<XioNoX> |
configure Lumen interfaces |
[production] |
09:16 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2019.codfw.wmnet |
[production] |
09:16 |
<dcaro> |
Starting eqiad ceph upgrade, upgrading the mon servers cloudcephmon1* (T272303) |
[admin] |
09:09 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2019.codfw.wmnet |
[production] |
09:08 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2018.codfw.wmnet |
[production] |
09:01 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2018.codfw.wmnet |
[production] |
09:01 |
<dcaro> |
Will start the ceph upgrade in 15 min, no downtime nor performance impact is expected (T272303) |
[admin] |
00:43 |
<tgr@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:656284|Update /analytics/legacy/homepagemodule/ schema version to 1.1.0 (T270309)]] (duration: 01m 03s) |
[production] |
00:30 |
<tgr@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:655863|(no-op) GrowthExperiments: Disable link recommendations (T261408)]] (duration: 01m 05s) |
[production] |
00:09 |
<legoktm> |
uploaded docker-report 0.0.4-1~deb9u1 to stretch-wikimedia (T179696) |
[production] |
2021-01-19
§
|
23:32 |
<bstorm> |
truncated 34GB error log file that was full of warnings like "Only variables should be passed by reference in /data/project/geohack/public_html/geohack.php on line 192" T272247 |
[tools.geohack] |
23:30 |
<bstorm> |
truncated 36GB mybot.out file T272247 |
[tools.ping08bot] |
22:57 |
<bstorm> |
truncated 75GB error log /data/project/robokobot/virgule.err T272247 |
[tools] |
22:48 |
<bstorm> |
truncated 100GB error log /data/project/magnus-toolserver/error.log T272247 |
[tools] |
22:43 |
<bstorm> |
truncated 107GB log '/data/project/meetbot/logs/messages.log' T272247 |
[tools] |
22:34 |
<bstorm> |
truncating 194 GB error log '/data/project/mix-n-match/mnm-microsync.err' T272247 |
[tools] |
21:52 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2314.codfw.wmnet |
[production] |
21:51 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: Revert group0 wikis to 1.36.0-wmf.26 |
[production] |
21:51 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2313.codfw.wmnet |
[production] |
21:51 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2312.codfw.wmnet |
[production] |