4151-4200 of 10000 results (32ms)
2021-01-20 §
11:28 <marostegui@cumin1001> dbctl commit (dc=all): 'db1079 (re)pooling @ 100%: After moving wikireplicas to another host', diff saved to https://phabricator.wikimedia.org/P13847 and previous config saved to /var/cache/conftool/dbconfig/20210120-112808-root.json [production]
11:13 <marostegui@cumin1001> dbctl commit (dc=all): 'db1079 (re)pooling @ 75%: After moving wikireplicas to another host', diff saved to https://phabricator.wikimedia.org/P13846 and previous config saved to /var/cache/conftool/dbconfig/20210120-111305-root.json [production]
10:58 <marostegui@cumin1001> dbctl commit (dc=all): 'db1079 (re)pooling @ 50%: After moving wikireplicas to another host', diff saved to https://phabricator.wikimedia.org/P13845 and previous config saved to /var/cache/conftool/dbconfig/20210120-105801-root.json [production]
10:53 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2030.codfw.wmnet [production]
10:51 <XioNoX> Discard the non-whitelisted 172.16.0.0/12 traffic - T209082 [production]
10:49 <arturo> merging core router firewall change https://gerrit.wikimedia.org/r/c/operations/homer/public/+/657302 (T209082) [admin]
10:48 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2030.codfw.wmnet [production]
10:46 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2029.codfw.wmnet [production]
10:42 <marostegui@cumin1001> dbctl commit (dc=all): 'db1079 (re)pooling @ 25%: After moving wikireplicas to another host', diff saved to https://phabricator.wikimedia.org/P13844 and previous config saved to /var/cache/conftool/dbconfig/20210120-104257-root.json [production]
10:37 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2029.codfw.wmnet [production]
10:35 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2028.codfw.wmnet [production]
10:34 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1079 to stop replication T272008', diff saved to https://phabricator.wikimedia.org/P13842 and previous config saved to /var/cache/conftool/dbconfig/20210120-103449-marostegui.json [production]
10:26 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2028.codfw.wmnet [production]
10:26 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2027.codfw.wmnet [production]
10:17 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2027.codfw.wmnet [production]
10:16 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2026.codfw.wmnet [production]
10:07 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2026.codfw.wmnet [production]
10:05 <dcaro> Everything looks ok, created a new vm with a volume in ceph without issues, and on warnings/errors on ceph status, closing (T272303) [admin]
10:05 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2025.codfw.wmnet [production]
09:59 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2025.codfw.wmnet [production]
09:57 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2024.codfw.wmnet [production]
09:55 <dcaro> Eqiad ceph cluster uprgaded, doing sanity checks (T272303) [admin]
09:49 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2024.codfw.wmnet [production]
09:47 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2023.codfw.wmnet [production]
09:46 <dcaro> 75% of the eqiad cluster upgraded... continuing (T272303) [admin]
09:39 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2023.codfw.wmnet [production]
09:39 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2021.codfw.wmnet [production]
09:37 <dcaro> 25% of the eqiad cluster upgraded... continuing (T272303) [admin]
09:32 <moritzm> installing cuminunpriv1001 [production]
09:32 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2021.codfw.wmnet [production]
09:31 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2020.codfw.wmnet [production]
09:24 <dcaro> Mgr daemons upgraded and running, upgrading osd daemons on servers cloudcephosd1*, this make take a bit longer (T272303) [admin]
09:24 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2020.codfw.wmnet [production]
09:22 <dcaro> Mon daemons upgraded and running, upgrading mgr daemons on servers cloudcephmon1* (T272303) [admin]
09:19 <XioNoX> configure Lumen interfaces [production]
09:16 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2019.codfw.wmnet [production]
09:16 <dcaro> Starting eqiad ceph upgrade, upgrading the mon servers cloudcephmon1* (T272303) [admin]
09:09 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2019.codfw.wmnet [production]
09:08 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2018.codfw.wmnet [production]
09:01 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2018.codfw.wmnet [production]
09:01 <dcaro> Will start the ceph upgrade in 15 min, no downtime nor performance impact is expected (T272303) [admin]
00:43 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:656284|Update /analytics/legacy/homepagemodule/ schema version to 1.1.0 (T270309)]] (duration: 01m 03s) [production]
00:30 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:655863|(no-op) GrowthExperiments: Disable link recommendations (T261408)]] (duration: 01m 05s) [production]
00:09 <legoktm> uploaded docker-report 0.0.4-1~deb9u1 to stretch-wikimedia (T179696) [production]
2021-01-19 §
23:32 <bstorm> truncated 34GB error log file that was full of warnings like "Only variables should be passed by reference in /data/project/geohack/public_html/geohack.php on line 192" T272247 [tools.geohack]
23:30 <bstorm> truncated 36GB mybot.out file T272247 [tools.ping08bot]
22:57 <bstorm> truncated 75GB error log /data/project/robokobot/virgule.err T272247 [tools]
22:48 <bstorm> truncated 100GB error log /data/project/magnus-toolserver/error.log T272247 [tools]
22:43 <bstorm> truncated 107GB log '/data/project/meetbot/logs/messages.log' T272247 [tools]
22:34 <bstorm> truncating 194 GB error log '/data/project/mix-n-match/mnm-microsync.err' T272247 [tools]