601-650 of 10000 results (17ms)
2021-01-20 §
10:16 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2026.codfw.wmnet [production]
10:07 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2026.codfw.wmnet [production]
10:05 <dcaro> Everything looks ok, created a new vm with a volume in ceph without issues, and on warnings/errors on ceph status, closing (T272303) [admin]
10:05 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2025.codfw.wmnet [production]
09:59 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2025.codfw.wmnet [production]
09:57 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2024.codfw.wmnet [production]
09:55 <dcaro> Eqiad ceph cluster uprgaded, doing sanity checks (T272303) [admin]
09:49 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2024.codfw.wmnet [production]
09:47 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2023.codfw.wmnet [production]
09:46 <dcaro> 75% of the eqiad cluster upgraded... continuing (T272303) [admin]
09:39 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2023.codfw.wmnet [production]
09:39 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2021.codfw.wmnet [production]
09:37 <dcaro> 25% of the eqiad cluster upgraded... continuing (T272303) [admin]
09:32 <moritzm> installing cuminunpriv1001 [production]
09:32 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2021.codfw.wmnet [production]
09:31 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2020.codfw.wmnet [production]
09:24 <dcaro> Mgr daemons upgraded and running, upgrading osd daemons on servers cloudcephosd1*, this make take a bit longer (T272303) [admin]
09:24 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2020.codfw.wmnet [production]
09:22 <dcaro> Mon daemons upgraded and running, upgrading mgr daemons on servers cloudcephmon1* (T272303) [admin]
09:19 <XioNoX> configure Lumen interfaces [production]
09:16 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2019.codfw.wmnet [production]
09:16 <dcaro> Starting eqiad ceph upgrade, upgrading the mon servers cloudcephmon1* (T272303) [admin]
09:09 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2019.codfw.wmnet [production]
09:08 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2018.codfw.wmnet [production]
09:01 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2018.codfw.wmnet [production]
09:01 <dcaro> Will start the ceph upgrade in 15 min, no downtime nor performance impact is expected (T272303) [admin]
00:43 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:656284|Update /analytics/legacy/homepagemodule/ schema version to 1.1.0 (T270309)]] (duration: 01m 03s) [production]
00:30 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:655863|(no-op) GrowthExperiments: Disable link recommendations (T261408)]] (duration: 01m 05s) [production]
00:09 <legoktm> uploaded docker-report 0.0.4-1~deb9u1 to stretch-wikimedia (T179696) [production]
2021-01-19 §
23:32 <bstorm> truncated 34GB error log file that was full of warnings like "Only variables should be passed by reference in /data/project/geohack/public_html/geohack.php on line 192" T272247 [tools.geohack]
23:30 <bstorm> truncated 36GB mybot.out file T272247 [tools.ping08bot]
22:57 <bstorm> truncated 75GB error log /data/project/robokobot/virgule.err T272247 [tools]
22:48 <bstorm> truncated 100GB error log /data/project/magnus-toolserver/error.log T272247 [tools]
22:43 <bstorm> truncated 107GB log '/data/project/meetbot/logs/messages.log' T272247 [tools]
22:34 <bstorm> truncating 194 GB error log '/data/project/mix-n-match/mnm-microsync.err' T272247 [tools]
21:52 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2314.codfw.wmnet [production]
21:51 <brennen@deploy1001> rebuilt and synchronized wikiversions files: Revert group0 wikis to 1.36.0-wmf.26 [production]
21:51 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2313.codfw.wmnet [production]
21:51 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2312.codfw.wmnet [production]
21:50 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2315.codfw.wmnet [production]
21:46 <ottomata> wiping kafka-test cluster data and starting from scratch - T255973 [production]
21:00 <Urbanecm> Start of `foreachwikiindblist group2 extensions/AbuseFilter/maintenance/MigrateAflFilter.php --batch-size=1000` (T269713) [production]
20:09 <brennen@deploy1001> rebuilt and synchronized wikiversions files: group0 wikis to 1.36.0-wmf.27 [production]
20:08 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2315.codfw.wmnet [production]
20:07 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2314.codfw.wmnet [production]
20:07 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2313.codfw.wmnet [production]
20:07 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2312.codfw.wmnet [production]
19:46 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
19:30 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
19:27 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 [production]