7901-7950 of 10000 results (55ms)
2021-03-10 ยง
10:16 <arturo> briefly stopping deployment-puppetdb03 to disable VMX CPU flag [deployment-prep]
10:16 <arturo> briefly stopping deployment-puppetdb03 to disable VMX CPU flag [releng]
10:12 <marostegui> Drop testreduce_vd from m5 master - T276787 [production]
10:11 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2034.codfw.wmnet [production]
10:03 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2033.codfw.wmnet [production]
09:58 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2033.codfw.wmnet [production]
09:58 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2032.codfw.wmnet [production]
09:52 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2032.codfw.wmnet [production]
09:49 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2031.codfw.wmnet [production]
09:40 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2031.codfw.wmnet [production]
09:37 <arturo> draining cloudvirt1023 for T275753 [admin]
09:35 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2030.codfw.wmnet [production]
09:30 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2030.codfw.wmnet [production]
09:27 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2029.codfw.wmnet [production]
09:25 <aborrero@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2003-dev.codfw.wmnet with reason: REIMAGE [production]
09:23 <aborrero@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2003-dev.codfw.wmnet with reason: REIMAGE [production]
09:21 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2029.codfw.wmnet [production]
09:18 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2028.codfw.wmnet [production]
09:12 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ms-be2028.codfw.wmnet [production]
09:07 <arturo> [codfw1dev] reimaging cloudvirt2003-dev (T276964) [admin]
08:39 <marostegui> Upgrade mysql and kernel on db2132 [production]
08:25 <marostegui> Upgrade mysql and kernel on db2078 [production]
08:21 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thorium.eqiad.wmnet [production]
08:20 <moritzm> pruning obsolete kernels from ganeti hosts in eqiad/codfw [production]
08:17 <moritzm> powercycling thorium, stuck on reboot [production]
08:16 <marostegui@cumin1001> dbctl commit (dc=all): 'db1085 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P14719 and previous config saved to /var/cache/conftool/dbconfig/20210310-081627-root.json [production]
08:11 <marostegui> Check tables on db1150:3315 - T276742 [production]
08:09 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host thorium.eqiad.wmnet [production]
08:05 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host analytics-tool1001.eqiad.wmnet [production]
08:03 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host analytics-tool1001.eqiad.wmnet [production]
08:01 <marostegui@cumin1001> dbctl commit (dc=all): 'db1085 (re)pooling @ 60%: 10', diff saved to https://phabricator.wikimedia.org/P14718 and previous config saved to /var/cache/conftool/dbconfig/20210310-080123-root.json [production]
07:52 <marostegui> Deploy schema change on s7 codfw (lag will appear) T276150 T276156 [production]
07:46 <marostegui@cumin1001> dbctl commit (dc=all): 'db1085 (re)pooling @ 30%: 10', diff saved to https://phabricator.wikimedia.org/P14717 and previous config saved to /var/cache/conftool/dbconfig/20210310-074618-root.json [production]
07:33 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite1004.eqiad.wmnet [production]
07:29 <filippo@cumin1001> START - Cookbook sre.hosts.reboot-single for host graphite1004.eqiad.wmnet [production]
07:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1085 for schema change', diff saved to https://phabricator.wikimedia.org/P14716 and previous config saved to /var/cache/conftool/dbconfig/20210310-072642-marostegui.json [production]
07:25 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1113:3316', diff saved to https://phabricator.wikimedia.org/P14715 and previous config saved to /var/cache/conftool/dbconfig/20210310-072508-marostegui.json [production]
07:07 <elukey> sudo apt-get remove linux-image-4.9.0-9-amd64 on sodium to free space for /boot [production]
07:06 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db2145', diff saved to https://phabricator.wikimedia.org/P14714 and previous config saved to /var/cache/conftool/dbconfig/20210310-070642-marostegui.json [production]
07:05 <elukey> all hadoop worker nodes on Buster [analytics]
07:03 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1113:3316 for schema change', diff saved to https://phabricator.wikimedia.org/P14713 and previous config saved to /var/cache/conftool/dbconfig/20210310-070312-marostegui.json [production]
07:01 <elukey> remove the oldest kernel on ganeti nodes to free space for /boot [production]
07:00 <marostegui> Depool clouddb1016 [production]
06:45 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1111.eqiad.wmnet with reason: REIMAGE [production]
06:43 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1111.eqiad.wmnet with reason: REIMAGE [production]
06:28 <elukey> force the re-run of refine_eventlogging_legacy - failed due to worker reimage in progress [analytics]
06:17 <elukey> reimage an-worker1111 to buster [production]
06:17 <elukey> reimage an-worker1111 to buster [analytics]
05:27 <ryankemper> T266470 Rollout of updated certificate complete. We're now ready to implement envoy for `wdqs-test` which will allow `wdqs1009` to be reachable via port 443 and thereby allow us to go live with `query-preview.wikidata.org` when the time comes [production]
05:26 <ryankemper> T266470 `ryankemper@cumin1001:~$ sudo -E cumin 'A:wdqs-all' 'sudo enable-puppet "revoking old cert and generating new one with new alt_names - T266470 - root"'` and `ryankemper@cumin1001:~$ sudo -E cumin 'A:wdqs-all' 'sudo run-puppet-agent'` [production]