2021-02-26
ยง
|
16:03 |
<razzi> |
rebalance kafka partitions for webrequest_upload partition 4 |
[analytics] |
15:35 |
<dcaro> |
removed toolsbeta-test-k8s-etcd-9 with depool from kubeadmin/etcd (T274497) |
[toolsbeta] |
15:04 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:59 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:58 |
<dcaro> |
[eqiad] rebooting cloudcephosd1015 (last osd \o/) for kernel upgrade (T275753) |
[admin] |
14:57 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:51 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:51 |
<dcaro> |
[eqiad] rebooting cloudcephosd1014 for kernel upgrade (T275753) |
[admin] |
14:49 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:44 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:44 |
<dcaro> |
[eqiad] rebooting cloudcephosd1013 for kernel upgrade (T275753) |
[admin] |
14:43 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:38 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:38 |
<dcaro> |
[eqiad] rebooting cloudcephosd1012 for kernel upgrade (T275753) |
[admin] |
14:37 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:31 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:31 |
<dcaro> |
[eqiad] rebooting cloudcephosd1011 for kernel upgrade (T275753) |
[admin] |
14:30 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:25 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:25 |
<dcaro> |
[eqiad] rebooting cloudcephosd1010 for kernel upgrade (T275753) |
[admin] |
14:22 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
14:17 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
14:17 |
<dcaro> |
[eqiad] rebooting cloudcephosd1009 for kernel upgrade (T275753) |
[admin] |
13:56 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
13:54 |
<dcaro> |
[eqiad] downtimed alert1001 Ceph OSDs down alert until 18:00 GMT+1 as that is not under the host being rebooted (T275753) |
[admin] |
13:51 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
13:51 |
<dcaro> |
[eqiad] rebooting cloudcephosd1008 for kernel upgrade (T275753) |
[admin] |
13:51 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
13:45 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
13:45 |
<dcaro> |
[eqiad] rebooting cloudcephosd1007 for kernel upgrade (T275753) |
[admin] |
13:44 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) |
[production] |
13:38 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.upgrade-and-reboot |
[production] |
13:38 |
<dcaro> |
[eqiad] rebooting cloudcephosd1006 for kernel upgrade (T275753) |
[admin] |
13:05 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1031.eqiad.wmnet |
[production] |
13:00 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host mc1031.eqiad.wmnet |
[production] |
12:59 |
<effie> |
upgrade memcached on mc1031, mc2031 |
[production] |
12:40 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
12:40 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
12:40 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
12:40 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
12:33 |
<elukey> |
reimaged an-worker1096 (GPU node) to Debian buster (preserving datanode dirs) |
[analytics] |
12:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
12:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
12:22 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
12:22 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
12:19 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
12:19 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
12:18 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
12:18 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
12:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Add new vslow,dump host to codfw s4 - T275633', diff saved to https://phabricator.wikimedia.org/P14508 and previous config saved to /var/cache/conftool/dbconfig/20210226-121438-marostegui.json |
[production] |