2020-07-20
ยง
|
16:27 |
<akosiaris> |
increase codfw mobileapps kubernetes traffic to 25% T218733. Take #2 |
[production] |
16:27 |
<akosiaris@cumin1001> |
conftool action : set/weight=8; selector: dc=codfw,service=mobileapps,name=scb.* |
[production] |
15:59 |
<elukey> |
restart airflow-webserver/scheduler to pick up TLS to mysql settings |
[production] |
15:21 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:21 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:17 |
<hnowlan> |
draining and restarting sessionstore2002 |
[production] |
15:17 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:17 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:16 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
15:13 |
<jynus> |
dropping and recreating nagios@localhost users on all m1 servers |
[production] |
15:12 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
15:09 |
<hnowlan> |
draining and restarting sessionstore2001 |
[production] |
15:09 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:09 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:09 |
<hnowlan@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:09 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:08 |
<moritzm> |
draining restbase2023 for eventual reboot for kernel security update |
[production] |
15:04 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
15:00 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
14:56 |
<moritzm> |
draining restbase2022 for eventual reboot for kernel security update |
[production] |
14:56 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:56 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:54 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
14:52 |
<hnowlan> |
draining and restarting sessionstore1003 |
[production] |
14:52 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:52 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:51 |
<mholloway-shell@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mobileapps' for release 'nontls' . |
[production] |
14:51 |
<mholloway-shell@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
14:49 |
<mholloway-shell@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
14:49 |
<mholloway-shell@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'mobileapps' for release 'nontls' . |
[production] |
14:49 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
14:47 |
<mholloway-shell@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . |
[production] |
14:47 |
<moritzm> |
draining restbase2021 for eventual reboot for kernel security update |
[production] |
14:44 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:43 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:37 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
14:36 |
<mholloway-shell@deploy1001> |
Finished deploy [mobileapps/deploy@ff49fdf]: Update mobileapps to 0bf7bafa (duration: 03m 50s) |
[production] |
14:34 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:34 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:34 |
<hnowlan> |
starting drain and restart of sessionstore hosts for new kernel |
[production] |
14:33 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
14:32 |
<mholloway-shell@deploy1001> |
Started deploy [mobileapps/deploy@ff49fdf]: Update mobileapps to 0bf7bafa |
[production] |
14:26 |
<moritzm> |
draining restbase2020 for eventual reboot for kernel security update |
[production] |
14:23 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:23 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:20 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
14:17 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
14:14 |
<moritzm> |
draining restbase2019 for eventual reboot for kernel security update |
[production] |
14:08 |
<ema> |
lvs101[34] (primaries) - restart pybal to apply varnish healthcheck changes https://gerrit.wikimedia.org/r/c/operations/puppet/+/610047 T255015 |
[production] |
14:07 |
<ema> |
lvs1016 (secondary) - restart pybal to apply varnish healthcheck changes https://gerrit.wikimedia.org/r/c/operations/puppet/+/610047 T255015 |
[production] |