4151-4200 of 10000 results (25ms)
2020-09-29 ยง
12:28 <kormat@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:28 <kormat@cumin1001> dbctl commit (dc=all): 'Temporarily add db2126 to dump/vslow T259831', diff saved to https://phabricator.wikimedia.org/P12835 and previous config saved to /var/cache/conftool/dbconfig/20200929-122811-kormat.json [production]
12:05 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . [production]
11:54 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . [production]
11:28 <vgutierrez> disabling DHE-RSA-AES128-SHA support - T258405 [production]
11:18 <marostegui@cumin1001> dbctl commit (dc=all): 'es2026 (re)pooling @ 100%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12834 and previous config saved to /var/cache/conftool/dbconfig/20200929-111804-root.json [production]
11:03 <marostegui@cumin1001> dbctl commit (dc=all): 'es2026 (re)pooling @ 75%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12833 and previous config saved to /var/cache/conftool/dbconfig/20200929-110300-root.json [production]
10:47 <marostegui@cumin1001> dbctl commit (dc=all): 'es2026 (re)pooling @ 50%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12832 and previous config saved to /var/cache/conftool/dbconfig/20200929-104757-root.json [production]
10:42 <XioNoX> re-enable TFTP ALGs on all mr [production]
10:42 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:40 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
10:40 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
10:40 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:40 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:40 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:39 <moritzm> installing libdbi-perl security updates for stretch/buster [production]
10:32 <marostegui@cumin1001> dbctl commit (dc=all): 'es2026 (re)pooling @ 25%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12831 and previous config saved to /var/cache/conftool/dbconfig/20200929-103253-root.json [production]
10:16 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . [production]
10:07 <kormat@cumin1001> dbctl commit (dc=all): 'Promote db1104 on s8 eqiad master T239238', diff saved to https://phabricator.wikimedia.org/P12830 and previous config saved to /var/cache/conftool/dbconfig/20200929-100723-kormat.json [production]
10:05 <kormat> Starting s8 eqiad failover from db1109 to db1104 - T239238 [production]
10:01 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:59 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
09:59 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
09:59 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:59 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:59 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:51 <kormat@cumin1001> dbctl commit (dc=all): 'Set db1104 with weight 0 T239238', diff saved to https://phabricator.wikimedia.org/P12829 and previous config saved to /var/cache/conftool/dbconfig/20200929-095135-kormat.json [production]
09:51 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:51 <kormat@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:47 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . [production]
09:17 <marostegui> Depool labsdb1010 from web role [production]
09:08 <jbond42> update rails on puppetmasters [production]
08:21 <jayme> switching esams pybal back to conf1006 - T196487 [production]
08:01 <ema> cp3050: varnish upgrade to 6.0.6-1wm1 T263557 [production]
07:55 <gehel> badblocks check on wdqs1009 - T263125 [production]
07:46 <marostegui> Stop MySQL on es2019 before decommissioning T264063 [production]
07:46 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove es2019 from dbctl T264063', diff saved to https://phabricator.wikimedia.org/P12825 and previous config saved to /var/cache/conftool/dbconfig/20200929-074602-marostegui.json [production]
06:05 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es2019 T264063', diff saved to https://phabricator.wikimedia.org/P12824 and previous config saved to /var/cache/conftool/dbconfig/20200929-060538-marostegui.json [production]
06:02 <marostegui@cumin1001> dbctl commit (dc=all): 'Promote es2034 as es3 master in codfw T261717', diff saved to https://phabricator.wikimedia.org/P12823 and previous config saved to /var/cache/conftool/dbconfig/20200929-060253-marostegui.json [production]
05:13 <marostegui> Stop mysql and reboot es2026 - T263837 [production]
05:12 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es2026 T263837', diff saved to https://phabricator.wikimedia.org/P12822 and previous config saved to /var/cache/conftool/dbconfig/20200929-051236-marostegui.json [production]
05:10 <marostegui> Remove es2013 from tendril and zarcillo T263740 [production]
05:06 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
04:59 <marostegui@cumin1001> START - Cookbook sre.hosts.decommission [production]
03:15 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
03:13 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
03:12 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
03:12 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
03:11 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
03:09 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]