2020-09-29
12:53 | <moritzm> | installing QT security updates | [production]
12:29 | <kormat@cumin1001> | dbctl commit (dc=all): 'db2108 depooling: schema change T259831', diff saved to https://phabricator.wikimedia.org/P12836 and previous config saved to /var/cache/conftool/dbconfig/20200929-122914-kormat.json | [production]
12:28 | <kormat@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
12:28 | <kormat@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
12:28 | <kormat@cumin1001> | dbctl commit (dc=all): 'Temporarily add db2126 to dump/vslow T259831', diff saved to https://phabricator.wikimedia.org/P12835 and previous config saved to /var/cache/conftool/dbconfig/20200929-122811-kormat.json | [production]
12:05 | <jayme@deploy1001> | helmfile [codfw] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . | [production]
11:54 | <jayme@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . | [production]
11:28 | <vgutierrez> | disabling DHE-RSA-AES128-SHA support - T258405 | [production]
11:18 | <marostegui@cumin1001> | dbctl commit (dc=all): 'es2026 (re)pooling @ 100%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12834 and previous config saved to /var/cache/conftool/dbconfig/20200929-111804-root.json | [production]
11:03 | <marostegui@cumin1001> | dbctl commit (dc=all): 'es2026 (re)pooling @ 75%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12833 and previous config saved to /var/cache/conftool/dbconfig/20200929-110300-root.json | [production]
10:47 | <marostegui@cumin1001> | dbctl commit (dc=all): 'es2026 (re)pooling @ 50%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12832 and previous config saved to /var/cache/conftool/dbconfig/20200929-104757-root.json | [production]
10:42 | <XioNoX> | re-enable TFTP ALGs on all mr | [production]
10:42 | <hnowlan@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
10:40 | <hnowlan@cumin1001> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | [production]
10:40 | <hnowlan@cumin1001> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | [production]
10:40 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
10:40 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
10:40 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
10:39 | <moritzm> | installing libdbi-perl security updates for stretch/buster | [production]
10:32 | <marostegui@cumin1001> | dbctl commit (dc=all): 'es2026 (re)pooling @ 25%: After reboot to troubleshoot a degraded RAID', diff saved to https://phabricator.wikimedia.org/P12831 and previous config saved to /var/cache/conftool/dbconfig/20200929-103253-root.json | [production]
10:16 | <jayme@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . | [production]
10:07 | <kormat@cumin1001> | dbctl commit (dc=all): 'Promote db1104 on s8 eqiad master T239238', diff saved to https://phabricator.wikimedia.org/P12830 and previous config saved to /var/cache/conftool/dbconfig/20200929-100723-kormat.json | [production]
10:05 | <kormat> | Starting s8 eqiad failover from db1109 to db1104 - T239238 | [production]
10:01 | <hnowlan@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
09:59 | <hnowlan@cumin1001> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | [production]
09:59 | <hnowlan@cumin1001> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | [production]
09:59 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
09:59 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
09:59 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
09:51 | <kormat@cumin1001> | dbctl commit (dc=all): 'Set db1104 with weight 0 T239238', diff saved to https://phabricator.wikimedia.org/P12829 and previous config saved to /var/cache/conftool/dbconfig/20200929-095135-kormat.json | [production]
09:51 | <kormat@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
09:51 | <kormat@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
09:47 | <jayme@deploy1001> | helmfile [staging] Ran 'sync' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . | [production]
09:17 | <marostegui> | Depool labsdb1010 from web role | [production]
09:08 | <jbond42> | update rails on puppetmasters | [production]
08:21 | <jayme> | switching esams pybal back to conf1006 - T196487 | [production]
08:01 | <ema> | cp3050: varnish upgrade to 6.0.6-1wm1 T263557 | [production]
07:55 | <gehel> | badblocks check on wdqs1009 - T263125 | [production]
07:46 | <marostegui> | Stop MySQL on es2019 before decommissioning T264063 | [production]
07:46 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Remove es2019 from dbctl T264063', diff saved to https://phabricator.wikimedia.org/P12825 and previous config saved to /var/cache/conftool/dbconfig/20200929-074602-marostegui.json | [production]
06:05 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Depool es2019 T264063', diff saved to https://phabricator.wikimedia.org/P12824 and previous config saved to /var/cache/conftool/dbconfig/20200929-060538-marostegui.json | [production]
06:02 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Promote es2034 as es3 master in codfw T261717', diff saved to https://phabricator.wikimedia.org/P12823 and previous config saved to /var/cache/conftool/dbconfig/20200929-060253-marostegui.json | [production]
05:13 | <marostegui> | Stop mysql and reboot es2026 - T263837 | [production]
05:12 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Depool es2026 T263837', diff saved to https://phabricator.wikimedia.org/P12822 and previous config saved to /var/cache/conftool/dbconfig/20200929-051236-marostegui.json | [production]
05:10 | <marostegui> | Remove es2013 from tendril and zarcillo T263740 | [production]
05:06 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) | [production]
04:59 | <marostegui@cumin1001> | START - Cookbook sre.hosts.decommission | [production]
03:15 | <andrew@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
03:13 | <andrew@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
03:12 | <andrew@cumin1001> | START - Cookbook sre.hosts.downtime | [production]