2021-04-16
10:08 |
<jayme> |
updated envoyproxy to 1.15.4-1 on mw1325.eqiad.wmnet,restbase1026.eqiad.wmnet |
[production] |
10:05 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-fe2001.codfw.wmnet with reason: REIMAGE |
[production] |
10:04 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2011.codfw.wmnet |
[production] |
10:03 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-fe2001.codfw.wmnet with reason: REIMAGE |
[production] |
10:00 |
<jayme> |
updated envoyproxy to 1.15.4-1 on mwdebug1001.eqiad.wmnet |
[production] |
09:57 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase2011.codfw.wmnet |
[production] |
09:55 |
<jayme> |
imported envoyproxy_1.15.4-1 to stretch-wikimedia - T280317 |
[production] |
09:40 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2010.codfw.wmnet |
[production] |
09:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 100%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15384 and previous config saved to /var/cache/conftool/dbconfig/20210416-093446-root.json |
[production] |
09:33 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase2010.codfw.wmnet |
[production] |
09:27 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2009.codfw.wmnet |
[production] |
09:21 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase2009.codfw.wmnet |
[production] |
09:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 75%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15383 and previous config saved to /var/cache/conftool/dbconfig/20210416-091942-root.json |
[production] |
09:13 |
<jayme> |
imported envoyproxy_1.15.4-1 to buster-wikimedia - T280317 |
[production] |
09:12 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet |
[production] |
09:04 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 50%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15380 and previous config saved to /var/cache/conftool/dbconfig/20210416-090438-root.json |
[production] |
08:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 25%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15374 and previous config saved to /var/cache/conftool/dbconfig/20210416-084935-root.json |
[production] |
08:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 10%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15373 and previous config saved to /var/cache/conftool/dbconfig/20210416-083431-root.json |
[production] |
07:53 |
<elukey> |
run reprepro --delete clearvanished on apt1001 to clear all cloudera packages |
[production] |
07:41 |
<ema> |
cp-upload_ulsfo: rolling varnish-frontend-restart to apply exp policy settings changes starting from empty caches T275809 |
[production] |
07:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1169', diff saved to https://phabricator.wikimedia.org/P15372 and previous config saved to /var/cache/conftool/dbconfig/20210416-071936-marostegui.json |
[production] |
06:58 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1030.eqiad.wmnet |
[production] |
06:52 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1030.eqiad.wmnet |
[production] |
06:48 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1029.eqiad.wmnet |
[production] |
06:39 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1029.eqiad.wmnet |
[production] |
06:27 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1028.eqiad.wmnet |
[production] |
06:22 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2095.codfw.wmnet with reason: REIMAGE |
[production] |
06:20 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1028.eqiad.wmnet |
[production] |
06:19 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2095.codfw.wmnet with reason: REIMAGE |
[production] |
05:54 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts analytics-tool1001.eqiad.wmnet |
[production] |
05:50 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2094.codfw.wmnet with reason: REIMAGE |
[production] |
05:48 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2094.codfw.wmnet with reason: REIMAGE |
[production] |
05:42 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts analytics-tool1001.eqiad.wmnet |
[production] |
03:31 |
<ryankemper> |
[wdqs] `ryankemper@wdqs1013:~$ sudo systemctl restart wdqs-blazegraph` |
[production] |
03:26 |
<ryankemper> |
T267927 Pooled `wdqs2001` |
[production] |
03:22 |
<ryankemper> |
T267927 Pooled `wdqs1006` and `wdqs2002` |
[production] |
03:09 |
<ryankemper> |
T267927 kicked off next round of `data-transfer`s: `wdqs1004`->`wdqs1007`, `wdqs2001`->`wdqs2003`, `wdqs1003`->`wdqs1008`, `wdqs2008`->`wdqs2004` |
[production] |
03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
03:05 |
<ryankemper> |
T267927 Last round of `data-transfer`s finished successfully, proceeding to next round |
[production] |
03:04 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
03:04 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
03:04 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
00:30 |
<Krinkle> |
Delete old data at doc1001:/srv/doc/cover/PasswordBlacklist (ref T254799) |
[production] |
00:09 |
<jforrester@deploy1002> |
Finished deploy [integration/docroot@63b6fb6]: Sync with CI updates (no-op) (duration: 00m 08s) |
[production] |
00:09 |
<jforrester@deploy1002> |
Started deploy [integration/docroot@63b6fb6]: Sync with CI updates (no-op) |
[production] |