|
2021-04-16
§
|
| 10:04 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2011.codfw.wmnet |
[production] |
| 10:03 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-fe2001.codfw.wmnet with reason: REIMAGE |
[production] |
| 10:00 |
<jayme> |
updated envoyproxy to 1.15.4-1 on mwdebug1001.eqiad.wmnet |
[production] |
| 09:57 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase2011.codfw.wmnet |
[production] |
| 09:55 |
<jayme> |
imported envoyproxy_1.15.4-1 to stretch-wikimedia - T280317 |
[production] |
| 09:40 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2010.codfw.wmnet |
[production] |
| 09:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 100%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15384 and previous config saved to /var/cache/conftool/dbconfig/20210416-093446-root.json |
[production] |
| 09:33 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase2010.codfw.wmnet |
[production] |
| 09:27 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2009.codfw.wmnet |
[production] |
| 09:21 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase2009.codfw.wmnet |
[production] |
| 09:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 75%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15383 and previous config saved to /var/cache/conftool/dbconfig/20210416-091942-root.json |
[production] |
| 09:13 |
<jayme> |
imported envoyproxy_1.15.4-1 to buster-wikimedia - T280317 |
[production] |
| 09:12 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet |
[production] |
| 09:04 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 50%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15380 and previous config saved to /var/cache/conftool/dbconfig/20210416-090438-root.json |
[production] |
| 08:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 25%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15374 and previous config saved to /var/cache/conftool/dbconfig/20210416-084935-root.json |
[production] |
| 08:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1169 (re)pooling @ 10%: Repool db1169', diff saved to https://phabricator.wikimedia.org/P15373 and previous config saved to /var/cache/conftool/dbconfig/20210416-083431-root.json |
[production] |
| 07:53 |
<elukey> |
run reprepro --delete clearvanished on apt1001 to clear all cloudera packages |
[production] |
| 07:41 |
<ema> |
cp-upload_ulsfo: rolling varnish-frontend-restart to apply exp policy settings changes starting from empty caches T275809 |
[production] |
| 07:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1169', diff saved to https://phabricator.wikimedia.org/P15372 and previous config saved to /var/cache/conftool/dbconfig/20210416-071936-marostegui.json |
[production] |
| 06:58 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1030.eqiad.wmnet |
[production] |
| 06:52 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1030.eqiad.wmnet |
[production] |
| 06:48 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1029.eqiad.wmnet |
[production] |
| 06:39 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1029.eqiad.wmnet |
[production] |
| 06:27 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1028.eqiad.wmnet |
[production] |
| 06:22 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2095.codfw.wmnet with reason: REIMAGE |
[production] |
| 06:20 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1028.eqiad.wmnet |
[production] |
| 06:19 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2095.codfw.wmnet with reason: REIMAGE |
[production] |
| 05:54 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts analytics-tool1001.eqiad.wmnet |
[production] |
| 05:50 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2094.codfw.wmnet with reason: REIMAGE |
[production] |
| 05:48 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2094.codfw.wmnet with reason: REIMAGE |
[production] |
| 05:42 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts analytics-tool1001.eqiad.wmnet |
[production] |
| 03:31 |
<ryankemper> |
[wdqs] `ryankemper@wdqs1013:~$ sudo systemctl restart wdqs-blazegraph` |
[production] |
| 03:26 |
<ryankemper> |
T267927 Pooled `wdqs2001` |
[production] |
| 03:22 |
<ryankemper> |
T267927 Pooled `wdqs1006` and `wdqs2002` |
[production] |
| 03:09 |
<ryankemper> |
T267927 kicked off next round of `data-transfer`s: `wdqs1004`->`wdqs1007`, `wdqs2001`->`wdqs2003`, `wdqs1003`->`wdqs1008`, `wdqs2008`->`wdqs2004` |
[production] |
| 03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
| 03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
| 03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
| 03:09 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
| 03:05 |
<ryankemper> |
T267927 Last round of `data-transfer`s finished successfully, proceeding to next round |
[production] |
| 03:04 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
| 03:04 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
| 03:04 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
| 00:30 |
<Krinkle> |
Delete old data at doc1001:/srv/doc/cover/PasswordBlacklist (ref T254799) |
[production] |
| 00:09 |
<jforrester@deploy1002> |
Finished deploy [integration/docroot@63b6fb6]: Sync with CI updates (no-op) (duration: 00m 08s) |
[production] |
| 00:09 |
<jforrester@deploy1002> |
Started deploy [integration/docroot@63b6fb6]: Sync with CI updates (no-op) |
[production] |