|
2020-11-06
§
|
| 16:20 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes:weight=10; selector: dc=codfw,cluster=maps,service=kartotherian,name=maps2005.codfw.wmnet |
[production] |
| 14:46 |
<moritzm> |
installing wireshark security updates |
[production] |
| 14:36 |
<hnowlan> |
resyncing database on maps1001 |
[production] |
| 14:25 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 14:24 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 14:05 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 14:03 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 14:01 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 14:01 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 13:05 |
<hnowlan> |
started cassandra bootstrap of maps2005 |
[production] |
| 11:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 11:50 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 11:49 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 11:47 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 11:30 |
<hnowlan> |
joining maps2005 to cassandra cluster |
[production] |
| 11:24 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 11:22 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 11:20 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 11:19 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 11:09 |
<moritzm> |
uploaded openjdk-8 8u272-b10-1~deb10u1 to buster-wikimedia/component/jdk |
[production] |
| 10:54 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
| 10:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 10:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 10:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 10:06 |
<dcausse> |
restarted elastic on elastic1063 (T265113) |
[production] |
| 09:57 |
<moritzm> |
installing spice security updates |
[production] |
| 09:32 |
<moritzm> |
installing libsndfile security updates |
[production] |
| 09:15 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
| 09:13 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 09:12 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
| 09:12 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
| 08:14 |
<moritzm> |
installing openldap security updates on stretch/buster (client-side tools/libs only, slapd updates already deployed) |
[production] |
| 04:38 |
<ryankemper> |
[Deploy finished] WDQS deploy is complete; the service is healthy per https://grafana.wikimedia.org/d/000000489/wikidata-query-service?orgId=1&from=1604633917530&to=1604637475930 |
[production] |
| 04:36 |
<ryankemper> |
Finished restarting wdqs categories one host at a time across all wdqs production instances |
[production] |
| 04:02 |
<ryankemper> |
Restarting wdqs categories one host at a time across all wdqs production instances: `sudo -E cumin -b 1 'A:wdqs-all and not A:wdqs-test' 'depool && sleep 60 && systemctl restart wdqs-categories && sleep 30 && pool'` (in progress) |
[production] |
| 04:01 |
<ryankemper> |
Restarted wdqs categories across test hosts: `sudo -E cumin 'A:wdqs-test' 'systemctl restart wdqs-categories'` |
[production] |
| 04:01 |
<ryankemper> |
Restarted wdqs updater across all hosts: `sudo -E cumin -b 4 'A:wdqs-all' 'systemctl restart wdqs-updater'` |
[production] |
| 04:00 |
<ryankemper> |
`query.wikidata.org` looks good following deploy, proceeding to post-deploy steps |
[production] |
| 03:59 |
<ryankemper@deploy1001> |
Finished deploy [wdqs/wdqs@27a5c54]: 0.3.54 (duration: 11m 22s) |
[production] |
| 03:51 |
<ryankemper> |
Tests passing on canary `wdqs1003` following initial deployment, proceeding with deploy to rest of fleet |
[production] |
| 03:48 |
<ryankemper@deploy1001> |
Started deploy [wdqs/wdqs@27a5c54]: 0.3.54 |
[production] |
| 03:48 |
<ryankemper> |
About to begin wdqs deploy, tests passing on canary `wdqs1003` |
[production] |
| 00:52 |
<brennen@deploy1001> |
Finished scap: Synchronizing to pick up i18n for [[gerrit:639505]]. Will resume moving train to group1 on Monday morning (US) (T263182) (duration: 69m 02s) |
[production] |