2020-11-06
§
|
16:20 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes:weight=10; selector: dc=codfw,cluster=maps,service=kartotherian,name=maps2005.codfw.wmnet |
[production] |
14:46 |
<moritzm> |
installing wireshark security updates |
[production] |
14:36 |
<hnowlan> |
resyncing database on maps1001 |
[production] |
14:25 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:24 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:05 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:03 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:01 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:01 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:05 |
<hnowlan> |
started cassandra bootstrap of maps2005 |
[production] |
11:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:50 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:49 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:47 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:30 |
<hnowlan> |
joining maps2005 to cassandra cluster |
[production] |
11:24 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:22 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:20 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:19 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:09 |
<moritzm> |
uploaded openjdk-8 8u272-b10-1~deb10u1 to buster-wikimedia/component/jdk |
[production] |
10:54 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:06 |
<dcausse> |
restarted elastic on elastic1063 (T265113) |
[production] |
09:57 |
<moritzm> |
installing spice security updates |
[production] |
09:32 |
<moritzm> |
installing libsndfile security updates |
[production] |
09:15 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:13 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:12 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
09:12 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:14 |
<moritzm> |
installing openldap security updates on stretch/buster (client-side tools/libs only, slapd updates already deployed) |
[production] |
04:38 |
<ryankemper> |
[Deploy finished] WDQS deploy is complete; the service is healthy per https://grafana.wikimedia.org/d/000000489/wikidata-query-service?orgId=1&from=1604633917530&to=1604637475930 |
[production] |
04:36 |
<ryankemper> |
Finished restarting wdqs categories one host at a time across all wdqs production instances |
[production] |
04:02 |
<ryankemper> |
Restarting wdqs categories one host at a time across all wdqs production instances: `sudo -E cumin -b 1 'A:wdqs-all and not A:wdqs-test' 'depool && sleep 60 && systemctl restart wdqs-categories && sleep 30 && pool'` (in progress) |
[production] |
04:01 |
<ryankemper> |
Restarted wdqs categories across test hosts: `sudo -E cumin 'A:wdqs-test' 'systemctl restart wdqs-categories'` |
[production] |
04:01 |
<ryankemper> |
Restarted wdqs updater across all hosts: `sudo -E cumin -b 4 'A:wdqs-all' 'systemctl restart wdqs-updater'` |
[production] |
04:00 |
<ryankemper> |
`query.wikidata.org` looks good following deploy, proceeding to post-deploy steps |
[production] |
03:59 |
<ryankemper@deploy1001> |
Finished deploy [wdqs/wdqs@27a5c54]: 0.3.54 (duration: 11m 22s) |
[production] |
03:51 |
<ryankemper> |
Tests passing on canary `wdqs1003` following initial deployment, proceeding with deploy to rest of fleet |
[production] |
03:48 |
<ryankemper@deploy1001> |
Started deploy [wdqs/wdqs@27a5c54]: 0.3.54 |
[production] |
03:48 |
<ryankemper> |
About to begin wdqs deploy, tests passing on canary `wdqs1003` |
[production] |
00:52 |
<brennen@deploy1001> |
Finished scap: Synchronizing to pick up i18n for [[gerrit:639505]]. Will resume moving train to group1 on Monday morning (US) (T263182) (duration: 69m 02s) |
[production] |