2020-11-06
§
|
11:20 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:19 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:09 |
<moritzm> |
uploaded openjdk-8 8u272-b10-1~deb10u1 to buster-wikimedia/component/jdk |
[production] |
10:54 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:49 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:06 |
<dcausse> |
restarted elastic on elastic1063 (T265113) |
[production] |
09:57 |
<moritzm> |
installing spice security updates |
[production] |
09:32 |
<moritzm> |
installing libsndfile security updates |
[production] |
09:15 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:13 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:12 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
09:12 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:14 |
<moritzm> |
installing openldap security updates on stretch/buster (client-side tools/libs only, slapd updates already deployed) |
[production] |
04:38 |
<ryankemper> |
[Deploy finished] WDQS deploy is complete; the service is healthy per https://grafana.wikimedia.org/d/000000489/wikidata-query-service?orgId=1&from=1604633917530&to=1604637475930 |
[production] |
04:36 |
<ryankemper> |
Finished restarting wdqs categories one host at a time across all wdqs production instances |
[production] |
04:02 |
<ryankemper> |
Restarting wdqs categories one host at a time across all wdqs production instances: `sudo -E cumin -b 1 'A:wdqs-all and not A:wdqs-test' 'depool && sleep 60 && systemctl restart wdqs-categories && sleep 30 && pool'` (in progress) |
[production] |
04:01 |
<ryankemper> |
Restarted wdqs categories across test hosts: `sudo -E cumin 'A:wdqs-test' 'systemctl restart wdqs-categories'` |
[production] |
04:01 |
<ryankemper> |
Restarted wdqs updater across all hosts: `sudo -E cumin -b 4 'A:wdqs-all' 'systemctl restart wdqs-updater'` |
[production] |
04:00 |
<ryankemper> |
`query.wikidata.org` looks good following deploy, proceeding to post-deploy steps |
[production] |
03:59 |
<ryankemper@deploy1001> |
Finished deploy [wdqs/wdqs@27a5c54]: 0.3.54 (duration: 11m 22s) |
[production] |
03:51 |
<ryankemper> |
Tests passing on canary `wdqs1003` following initial deployment, proceeding with deploy to rest of fleet |
[production] |
03:48 |
<ryankemper@deploy1001> |
Started deploy [wdqs/wdqs@27a5c54]: 0.3.54 |
[production] |
03:48 |
<ryankemper> |
About to begin wdqs deploy, tests passing on canary `wdqs1003` |
[production] |
00:52 |
<brennen@deploy1001> |
Finished scap: Synchronizing to pick up i18n for [[gerrit:639505]]. Will resume moving train to group1 on Monday morning (US) (T263182) (duration: 69m 02s) |
[production] |
2020-11-05
§
|
23:44 |
<brennen@deploy1001> |
Started scap: Synchronizing to pick up i18n for [[gerrit:639505]]. Will resume moving train to group1 on Monday morning (US) (T263182) |
[production] |
23:38 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.16/includes/media/FormatMetadata.php: Backport: [[gerrit:639505|media: Support GPSAltitudeRef exif tag - FormatMetData.php (T267370)]] (duration: 07m 22s) |
[production] |
23:29 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.16/languages/i18n/exif: Backport: [[gerrit:639505|media: Support GPSAltitudeRef exif tag - i18n/exif files (T267370)]] (duration: 01m 08s) |
[production] |
23:09 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.16/vendor: Backport: [[gerrit:639504|Bump wikimedia/parsoid to 0.13.0-a16 (T267146)]] (duration: 01m 14s) |
[production] |
20:54 |
<hnowlan> |
reenabled tilerator in eqiad |
[production] |
20:47 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: Revert group1 wikis to 1.36.0-wmf.14 |
[production] |
20:44 |
<brennen@deploy1001> |
Synchronized php: group1 wikis to 1.36.0-wmf.16 (duration: 01m 39s) |
[production] |
20:42 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.36.0-wmf.16 |
[production] |
20:39 |
<hnowlan> |
finished removenode of maps2002 cassandra |
[production] |
20:22 |
<brennen> |
train: waiting ~15 minutes before rolling forward to group1. |
[production] |
20:19 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.36.0-wmf.16 |
[production] |
20:15 |
<brennen@deploy1001> |
Synchronized php-1.36.0-wmf.16/extensions/CentralAuth/includes/specials/SpecialCentralAuth.php: Backport: [[gerrit:639500|Dont double-format numeric edit count (T267362)]] (duration: 01m 06s) |
[production] |
19:44 |
<Urbanecm> |
Morning B&C window done |
[production] |
19:44 |
<urbanecm@deploy1001> |
Synchronized php-1.36.0-wmf.16/extensions/GrowthExperiments/modules/homepage/: 81cb1c7b141d49d7fc931fdc13ffd1b48b3a25ab: Suggested edits: Export task count from start editing dialog (T266868; T263040) (duration: 01m 07s) |
[production] |
19:16 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: 453b9c64c44a256eafdfafe7a0023484377bbbd2: Fix DiscussionTools wikis config for thwiki/tgwiki (T266303) (duration: 01m 08s) |
[production] |
18:32 |
<razzi> |
shutting down kafka-jumbo1005 to allow dcops to upgrade NIC |
[production] |
17:52 |
<akosiaris> |
restart uwsgi-ores in all ores1* nodes per complaint on IRC that max redis clients have been reached T263910 |
[production] |
17:51 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: Revert group0 wikis to 1.36.0-wmf.14 |
[production] |
17:48 |
<razzi> |
shutting down kafka-jumbo1004 to allow dcops to upgrade NIC |
[production] |
17:46 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.36.0-wmf.16 |
[production] |
17:41 |
<brennen> |
train is currently unblocked; rolling to group0 (T263182) |
[production] |
17:33 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:33 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
17:32 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |