2021-07-28
ยง
|
23:50 |
<thcipriani@deploy1002> |
Synchronized wmf-config: Config: [[gerrit:708158|Disable mobile contributions simplifications on Wikidata and Commons (T283988)]] (duration: 01m 58s) |
[production] |
19:16 |
<twentyafterfour@deploy1002> |
Synchronized php: group1 wikis to 1.37.0-wmf.16 refs T281157 (duration: 01m 06s) |
[production] |
19:15 |
<twentyafterfour@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.37.0-wmf.16 refs T281157 |
[production] |
19:09 |
<twentyafterfour> |
Preparing to deploy 1.37.0-wmf.16 to group1 wikis |
[production] |
18:57 |
<legoktm> |
mwmaint2002$ foreachwikiindblist wikimania refreshLinks.php - to start populating DPL tracking category |
[production] |
18:36 |
<legoktm@deploy1002> |
Finished scap: Add a tracking category to pages using the <DynamicPageList> tag (duration: 27m 16s) |
[production] |
18:14 |
<jbond> |
manually cleared out the puppetdb2002 queue |
[production] |
18:08 |
<legoktm@deploy1002> |
Started scap: Add a tracking category to pages using the <DynamicPageList> tag |
[production] |
16:37 |
<ryankemper> |
[WDQS Deploy] Deploy complete. Successful test query placed on query.wikidata.org, there's no relevant criticals in Icinga, and Grafana looks good |
[production] |
16:00 |
<ryankemper> |
[WDQS Deploy] Restarting `wdqs-categories` across lvs-managed hosts, one node at a time: `sudo -E cumin -b 1 'A:wdqs-all and not A:wdqs-test' 'depool && sleep 45 && systemctl restart wdqs-categories && sleep 45 && pool'` |
[production] |
15:59 |
<ryankemper> |
[WDQS Deploy] Restarted `wdqs-categories` across all test hosts simultaneously: `sudo -E cumin 'A:wdqs-test' 'systemctl restart wdqs-categories'` |
[production] |
15:59 |
<ryankemper> |
[WDQS Deploy] Restarted `wdqs-updater` across all hosts, 4 hosts at a time: `sudo -E cumin -b 4 'A:wdqs-all' 'systemctl restart wdqs-updater'` |
[production] |
15:58 |
<ryankemper> |
T287112 [WDQS] Re-pooled `wdqs2002` |
[production] |
15:57 |
<ryankemper@deploy1002> |
Finished deploy [wdqs/wdqs@26273d8]: 0.3.77 (duration: 08m 55s) |
[production] |
15:53 |
<mutante> |
mw1434,mw1435,mw1436: scap pull, repooled, reimaged, converted from API to appserver for balancing (T279309) |
[production] |
15:53 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw143[4-6].eqiad.wmnet |
[production] |
15:52 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw143[4-6].eqiad.wmnet |
[production] |
15:51 |
<ryankemper> |
[WDQS Deploy] Tests passing following deploy of `0.3.77` on canary `wdqs1003`; proceeding to rest of fleet |
[production] |
15:48 |
<ryankemper@deploy1002> |
Started deploy [wdqs/wdqs@26273d8]: 0.3.77 |
[production] |
15:47 |
<ryankemper> |
[WDQS Deploy] Gearing up for deploy of wdqs `0.3.77`. Pre-deploy tests passing on canary `wdqs1003` |
[production] |
15:47 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
15:08 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
15:05 |
<jmm@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
14:58 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1434.eqiad.wmnet with reason: REIMAGE |
[production] |
14:56 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1434.eqiad.wmnet with reason: REIMAGE |
[production] |
14:39 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
14:33 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mw1434.eqiad.wmnet with reason: known issue |
[production] |
14:33 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on mw1434.eqiad.wmnet with reason: known issue |
[production] |
14:19 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
14:06 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1436.eqiad.wmnet with reason: REIMAGE |
[production] |
14:06 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
14:06 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
14:06 |
<dcausse@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
14:04 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1435.eqiad.wmnet with reason: REIMAGE |
[production] |
14:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1436.eqiad.wmnet with reason: REIMAGE |
[production] |
14:01 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1435.eqiad.wmnet with reason: REIMAGE |
[production] |
13:32 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw143[4-6].eqiad.wmnet |
[production] |
13:29 |
<moritzm> |
installing python2.7 security updates on stretch |
[production] |
13:08 |
<moritzm> |
installing python3.5 security updates on stretch |
[production] |
12:27 |
<dcausse@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
11:26 |
<moritzm> |
installing nginx security updates on thumbor* |
[production] |
11:18 |
<moritzm> |
installing nginx security updates on sodium (mirrors.wikimedia.org) |
[production] |
11:03 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 8:00:00 on planet1002.eqiad.wmnet with reason: known issue |
[production] |
11:03 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 5 days, 8:00:00 on planet1002.eqiad.wmnet with reason: known issue |
[production] |
10:11 |
<moritzm> |
installing remaining nginx security updates on stretch |
[production] |
10:09 |
<godog> |
temp fix prometheus-icinga-am on alert1001 |
[production] |
09:40 |
<dcausse@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
09:40 |
<urbanecm> |
Start server-side upload for 1 video file (T287482) |
[production] |
09:29 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
09:29 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |