2020-05-13
§
|
07:29 |
<godog> |
roll-restart logstash in codfw/eqiad for configuration change |
[production] |
07:14 |
<elukey> |
upload spark2_2.4.4-bin-hadoop2.6-2 for buster/stretch on apt1001 |
[production] |
05:33 |
<ryankemper> |
wdqs2004 was depooled ~3 hours ago and was re-pooled ~10 mins ago after verifying the wdqs service was healthy |
[production] |
05:32 |
<ryankemper> |
wdqs1003 was depooled ~6 hours ago and was re-pooled ~10 mins ago after verifying the wdqs service was healthy |
[production] |
05:27 |
<_joe_> |
restarting php-fpm on mw1374, children dying with SIGILL |
[production] |
05:11 |
<root@cumin1001> |
END (PASS) - Cookbook sre.hosts.ipmi-password-reset (exit_code=0) |
[production] |
05:11 |
<root@cumin1001> |
Updating IPMI password on 1 hosts - root@cumin1001 |
[production] |
05:10 |
<root@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
05:10 |
<root@cumin1001> |
END (FAIL) - Cookbook sre.hosts.ipmi-password-reset (exit_code=99) |
[production] |
05:10 |
<root@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
04:52 |
<kart_> |
Updated cxserver to 2020-05-11-082207-production (T250004) |
[production] |
04:47 |
<kartik@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
04:44 |
<kartik@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
04:42 |
<kartik@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
02:27 |
<ryankemper@cumin2001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
02:10 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
00:43 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
00:33 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
2020-05-12
§
|
23:09 |
<ryankemper@cumin2001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
23:06 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
20:15 |
<hashar@deploy1001> |
Synchronized php-1.35.0-wmf.32/includes/revisionlist/RevisionItemBase.php: Fix RevisionItemBase::getId to actually return an int, as intended - T252076 (duration: 01m 06s) |
[production] |
19:55 |
<dpifke@deploy1001> |
Finished deploy [performance/navtiming@48110b9]: Fixes swapped dc/host labels - T238086 (duration: 00m 05s) |
[production] |
19:55 |
<dpifke@deploy1001> |
Started deploy [performance/navtiming@48110b9]: Fixes swapped dc/host labels - T238086 |
[production] |
19:05 |
<hashar@deploy1001> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.35.0-wmf.32 |
[production] |
18:41 |
<legoktm> |
started codereview-archiver script in screen on mwmaint1002 |
[production] |
18:23 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
18:23 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:17 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:17 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
18:14 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:14 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
17:49 |
<bblack> |
'gdnsdctl replace' on all authdns to load new maxmind data |
[production] |
17:43 |
<bblack> |
updating maxmind database on puppetmasters (usually automated weekly; we're mid-cycle) |
[production] |
17:10 |
<James_F> |
Running AbuseFilter updateVarDumps on testwikis on mwmaint1002 T246539 |
[production] |
16:55 |
<James_F> |
Running AbuseFilter updateVarDumps on closed wikis on mwmaint1002 T246539 |
[production] |
16:55 |
<mstyles@deploy1001> |
Finished deploy [wdqs/wdqs@f617307]: v0.3.31 (duration: 14m 53s) |
[production] |
16:40 |
<mstyles@deploy1001> |
Started deploy [wdqs/wdqs@f617307]: v0.3.31 |
[production] |
16:35 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
15:48 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:48 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:48 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:48 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:34 |
<filippo@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=thanos-query |
[production] |
15:15 |
<akosiaris@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
15:15 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
15:14 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
15:13 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:13 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:12 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:12 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |