2020-05-13
§
|
05:33 |
<ryankemper> |
wdqs2004 was depooled ~3 hours ago and was re-pooled ~10 mins ago after verifying the wdqs service was healthy |
[production] |
05:32 |
<ryankemper> |
wdqs1003 was depooled ~6 hours ago and was re-pooled ~10 mins ago after verifying the wdqs service was healthy |
[production] |
05:27 |
<_joe_> |
restarting php-fpm on mw1374, children dying with SIGILL |
[production] |
05:11 |
<root@cumin1001> |
END (PASS) - Cookbook sre.hosts.ipmi-password-reset (exit_code=0) |
[production] |
05:11 |
<root@cumin1001> |
Updating IPMI password on 1 hosts - root@cumin1001 |
[production] |
05:10 |
<root@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
05:10 |
<root@cumin1001> |
END (FAIL) - Cookbook sre.hosts.ipmi-password-reset (exit_code=99) |
[production] |
05:10 |
<root@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
04:52 |
<kart_> |
Updated cxserver to 2020-05-11-082207-production (T250004) |
[production] |
04:47 |
<kartik@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
04:44 |
<kartik@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
04:42 |
<kartik@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
02:27 |
<ryankemper@cumin2001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
02:10 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
00:43 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
00:33 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
00:04 |
<Jayprakash12345> |
Sort contests by their descending order of creation date (T252608) |
[tools.indic-wscontest] |
2020-05-12
§
|
23:09 |
<ryankemper@cumin2001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
23:06 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
22:23 |
<marxarelli> |
reloading zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/594823 |
[releng] |
22:14 |
<marxarelli> |
created 4 jenkins jobs with jjb for https://gerrit.wikimedia.org/r/c/integration/config/+/594823 |
[releng] |
20:33 |
<andrewbogott> |
moving cloudvirt1023 to the 'standard' pool and out of the 'spare' pool |
[admin] |
20:15 |
<hashar@deploy1001> |
Synchronized php-1.35.0-wmf.32/includes/revisionlist/RevisionItemBase.php: Fix RevisionItemBase::getId to actually return an int, as intended - T252076 (duration: 01m 06s) |
[production] |
20:10 |
<cdanis> |
reverting to standard php7.2-fpm on deployment-mediawiki-07 |
[releng] |
19:55 |
<dpifke@deploy1001> |
Finished deploy [performance/navtiming@48110b9]: Fixes swapped dc/host labels - T238086 (duration: 00m 05s) |
[production] |
19:55 |
<dpifke@deploy1001> |
Started deploy [performance/navtiming@48110b9]: Fixes swapped dc/host labels - T238086 |
[production] |
19:34 |
<cdanis> |
installing patched version of php7.2-fpm on deployment-mediawiki-07 |
[releng] |
19:29 |
<Krinkle> |
cs access #cvn-wp-en-newpages add Galendalia voiced |
[cvn] |
19:29 |
<Krinkle> |
cs access #cvn-wp-en add Galendalia voiced |
[cvn] |
19:10 |
<jeh> |
disable neutron-openvswitch-agent service on cloudvirt2001-dev.codfw T248881 |
[admin] |
19:09 |
<jeh> |
Shutdown the unused eno2 network interface on cloudvirt2001-dev.codfw to clear up monitoring errors T248425 |
[admin] |
19:05 |
<hashar@deploy1001> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.35.0-wmf.32 |
[production] |
18:44 |
<brennen> |
cloning all (public) wmf repos from gerrit replica to a local system for number crunching |
[releng] |
18:41 |
<legoktm> |
started codereview-archiver script in screen on mwmaint1002 |
[production] |
18:35 |
<bstorm_> |
upgraded to using typha and rolled back to not doing so -- no affect on existing network T250863 |
[toolsbeta] |
18:23 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
18:23 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:20 |
<andrewbogott> |
moving cloudvirt1024 out of the 'maintenance' aggregate and into 'spare' |
[admin] |
18:17 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:17 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
18:14 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:14 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
17:49 |
<bblack> |
'gdnsdctl replace' on all authdns to load new maxmind data |
[production] |
17:44 |
<bstorm_> |
set the calico version to v3.14.0 because the new liveness probe isn't compatible with the old version. T250863 |
[toolsbeta] |
17:43 |
<bblack> |
updating maxmind database on puppetmasters (usually automated weekly; we're mid-cycle) |
[production] |
17:35 |
<bstorm_> |
deployed an updated bit of yaml for calico without upgrading the version first T250863 |
[toolsbeta] |
17:10 |
<James_F> |
Running AbuseFilter updateVarDumps on testwikis on mwmaint1002 T246539 |
[production] |
16:55 |
<James_F> |
Running AbuseFilter updateVarDumps on closed wikis on mwmaint1002 T246539 |
[production] |
16:55 |
<mstyles@deploy1001> |
Finished deploy [wdqs/wdqs@f617307]: v0.3.31 (duration: 14m 53s) |
[production] |
16:45 |
<andrewbogott> |
restarting neutron-l3-agent on cloudnet1004 so it knows about all three cloudcontrols. Leaving cloudnet1003 since restarting it there will cause network interruptions |
[admin] |