2020-05-12
|
23:09 |
<ryankemper@cumin2001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
23:06 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
22:23 |
<marxarelli> |
reloading zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/594823 |
[releng] |
22:14 |
<marxarelli> |
created 4 jenkins jobs with jjb for https://gerrit.wikimedia.org/r/c/integration/config/+/594823 |
[releng] |
20:33 |
<andrewbogott> |
moving cloudvirt1023 to the 'standard' pool and out of the 'spare' pool |
[admin] |
20:15 |
<hashar@deploy1001> |
Synchronized php-1.35.0-wmf.32/includes/revisionlist/RevisionItemBase.php: Fix RevisionItemBase::getId to actually return an int, as intended - T252076 (duration: 01m 06s) |
[production] |
20:10 |
<cdanis> |
reverting to standard php7.2-fpm on deployment-mediawiki-07 |
[releng] |
19:55 |
<dpifke@deploy1001> |
Finished deploy [performance/navtiming@48110b9]: Fixes swapped dc/host labels - T238086 (duration: 00m 05s) |
[production] |
19:55 |
<dpifke@deploy1001> |
Started deploy [performance/navtiming@48110b9]: Fixes swapped dc/host labels - T238086 |
[production] |
19:34 |
<cdanis> |
installing patched version of php7.2-fpm on deployment-mediawiki-07 |
[releng] |
19:29 |
<Krinkle> |
cs access #cvn-wp-en-newpages add Galendalia voiced |
[cvn] |
19:29 |
<Krinkle> |
cs access #cvn-wp-en add Galendalia voiced |
[cvn] |
19:10 |
<jeh> |
disable neutron-openvswitch-agent service on cloudvirt2001-dev.codfw T248881 |
[admin] |
19:09 |
<jeh> |
Shut down the unused eno2 network interface on cloudvirt2001-dev.codfw to clear up monitoring errors T248425 |
[admin] |
19:05 |
<hashar@deploy1001> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.35.0-wmf.32 |
[production] |
18:44 |
<brennen> |
cloning all (public) wmf repos from gerrit replica to a local system for number crunching |
[releng] |
18:41 |
<legoktm> |
started codereview-archiver script in screen on mwmaint1002 |
[production] |
18:35 |
<bstorm_> |
upgraded to using typha and rolled back to not doing so -- no effect on existing network T250863 |
[toolsbeta] |
18:23 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
18:23 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:20 |
<andrewbogott> |
moving cloudvirt1024 out of the 'maintenance' aggregate and into 'spare' |
[admin] |
18:17 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:17 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
18:14 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'canary' . |
[production] |
18:14 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventgate-analytics-external' for release 'production' . |
[production] |
17:49 |
<bblack> |
'gdnsdctl replace' on all authdns to load new maxmind data |
[production] |
17:44 |
<bstorm_> |
set the calico version to v3.14.0 because the new liveness probe isn't compatible with the old version. T250863 |
[toolsbeta] |
17:43 |
<bblack> |
updating maxmind database on puppetmasters (usually automated weekly; we're mid-cycle) |
[production] |
17:35 |
<bstorm_> |
deployed an updated bit of yaml for calico without upgrading the version first T250863 |
[toolsbeta] |
17:10 |
<James_F> |
Running AbuseFilter updateVarDumps on testwikis on mwmaint1002 T246539 |
[production] |
16:55 |
<James_F> |
Running AbuseFilter updateVarDumps on closed wikis on mwmaint1002 T246539 |
[production] |
16:55 |
<mstyles@deploy1001> |
Finished deploy [wdqs/wdqs@f617307]: v0.3.31 (duration: 14m 53s) |
[production] |
16:45 |
<andrewbogott> |
restarting neutron-l3-agent on cloudnet1004 so it knows about all three cloudcontrols. Leaving cloudnet1003 since restarting it there will cause network interruptions |
[admin] |
16:40 |
<mstyles@deploy1001> |
Started deploy [wdqs/wdqs@f617307]: v0.3.31 |
[production] |
16:35 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
16:02 |
<James_F> |
Zuul: Manually running fabric against contint1001 to add Privacybatm to the CI allow list T248256 |
[releng] |
15:54 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/595968 T248256 |
[releng] |
15:48 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:48 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:48 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:48 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:34 |
<filippo@cumin1001> |
conftool action : set/pooled=true; selector: dnsdisc=thanos-query |
[production] |