2020-12-15
07:52 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove es1019 from dbctl', diff saved to https://phabricator.wikimedia.org/P13549 and previous config saved to /var/cache/conftool/dbconfig/20201215-075220-marostegui.json |
[production] |
07:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es1019 for decommissioning', diff saved to https://phabricator.wikimedia.org/P13548 and previous config saved to /var/cache/conftool/dbconfig/20201215-074924-marostegui.json |
[production] |
07:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2131', diff saved to https://phabricator.wikimedia.org/P13547 and previous config saved to /var/cache/conftool/dbconfig/20201215-072513-marostegui.json |
[production] |
06:23 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
06:16 |
<marostegui@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
05:29 |
<kart_> |
Updated cxserver to 2020-12-12-101743-production (T268309) |
[production] |
05:23 |
<kartik@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
05:17 |
<kartik@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
05:14 |
<kartik@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
01:21 |
<legoktm> |
resubscribed to mediawiki-commits list after Google outage triggered too many bounces |
[tools.gerrit-reviewer-bot] |
00:42 |
<legoktm> |
resubscribed to mediawiki-commits list after Google outage triggered too many bounces |
[tools.forrestbot] |
00:13 |
<ejegg> |
updated payments-wiki from 63ae7413a8 to 3d3055c478 |
[production] |
2020-12-14
22:39 |
<sbassett> |
Deployed security patch for T120883 (v8) to wmf.21 |
[production] |
22:17 |
<andrewbogott> |
resizing spd-test to a g2 flavor: g2.cores4.ram24576.disk300 |
[recommendation-api] |
22:06 |
<andrewbogott> |
resizing integration-docker-registry-1003 to a g2 flavor: g2.cores4.ram24576.disk300 |
[integration] |
21:05 |
<mholloway-shell@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Add analytics event stream mediawiki.mediasearch_interaction T258183 (duration: 00m 56s) |
[production] |
20:22 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2031.codfw.wmnet with reason: REIMAGE |
[production] |
20:21 |
<wm-bot> |
<lucaswerkmeister> deployed 8c9a2dfc59 (fix current_url) |
[tools.quickcategories] |
20:20 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2031.codfw.wmnet with reason: REIMAGE |
[production] |
20:11 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1031.eqiad.wmnet with reason: REIMAGE |
[production] |
20:09 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1031.eqiad.wmnet with reason: REIMAGE |
[production] |
20:07 |
<wm-bot> |
<lucaswerkmeister> deployed 9ba55b3ad3 (fix current_url) |
[tools.lexeme-forms] |
19:50 |
<effie> |
disable puppet on mc1031, mc2031 to install buster |
[production] |
19:46 |
<hashar> |
integration: deleting faulty npm cache integration-castor03 /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/mwgate-node10-docker # T270100 |
[releng] |
19:45 |
<mutante> |
mwdebug1003 - removing zero.wikimedia.org include for testing |
[production] |
19:43 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: 3b5974ff7f57d19732cd1e7f7f492b778daf6cfc: zhwikinews: Grant suppressredirect to autoconfirmed (T270023) (duration: 00m 55s) |
[production] |
19:34 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: cf36ad6e89acd71ca0bc985eb5399fecec64fc5f: hrwiki: Add draft namespace (T268740) (duration: 00m 56s) |
[production] |
19:31 |
<ppchelko@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: gerrit:649359 group1: Enable OldRevisionParserCache (duration: 00m 55s) |
[production] |
19:28 |
<ppchelko@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: gerrit:644317 Remove wgParserCacheUseJson setting (duration: 00m 56s) |
[production] |
19:27 |
<bstorm> |
create temporary instance toolsbeta-test-io-unthrottled T267966 |
[toolsbeta] |
19:25 |
<bstorm> |
created temporary instance toolsbeta-io-test-local T267966 |
[toolsbeta] |
19:24 |
<ppchelko@deploy1001> |
Synchronized php-1.36.0-wmf.21/extensions/Popups: Backport gerrit:649408 Revert Remove title attributes at init (duration: 00m 59s) |
[production] |
19:09 |
<razzi> |
restart hadoop-yarn-resourcemanager on an-master1002 to promote an-master1001 to active again |
[analytics] |
19:08 |
<razzi> |
restarted hadoop-yarn-resourcemanager on an-master1001 again by mistake |
[analytics] |
19:02 |
<razzi> |
restart hadoop-yarn-resourcemanager on an-master1002 |
[analytics] |
18:54 |
<razzi> |
restart hadoop-yarn-resourcemanager on an-master1001 |
[analytics] |
18:43 |
<razzi> |
applying yarn config change via `sudo cumin "A:hadoop-worker" "systemctl restart hadoop-yarn-nodemanager" -b 10` |
[analytics] |
18:25 |
<ryankemper> |
T269204 Restarting `wdqs-blazegraph` prometheus exporter across all wdqs instances: `sudo cumin -b 12 'P{wdqs*}' 'sudo systemctl restart prometheus-blazegraph-exporter-wdqs-blazegraph.service'` |
[production] |
18:05 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=mw1265.eqiad.wmnet |
[production] |
18:04 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@2fc439e]: Redeploy Netbox 2.8 to netbox-next T266488 p2 (duration: 00m 05s) |
[production] |
18:04 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@2fc439e]: Redeploy Netbox 2.8 to netbox-next T266488 p2 |
[production] |
18:04 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@2fc439e]: Redeploy Netbox 2.8 to netbox-next T266488 p1 (duration: 00m 33s) |
[production] |
18:03 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@2fc439e]: Redeploy Netbox 2.8 to netbox-next T266488 p1 |
[production] |
17:59 |
<hnowlan> |
depooled mw1265 for reimaging |
[production] |
17:41 |
<dcaro> |
The removal freed ~12GB (still 100% usage :S) (T269419) |
[admin] |
17:36 |
<dcaro> |
removing invalid backups that have a valid copy (T269419) |
[admin] |
16:55 |
<jayme@deploy1001> |
helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
16:55 |
<jayme@deploy1001> |
helmfile [staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
15:43 |
<dcaro> |
Merging the tagging for vm backups (T267195) |
[admin] |
14:58 |
<elukey> |
stat1004's krb credential cache moved under /run (shared between notebooks and ssh/bash) - T255262 |
[analytics] |