2021-11-18
23:47 |
<ryankemper@cumin1001> |
START - Cookbook sre.elasticsearch.rolling-operation restart with plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad plugin upgrade + restart - ryankemper@cumin1001 - T295705 |
[production] |
23:44 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=kubernetes1001.eqiad.wmnet,service=miscweb |
[production] |
23:28 |
<dzahn@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
23:27 |
<dzahn@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
22:52 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) restart with plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_codfw: codfw plugin upgrade + restart - ryankemper@cumin1001 - T295705 |
[production] |
22:48 |
<XioNoX> |
asw-b-codfw> request system power-off member 7 |
[production] |
22:44 |
<ryankemper@cumin1001> |
START - Cookbook sre.elasticsearch.rolling-operation restart with plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_codfw: codfw plugin upgrade + restart - ryankemper@cumin1001 - T295705 |
[production] |
22:28 |
<mutante> |
icinga (alert1001) - manually fix IP of mw1488.mgmt (was 0.0.0.0 is: 10.65.1.26) in /etc/icinga/objects/puppet_hosts.cfg , running puppet |
[production] |
22:06 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts thumbor1003.eqiad.wmnet |
[production] |
21:53 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts thumbor1003.eqiad.wmnet |
[production] |
21:50 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts thumbor1004.eqiad.wmnet |
[production] |
21:36 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts thumbor1004.eqiad.wmnet |
[production] |
21:31 |
<XioNoX> |
asw-b-codfw> request system power-off member 7 |
[production] |
21:30 |
<legoktm@cumin1001> |
conftool action : set/pooled=inactive; selector: name=thumbor1004.eqiad.wmnet |
[production] |
21:30 |
<legoktm@cumin1001> |
conftool action : set/pooled=inactive; selector: name=thumbor1003.eqiad.wmnet |
[production] |
21:01 |
<ejegg> |
updated payments-wiki from abb2bd9d -> d1d6f024 |
[production] |
21:00 |
<mutante> |
[puppetmaster1001:/var/run/confd-template] $ sudo rm .git-ssh*.err |
[production] |
21:00 |
<mutante> |
[puppetmaster2001:/var/run/confd-template] $ sudo rm .git-ssh*.err |
[production] |
20:57 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:53 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:52 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=phab2001-vcs.codfw.wmnet |
[production] |
20:51 |
<dcausse> |
restart blazegraph on wdqs1006 (jvm stuck) |
[production] |
20:51 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) restart with plugin upgrade (3 nodes at a time) for ElasticSearch cluster cloudelastic: cloudelastic plugin upgrade + restart - ryankemper@cumin1001 - T295705 |
[production] |
20:50 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=phab2001-vcs.codfw.wmnet |
[production] |
20:45 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=phab2001-vcs.codfw.wmnet |
[production] |
20:43 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:43 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: all wikis to 1.38.0-wmf.9 refs T293950 |
[production] |
20:39 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:31 |
<jhuneidi@deploy1002> |
Synchronized php: group1 wikis to 1.38.0-wmf.9 refs T293950 (duration: 01m 03s) |
[production] |
20:30 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.38.0-wmf.9 refs T293950 |
[production] |
20:29 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:27 |
<jhuneidi@deploy1002> |
Synchronized php-1.38.0-wmf.9/tests/phpunit/includes/page/PageStoreTest.php: Backport for T295931 (duration: 01m 03s) |
[production] |
20:25 |
<jhuneidi@deploy1002> |
Synchronized php-1.38.0-wmf.9/includes/page/PageStore.php: Backport for T295931 (duration: 01m 04s) |
[production] |
20:25 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:05 |
<ryankemper@cumin1001> |
START - Cookbook sre.elasticsearch.rolling-operation restart with plugin upgrade (3 nodes at a time) for ElasticSearch cluster cloudelastic: cloudelastic plugin upgrade + restart - ryankemper@cumin1001 - T295705 |
[production] |
20:01 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) restart with plugin upgrade (3 nodes at a time) for ElasticSearch cluster cloudelastic: cloudelastic plugin upgrade + restart - ryankemper@cumin1001 - T295705 |
[production] |
19:53 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=thumbor1004.eqiad.wmnet |
[production] |
19:52 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=thumbor1003.eqiad.wmnet |
[production] |
19:52 |
<legoktm@cumin1001> |
conftool action : set/weight=10; selector: name=thumbor1006.eqiad.wmnet |
[production] |
19:34 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
19:31 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
19:24 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=phab2001-vcs.codfw.wmnet |
[production] |
19:21 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: 4b4c0bca9aa6bceac86f40f03ad688b9d4481c58: Enable DiscussionTools automatic topic subscriptions as beta feature on most wikis (T290500) (duration: 01m 04s) |
[production] |
19:20 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
19:16 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
19:13 |
<twentyafterfour> |
upgrading php7.3 packages on phab1001 |
[production] |
19:07 |
<twentyafterfour> |
rebooting phab2001 to apply updated php and kernel packages |
[production] |
19:06 |
<cmjohnson@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-test-coord1002.eqiad.wmnet with OS bullseye |
[production] |