2022-03-21
ยง
|
22:26 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.UPGRADE (1 nodes at a time) for ElasticSearch cluster relforge: relforge cluster restart - bking@cumin1001 - T301955 |
[production] |
22:04 |
<reedy@deploy1002> |
Synchronized php-1.39.0-wmf.2/extensions/OATHAuth/: T304350 (duration: 00m 49s) |
[production] |
22:03 |
<reedy@deploy1002> |
Synchronized php-1.39.0-wmf.1/extensions/OATHAuth/: T304350 (duration: 00m 49s) |
[production] |
21:59 |
<ryankemper> |
T301955 Downtimed relforge for 2 days; stuck in yellow status during upgrade b/c replica shards cannot be scheduled to a host of lower elasticsearch version than primary shards. Working on patch for our `rolling-operation` cookbook to disable replication during operation |
[production] |
21:46 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/zotero: apply |
[production] |
21:46 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/zotero: apply |
[production] |
21:46 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/toolhub: apply |
[production] |
21:45 |
<bking@cumin1001> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.UPGRADE (1 nodes at a time) for ElasticSearch cluster relforge: relforge cluster restart - bking@cumin1001 - T301955 |
[production] |
21:45 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/toolhub: apply |
[production] |
21:45 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/shellbox-media: apply |
[production] |
21:44 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/shellbox-media: apply |
[production] |
21:44 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
21:43 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
21:43 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/proton: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/proton: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mobileapps: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mathoid: apply |
[production] |
21:40 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mathoid: apply |
[production] |
21:36 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventstreams-internal: apply |
[production] |
21:33 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventstreams: apply |
[production] |
21:33 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply |
[production] |
21:31 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-main: apply |
[production] |
21:31 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply |
[production] |
21:30 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply |
[production] |
21:30 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics: apply |
[production] |
21:28 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics: apply |
[production] |
21:28 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: apply |
[production] |
21:27 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: apply |
[production] |
21:27 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
21:26 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
21:25 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/blubberoid: apply |
[production] |
21:25 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/blubberoid: apply |
[production] |
21:25 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/apertium: apply |
[production] |
21:24 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/apertium: apply |
[production] |
21:10 |
<pt1979@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye |
[production] |
21:03 |
<pt1979@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudvirt1024.eqiad.wmnet with OS bullseye |
[production] |
20:56 |
<dduvall@deploy1002> |
rebuilt and synchronized wikiversions files: all wikis to 1.39.0-wmf.2 refs T300203 |
[production] |
20:52 |
<pt1979@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye |
[production] |
20:49 |
<dduvall@deploy1002> |
Synchronized php: group1 wikis to 1.39.0-wmf.2 refs T300203 (duration: 00m 51s) |
[production] |
20:49 |
<dduvall@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.39.0-wmf.2 refs T300203 |
[production] |
20:31 |
<urbanecm> |
UTC late backport window completed |
[production] |
20:29 |
<urbanecm@deploy1002> |
Synchronized docroot/noc/db.php: 3bcccdc: Migrate away from $wmfDbconfigFromEtcd (T45956; 2/2) (duration: 00m 50s) |
[production] |
20:28 |
<urbanecm@deploy1002> |
Synchronized wmf-config/etcd.php: 3bcccdc: Migrate away from $wmfDbconfigFromEtcd (T45956; 1/2) (duration: 00m 50s) |
[production] |
20:19 |
<urbanecm@deploy1002> |
Synchronized wmf-config/CommonSettings.php: 8347de5: ExtensionDistributor: Add REL1_38 (T304185) (duration: 00m 51s) |
[production] |
19:48 |
<brennen> |
mw1416: sudo -i /usr/local/sbin/restart-php7.2-fpm |
[production] |
19:42 |
<brennen@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.39.0-wmf.2 refs T300203 |
[production] |
19:26 |
<brennen@deploy1002> |
Finished scap: testwikis wikis to 1.39.0-wmf.2 refs T300203 (duration: 64m 33s) |
[production] |
18:54 |
<ebernhardson> |
T303548 start commonswiki reindexing on eqiad codfw and cloudelastic cirrus clusters |
[production] |
18:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T298557)', diff saved to https://phabricator.wikimedia.org/P22906 and previous config saved to /var/cache/conftool/dbconfig/20220321-185042-marostegui.json |
[production] |