2022-03-22
06:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1143', diff saved to https://phabricator.wikimedia.org/P22917 and previous config saved to /var/cache/conftool/dbconfig/20220322-061212-marostegui.json |
[production] |
05:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1143 (T300775)', diff saved to https://phabricator.wikimedia.org/P22916 and previous config saved to /var/cache/conftool/dbconfig/20220322-055707-marostegui.json |
[production] |
05:56 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: host reimage |
[production] |
05:53 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1175.eqiad.wmnet with reason: host reimage |
[production] |
05:43 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance |
[production] |
05:43 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance |
[production] |
05:41 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.reimage for host db1175.eqiad.wmnet with OS bullseye |
[production] |
03:47 |
<eileen> |
civicrm revision changed from 457adec4 to b6ceb722 |
[production] |
02:56 |
<eileen> |
civicrm revision changed from 30c55f51 to 457adec4 |
[production] |
02:56 |
<eileen> |
revision changed from 30c55f51 to 457adec4 |
[production] |
02:16 |
<pt1979@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye |
[production] |
02:03 |
<cmjohnson@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1047.eqiad.wmnet with OS bullseye |
[production] |
01:35 |
<cmjohnson@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1047.eqiad.wmnet with OS bullseye |
[production] |
00:35 |
<pt1979@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1024.eqiad.wmnet with OS bullseye |
[production] |
2022-03-21
23:52 |
<eileen> |
civicrm revision changed from 52c45874 to 30c55f51 |
[production] |
22:29 |
<ryankemper> |
T301955 Lifted downtime on relforge now that cluster upgrade is complete and cluster is back to green status |
[production] |
22:26 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.UPGRADE (1 nodes at a time) for ElasticSearch cluster relforge: relforge cluster restart - bking@cumin1001 - T301955 |
[production] |
22:04 |
<reedy@deploy1002> |
Synchronized php-1.39.0-wmf.2/extensions/OATHAuth/: T304350 (duration: 00m 49s) |
[production] |
22:03 |
<reedy@deploy1002> |
Synchronized php-1.39.0-wmf.1/extensions/OATHAuth/: T304350 (duration: 00m 49s) |
[production] |
21:59 |
<ryankemper> |
T301955 Downtimed relforge for 2 days; stuck in yellow status during upgrade b/c replica shards cannot be scheduled to a host of lower elasticsearch version than primary shards. Working on patch for our `rolling-operation` cookbook to disable replication during operation |
[production] |
21:46 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/zotero: apply |
[production] |
21:46 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/zotero: apply |
[production] |
21:46 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/toolhub: apply |
[production] |
21:45 |
<bking@cumin1001> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.UPGRADE (1 nodes at a time) for ElasticSearch cluster relforge: relforge cluster restart - bking@cumin1001 - T301955 |
[production] |
21:45 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/toolhub: apply |
[production] |
21:45 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/shellbox-media: apply |
[production] |
21:44 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/shellbox-media: apply |
[production] |
21:44 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
21:43 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
21:43 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/proton: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/proton: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mobileapps: apply |
[production] |
21:41 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mathoid: apply |
[production] |
21:40 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mathoid: apply |
[production] |
21:36 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventstreams-internal: apply |
[production] |
21:33 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventstreams: apply |
[production] |
21:33 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply |
[production] |
21:31 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-main: apply |
[production] |
21:31 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply |
[production] |
21:30 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply |
[production] |
21:30 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics: apply |
[production] |
21:28 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics: apply |
[production] |
21:28 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: apply |
[production] |
21:27 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: apply |
[production] |
21:27 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
21:26 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
21:25 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/blubberoid: apply |
[production] |
21:25 |
<rzl@deploy1002> |
helmfile [eqiad] START helmfile.d/services/blubberoid: apply |
[production] |
21:25 |
<rzl@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/apertium: apply |
[production] |