2022-09-29
ยง
|
21:53 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1169 (T314041)', diff saved to https://phabricator.wikimedia.org/P35189 and previous config saved to /var/cache/conftool/dbconfig/20220929-215333-ladsgroup.json |
[production] |
21:53 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1169.eqiad.wmnet with reason: Maintenance |
[production] |
21:53 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1169.eqiad.wmnet with reason: Maintenance |
[production] |
21:43 |
<sukhe> |
alert1001: restart icinga |
[production] |
21:43 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:42 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
21:42 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:41 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
21:26 |
<robh@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp4045.mgmt.ulsfo.wmnet with reboot policy FORCED |
[production] |
21:21 |
<robh@cumin2002> |
START - Cookbook sre.hosts.provision for host cp4045.mgmt.ulsfo.wmnet with reboot policy FORCED |
[production] |
21:18 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:18 |
<ejegg> |
payments-wiki upgraded from 839d6dde to aeee9676 |
[production] |
21:14 |
<robh@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
21:14 |
<brennen> |
end of utc late backport and config window |
[production] |
21:14 |
<brennen@deploy1002> |
Finished scap: Backport for [[gerrit:836719|cirrus: Don't configure cloud clusters for private wikis]] (duration: 08m 22s) |
[production] |
21:10 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:09 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
21:09 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:08 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
21:06 |
<brennen@deploy1002> |
brennen and ebernhardson: Backport for [[gerrit:836719|cirrus: Don't configure cloud clusters for private wikis]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
21:05 |
<brennen@deploy1002> |
Started scap: Backport for [[gerrit:836719|cirrus: Don't configure cloud clusters for private wikis]] |
[production] |
21:03 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:02 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
21:02 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:01 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
20:59 |
<ryankemper> |
T313431 Repooled `elastic[2073-2074,2080-2081,2083,2086].codfw.wmnet`. Codfw's all on 5 masters now and cluster is back to green. |
[production] |
20:58 |
<brennen@deploy1002> |
Sync cancelled. |
[production] |
20:58 |
<brennen@deploy1002> |
brennen and trainbranchbot: Backport for [[gerrit:836928|Revert "cirrus: Don't configure cloud clusters for private wikis"]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet |
[production] |
20:58 |
<ryankemper> |
T313431 Updated cross-cluster seed conf with new masters; should resolve the settings check alerts |
[production] |
20:58 |
<brennen@deploy1002> |
Started scap: Backport for [[gerrit:836928|Revert "cirrus: Don't configure cloud clusters for private wikis"]] |
[production] |
20:57 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cp4027.ulsfo.wmnet |
[production] |
20:57 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:56 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:55 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
20:55 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:54 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
20:52 |
<brennen@deploy1002> |
scap failed: CalledProcessError Command '/usr/local/bin/mwscript mergeMessageFileList.php --wiki=aawiki --force-version "1.40.0-wmf.3" --list-file="/srv/mediawiki-staging/wmf-config/extension-list" --output="/tmp/tmp.gcoIZ0BTKW"' returned non-zero exit status 255. (duration: 00m 00s) |
[production] |
20:52 |
<brennen@deploy1002> |
Started scap: Backport for [[gerrit:836886|cirrus: Don't configure cloud clusters for private wikis]] |
[production] |
20:49 |
<robh@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
20:49 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:48 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
20:48 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:47 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
20:46 |
<brennen@deploy1002> |
Sync cancelled. |
[production] |
20:45 |
<brennen@deploy1002> |
brennen and trainbranchbot: Backport for [[gerrit:836922|Revert "Add Nepalese Wikipedia tagline"]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
20:45 |
<brennen@deploy1002> |
Started scap: Backport for [[gerrit:836922|Revert "Add Nepalese Wikipedia tagline"]] |
[production] |
20:45 |
<cmjohnson@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-stretch1001.eqiad.wmnet with OS bullseye |
[production] |
20:42 |
<brennen@deploy1002> |
Sync cancelled. |
[production] |
20:41 |
<brennen@deploy1002> |
brennen and jdlrobson: Backport for [[gerrit:836880|Add Nepalese Wikipedia tagline (T318737)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet |
[production] |
20:41 |
<ryankemper> |
T313431 Restarting elasticsearch_7* services on `elastic2080` to pick up new master-eligible status |
[production] |