2025-08-26
ยง
|
18:28 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2217.codfw.wmnet with reason: Maintenance |
[production] |
18:28 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214 (T401906)', diff saved to https://phabricator.wikimedia.org/P81795 and previous config saved to /var/cache/conftool/dbconfig/20250826-182842-fceratto.json |
[production] |
18:13 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214', diff saved to https://phabricator.wikimedia.org/P81794 and previous config saved to /var/cache/conftool/dbconfig/20250826-181334-fceratto.json |
[production] |
17:58 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214', diff saved to https://phabricator.wikimedia.org/P81793 and previous config saved to /var/cache/conftool/dbconfig/20250826-175827-fceratto.json |
[production] |
17:44 |
<dzahn@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on people1005.eqiad.wmnet with reason: T402596 |
[production] |
17:43 |
<ammarpad@deploy1003> |
mwscript-k8s job started: extensions/Translate/scripts/moveTranslatableBundle.php --wiki=mediawikiwiki 'API:Main page' 'API:Action API' Ammarpad '--reason=per [[:phab:T402800]]' # T402800 |
[production] |
17:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2214 (T401906)', diff saved to https://phabricator.wikimedia.org/P81791 and previous config saved to /var/cache/conftool/dbconfig/20250826-174319-fceratto.json |
[production] |
17:42 |
<sfaci@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply |
[production] |
17:42 |
<sfaci@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply |
[production] |
17:41 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2214 (T401906)', diff saved to https://phabricator.wikimedia.org/P81790 and previous config saved to /var/cache/conftool/dbconfig/20250826-174106-fceratto.json |
[production] |
17:40 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2214.codfw.wmnet with reason: Maintenance |
[production] |
17:40 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2197.codfw.wmnet with reason: Maintenance |
[production] |
17:40 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2193 (T401906)', diff saved to https://phabricator.wikimedia.org/P81789 and previous config saved to /var/cache/conftool/dbconfig/20250826-174023-fceratto.json |
[production] |
17:25 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2193', diff saved to https://phabricator.wikimedia.org/P81788 and previous config saved to /var/cache/conftool/dbconfig/20250826-172516-fceratto.json |
[production] |
17:19 |
<swfrench@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1171703|image-suggestion: cleanup unused refs to service listener (T368096)]] (duration: 12m 15s) |
[production] |
17:13 |
<swfrench@deploy1003> |
eevans, swfrench: Continuing with sync |
[production] |
17:12 |
<swfrench@deploy1003> |
eevans, swfrench: Backport for [[gerrit:1171703|image-suggestion: cleanup unused refs to service listener (T368096)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
17:10 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2193', diff saved to https://phabricator.wikimedia.org/P81787 and previous config saved to /var/cache/conftool/dbconfig/20250826-171008-fceratto.json |
[production] |
17:06 |
<swfrench@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1171703|image-suggestion: cleanup unused refs to service listener (T368096)]] |
[production] |
17:03 |
<robh@cumin2002> |
END (ERROR) - Cookbook sre.hardware.upgrade-firmware (exit_code=97) upgrade firmware for hosts cp2042.codfw.wmnet |
[production] |
17:02 |
<robh@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cp2042.codfw.wmnet |
[production] |
17:00 |
<robh@cumin2002> |
END (ERROR) - Cookbook sre.hardware.upgrade-firmware (exit_code=97) upgrade firmware for hosts cp2042.codfw.wmnet |
[production] |
16:59 |
<robh@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cp2042.codfw.wmnet |
[production] |
16:55 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2193 (T401906)', diff saved to https://phabricator.wikimedia.org/P81786 and previous config saved to /var/cache/conftool/dbconfig/20250826-165501-fceratto.json |
[production] |
16:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2193 (T401906)', diff saved to https://phabricator.wikimedia.org/P81785 and previous config saved to /var/cache/conftool/dbconfig/20250826-165248-fceratto.json |
[production] |
16:52 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2193.codfw.wmnet with reason: Maintenance |
[production] |
16:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2180 (T401906)', diff saved to https://phabricator.wikimedia.org/P81784 and previous config saved to /var/cache/conftool/dbconfig/20250826-165226-fceratto.json |
[production] |
16:37 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P81783 and previous config saved to /var/cache/conftool/dbconfig/20250826-163718-fceratto.json |
[production] |
16:22 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P81782 and previous config saved to /var/cache/conftool/dbconfig/20250826-162211-fceratto.json |
[production] |
16:19 |
<mutante> |
phabricator - added FCeratto-WMF to acl*sre-team |
[production] |
16:17 |
<moritzm> |
installing libxslt security updates |
[production] |
16:12 |
<swfrench@cumin2002> |
END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-codfw (T352245) |
[production] |
16:11 |
<swfrench@cumin2002> |
START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-codfw (T352245) |
[production] |
16:07 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2180 (T401906)', diff saved to https://phabricator.wikimedia.org/P81781 and previous config saved to /var/cache/conftool/dbconfig/20250826-160703-fceratto.json |
[production] |
16:04 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2180 (T401906)', diff saved to https://phabricator.wikimedia.org/P81780 and previous config saved to /var/cache/conftool/dbconfig/20250826-160451-fceratto.json |
[production] |
16:04 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2180.codfw.wmnet with reason: Maintenance |
[production] |
16:04 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2169 (T401906)', diff saved to https://phabricator.wikimedia.org/P81779 and previous config saved to /var/cache/conftool/dbconfig/20250826-160427-fceratto.json |
[production] |
15:49 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2169', diff saved to https://phabricator.wikimedia.org/P81778 and previous config saved to /var/cache/conftool/dbconfig/20250826-154920-fceratto.json |
[production] |
15:46 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host thanos-be2005.codfw.wmnet with OS bullseye |
[production] |
15:43 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2082.codfw.wmnet with OS bullseye |
[production] |
15:40 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2081.codfw.wmnet with OS bullseye |
[production] |
15:34 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2169', diff saved to https://phabricator.wikimedia.org/P81777 and previous config saved to /var/cache/conftool/dbconfig/20250826-153412-fceratto.json |
[production] |
15:28 |
<swfrench-wmf> |
finished restart of all codfw-associated confds - T352245 |
[production] |
15:28 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-be2005.codfw.wmnet with reason: host reimage |
[production] |
15:25 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2082.codfw.wmnet with reason: host reimage |
[production] |
15:22 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2081.codfw.wmnet with reason: host reimage |
[production] |
15:19 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2169 (T401906)', diff saved to https://phabricator.wikimedia.org/P81776 and previous config saved to /var/cache/conftool/dbconfig/20250826-151905-fceratto.json |
[production] |
15:18 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-be2005.codfw.wmnet with reason: host reimage |
[production] |
15:17 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2082.codfw.wmnet with reason: host reimage |
[production] |
15:17 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2081.codfw.wmnet with reason: host reimage |
[production] |