2025-10-22
ยง
|
13:04 |
<jgleeson> |
SmashPig upgraded from aa45ee08 to 9a7e626c |
[production] |
13:03 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
13:03 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie |
[production] |
13:03 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1263 (re)pooling @ 75%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84263 and previous config saved to /var/cache/conftool/dbconfig/20251022-130320-root.json |
[production] |
13:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1184 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P84262 and previous config saved to /var/cache/conftool/dbconfig/20251022-130226-root.json |
[production] |
13:01 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
12:54 |
<reedy@deploy2002> |
Synchronized wmf-config/CommonSettings.php: T407167 (duration: 08m 29s) |
[production] |
12:53 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
12:53 |
<sukhe@cumin1003> |
cookbooks.sre.cdn.roll-reboot finished rebooting cp2027.codfw.wmnet |
[production] |
12:50 |
<sukhe@cumin1003> |
cookbooks.sre.cdn.roll-reboot finished rebooting cp2028.codfw.wmnet |
[production] |
12:48 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:48 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1263 (re)pooling @ 60%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84260 and previous config saved to /var/cache/conftool/dbconfig/20251022-124814-root.json |
[production] |
12:47 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1184 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P84259 and previous config saved to /var/cache/conftool/dbconfig/20251022-124720-root.json |
[production] |
12:45 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:45 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:41 |
<sukhe@cumin1003> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-upload_codfw and A:cp |
[production] |
12:41 |
<sukhe@cumin1003> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-text_codfw and A:cp |
[production] |
12:40 |
<cmooney@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:40 |
<cmooney@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:39 |
<oblivian@cumin1003> |
END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Rate-limit by wmfuniq - oblivian@cumin1003" |
[production] |
12:39 |
<oblivian@cumin1003> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Rate-limit by wmfuniq - oblivian@cumin1003 |
[production] |
12:38 |
<oblivian@cumin1003> |
START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Rate-limit by wmfuniq - oblivian@cumin1003 |
[production] |
12:38 |
<oblivian@cumin1003> |
START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Rate-limit by wmfuniq - oblivian@cumin1003" |
[production] |
12:38 |
<cmooney@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:38 |
<cmooney@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:37 |
<cmooney@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:33 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1263 (re)pooling @ 50%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84258 and previous config saved to /var/cache/conftool/dbconfig/20251022-123308-root.json |
[production] |
12:32 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:32 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1184 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84257 and previous config saved to /var/cache/conftool/dbconfig/20251022-123213-root.json |
[production] |
12:20 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1196 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P84256 and previous config saved to /var/cache/conftool/dbconfig/20251022-122039-root.json |
[production] |
12:19 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:19 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1263 (re)pooling @ 30%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84255 and previous config saved to /var/cache/conftool/dbconfig/20251022-121802-root.json |
[production] |
12:17 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1184 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P84254 and previous config saved to /var/cache/conftool/dbconfig/20251022-121707-root.json |
[production] |
12:11 |
<jelto@cumin1003> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Upgrade GitLab |
[production] |
12:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1184 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P84253 and previous config saved to /var/cache/conftool/dbconfig/20251022-120853-marostegui.json |
[production] |
12:08 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1184.eqiad.wmnet with reason: Maintenance |
[production] |
12:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1196 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P84252 and previous config saved to /var/cache/conftool/dbconfig/20251022-120533-root.json |
[production] |
12:03 |
<cmooney@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ssw1-d1-eqiad |
[production] |
12:03 |
<cmooney@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for ssw1-d1-eqiad |
[production] |
12:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1263 (re)pooling @ 25%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84251 and previous config saved to /var/cache/conftool/dbconfig/20251022-120256-root.json |
[production] |
11:50 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1196 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84249 and previous config saved to /var/cache/conftool/dbconfig/20251022-115027-root.json |
[production] |
11:48 |
<cmooney@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
11:47 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1263 (re)pooling @ 20%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84248 and previous config saved to /var/cache/conftool/dbconfig/20251022-114749-root.json |
[production] |
11:46 |
<cmooney@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
11:46 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2146 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P84247 and previous config saved to /var/cache/conftool/dbconfig/20251022-114629-root.json |
[production] |
11:40 |
<mvernon@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on ms-be[1089-1090].eqiad.wmnet with reason: awaiting controller swap |
[production] |
11:35 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1196 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P84246 and previous config saved to /var/cache/conftool/dbconfig/20251022-113521-root.json |
[production] |