2025-08-04
§
|
20:09 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2221.codfw.wmnet with reason: Maintenance |
[production] |
20:09 |
<krinkle@deploy1003> |
krinkle: Continuing with sync |
[production] |
20:09 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2218 (T400854)', diff saved to https://phabricator.wikimedia.org/P80767 and previous config saved to /var/cache/conftool/dbconfig/20250804-200938-ladsgroup.json |
[production] |
20:07 |
<krinkle@deploy1003> |
krinkle: Backport for [[gerrit:1161757|Set wgCentralBannerRecorder to /beacon/… instead of //example.org/beacon/… (T400586)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:06 |
<krinkle@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1161757|Set wgCentralBannerRecorder to /beacon/… instead of //example.org/beacon/… (T400586)]] |
[production] |
20:05 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
20:04 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1043.eqiad.wmnet with OS bullseye |
[production] |
20:02 |
<otto@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply |
[production] |
20:01 |
<otto@deploy1003> |
helmfile [eqiad] START helmfile.d/services/eventgate-main: apply |
[production] |
20:01 |
<otto@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/eventgate-main: apply |
[production] |
20:01 |
<otto@deploy1003> |
helmfile [codfw] START helmfile.d/services/eventgate-main: apply |
[production] |
20:01 |
<otto@deploy1003> |
helmfile [staging] DONE helmfile.d/services/eventgate-main: apply |
[production] |
19:59 |
<otto@deploy1003> |
helmfile [staging] START helmfile.d/services/eventgate-main: apply |
[production] |
19:54 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2218', diff saved to https://phabricator.wikimedia.org/P80765 and previous config saved to /var/cache/conftool/dbconfig/20250804-195431-ladsgroup.json |
[production] |
19:50 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1043.eqiad.wmnet with OS bullseye |
[production] |
19:39 |
<rzl@deploy1003> |
mwscript-k8s job started: Version.php --wiki=urwiki # Testing --sal for T376776 |
[production] |
19:39 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2218', diff saved to https://phabricator.wikimedia.org/P80764 and previous config saved to /var/cache/conftool/dbconfig/20250804-193923-ladsgroup.json |
[production] |
19:38 |
<otto@deploy1003> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics: apply |
[production] |
19:37 |
<otto@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/eventgate-analytics: apply |
[production] |
19:36 |
<otto@deploy1003> |
helmfile [codfw] START helmfile.d/services/eventgate-analytics: apply |
[production] |
19:36 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1043.eqiad.wmnet with OS bullseye |
[production] |
19:35 |
<otto@deploy1003> |
helmfile [staging] DONE helmfile.d/services/eventgate-analytics: apply |
[production] |
19:35 |
<ottomata> |
deploying eventgate-analytics and eventgate-main to pick up meta.dt field logic change - T376026 |
[production] |
19:35 |
<otto@deploy1003> |
helmfile [staging] START helmfile.d/services/eventgate-analytics: apply |
[production] |
19:24 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2218 (T400854)', diff saved to https://phabricator.wikimedia.org/P80763 and previous config saved to /var/cache/conftool/dbconfig/20250804-192415-ladsgroup.json |
[production] |
19:21 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2218 (T400854)', diff saved to https://phabricator.wikimedia.org/P80762 and previous config saved to /var/cache/conftool/dbconfig/20250804-192129-ladsgroup.json |
[production] |
19:21 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2218.codfw.wmnet with reason: Maintenance |
[production] |
19:21 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2208 (T400854)', diff saved to https://phabricator.wikimedia.org/P80761 and previous config saved to /var/cache/conftool/dbconfig/20250804-192107-ladsgroup.json |
[production] |
19:20 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1043.eqiad.wmnet with OS bullseye |
[production] |
19:19 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1043.eqiad.wmnet with OS bullseye |
[production] |
19:17 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
19:12 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
19:12 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231 (T399728)', diff saved to https://phabricator.wikimedia.org/P80760 and previous config saved to /var/cache/conftool/dbconfig/20250804-191213-fceratto.json |
[production] |
19:06 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2208', diff saved to https://phabricator.wikimedia.org/P80759 and previous config saved to /var/cache/conftool/dbconfig/20250804-190559-ladsgroup.json |
[production] |
18:59 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
18:58 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
18:57 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P80758 and previous config saved to /var/cache/conftool/dbconfig/20250804-185705-fceratto.json |
[production] |
18:55 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
18:50 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2208', diff saved to https://phabricator.wikimedia.org/P80757 and previous config saved to /var/cache/conftool/dbconfig/20250804-185052-ladsgroup.json |
[production] |
18:41 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P80756 and previous config saved to /var/cache/conftool/dbconfig/20250804-184156-fceratto.json |
[production] |
18:35 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2208 (T400854)', diff saved to https://phabricator.wikimedia.org/P80755 and previous config saved to /var/cache/conftool/dbconfig/20250804-183543-ladsgroup.json |
[production] |
18:33 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2208 (T400854)', diff saved to https://phabricator.wikimedia.org/P80754 and previous config saved to /var/cache/conftool/dbconfig/20250804-183259-ladsgroup.json |
[production] |
18:32 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2208.codfw.wmnet with reason: Maintenance |
[production] |
18:31 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2200.codfw.wmnet with reason: Maintenance |
[production] |
18:30 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
18:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T400854)', diff saved to https://phabricator.wikimedia.org/P80753 and previous config saved to /var/cache/conftool/dbconfig/20250804-183033-ladsgroup.json |
[production] |
18:26 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231 (T399728)', diff saved to https://phabricator.wikimedia.org/P80752 and previous config saved to /var/cache/conftool/dbconfig/20250804-182649-fceratto.json |
[production] |
18:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1231 (T399728)', diff saved to https://phabricator.wikimedia.org/P80751 and previous config saved to /var/cache/conftool/dbconfig/20250804-182420-fceratto.json |
[production] |
18:24 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1231.eqiad.wmnet with reason: Maintenance |
[production] |
18:23 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1225.eqiad.wmnet with reason: Maintenance |
[production] |