1101-1150 of 10000 results (125ms)
2025-08-04 §
20:09 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2221.codfw.wmnet with reason: Maintenance [production]
20:09 <krinkle@deploy1003> krinkle: Continuing with sync [production]
20:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2218 (T400854)', diff saved to https://phabricator.wikimedia.org/P80767 and previous config saved to /var/cache/conftool/dbconfig/20250804-200938-ladsgroup.json [production]
20:07 <krinkle@deploy1003> krinkle: Backport for [[gerrit:1161757|Set wgCentralBannerRecorder to /beacon/… instead of //example.org/beacon/… (T400586)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:06 <krinkle@deploy1003> Started scap sync-world: Backport for [[gerrit:1161757|Set wgCentralBannerRecorder to /beacon/… instead of //example.org/beacon/… (T400586)]] [production]
20:05 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:04 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1043.eqiad.wmnet with OS bullseye [production]
20:02 <otto@deploy1003> helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply [production]
20:01 <otto@deploy1003> helmfile [eqiad] START helmfile.d/services/eventgate-main: apply [production]
20:01 <otto@deploy1003> helmfile [codfw] DONE helmfile.d/services/eventgate-main: apply [production]
20:01 <otto@deploy1003> helmfile [codfw] START helmfile.d/services/eventgate-main: apply [production]
20:01 <otto@deploy1003> helmfile [staging] DONE helmfile.d/services/eventgate-main: apply [production]
19:59 <otto@deploy1003> helmfile [staging] START helmfile.d/services/eventgate-main: apply [production]
19:54 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2218', diff saved to https://phabricator.wikimedia.org/P80765 and previous config saved to /var/cache/conftool/dbconfig/20250804-195431-ladsgroup.json [production]
19:50 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1043.eqiad.wmnet with OS bullseye [production]
19:39 <rzl@deploy1003> mwscript-k8s job started: Version.php --wiki=urwiki # Testing --sal for T376776 [production]
19:39 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2218', diff saved to https://phabricator.wikimedia.org/P80764 and previous config saved to /var/cache/conftool/dbconfig/20250804-193923-ladsgroup.json [production]
19:38 <otto@deploy1003> helmfile [eqiad] START helmfile.d/services/eventgate-analytics: apply [production]
19:37 <otto@deploy1003> helmfile [codfw] DONE helmfile.d/services/eventgate-analytics: apply [production]
19:36 <otto@deploy1003> helmfile [codfw] START helmfile.d/services/eventgate-analytics: apply [production]
19:36 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1043.eqiad.wmnet with OS bullseye [production]
19:35 <otto@deploy1003> helmfile [staging] DONE helmfile.d/services/eventgate-analytics: apply [production]
19:35 <ottomata> deploying eventgate-analytics and eventgate-main to pick up meta.dt field logic change - T376026 [production]
19:35 <otto@deploy1003> helmfile [staging] START helmfile.d/services/eventgate-analytics: apply [production]
19:24 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2218 (T400854)', diff saved to https://phabricator.wikimedia.org/P80763 and previous config saved to /var/cache/conftool/dbconfig/20250804-192415-ladsgroup.json [production]
19:21 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2218 (T400854)', diff saved to https://phabricator.wikimedia.org/P80762 and previous config saved to /var/cache/conftool/dbconfig/20250804-192129-ladsgroup.json [production]
19:21 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2218.codfw.wmnet with reason: Maintenance [production]
19:21 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2208 (T400854)', diff saved to https://phabricator.wikimedia.org/P80761 and previous config saved to /var/cache/conftool/dbconfig/20250804-192107-ladsgroup.json [production]
19:20 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1043.eqiad.wmnet with OS bullseye [production]
19:19 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1043.eqiad.wmnet with OS bullseye [production]
19:17 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:12 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance [production]
19:12 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231 (T399728)', diff saved to https://phabricator.wikimedia.org/P80760 and previous config saved to /var/cache/conftool/dbconfig/20250804-191213-fceratto.json [production]
19:06 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2208', diff saved to https://phabricator.wikimedia.org/P80759 and previous config saved to /var/cache/conftool/dbconfig/20250804-190559-ladsgroup.json [production]
18:59 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:58 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:57 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P80758 and previous config saved to /var/cache/conftool/dbconfig/20250804-185705-fceratto.json [production]
18:55 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:50 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2208', diff saved to https://phabricator.wikimedia.org/P80757 and previous config saved to /var/cache/conftool/dbconfig/20250804-185052-ladsgroup.json [production]
18:41 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P80756 and previous config saved to /var/cache/conftool/dbconfig/20250804-184156-fceratto.json [production]
18:35 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2208 (T400854)', diff saved to https://phabricator.wikimedia.org/P80755 and previous config saved to /var/cache/conftool/dbconfig/20250804-183543-ladsgroup.json [production]
18:33 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2208 (T400854)', diff saved to https://phabricator.wikimedia.org/P80754 and previous config saved to /var/cache/conftool/dbconfig/20250804-183259-ladsgroup.json [production]
18:32 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2208.codfw.wmnet with reason: Maintenance [production]
18:31 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
18:30 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
18:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182 (T400854)', diff saved to https://phabricator.wikimedia.org/P80753 and previous config saved to /var/cache/conftool/dbconfig/20250804-183033-ladsgroup.json [production]
18:26 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231 (T399728)', diff saved to https://phabricator.wikimedia.org/P80752 and previous config saved to /var/cache/conftool/dbconfig/20250804-182649-fceratto.json [production]
18:24 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1231 (T399728)', diff saved to https://phabricator.wikimedia.org/P80751 and previous config saved to /var/cache/conftool/dbconfig/20250804-182420-fceratto.json [production]
18:24 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1231.eqiad.wmnet with reason: Maintenance [production]
18:23 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]