601-650 of 10000 results (105ms)
2025-07-23 ยง
14:58 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash2031.codfw.wmnet with reason: host reimage [production]
14:46 <bking@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply [production]
14:46 <bking@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply [production]
14:38 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash2031.codfw.wmnet with OS bookworm [production]
14:37 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash2030.codfw.wmnet with OS bookworm [production]
14:27 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:26 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:25 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:17 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest2009.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
14:16 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host sretest2009.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
14:13 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2224 (T399728)', diff saved to https://phabricator.wikimedia.org/P79746 and previous config saved to /var/cache/conftool/dbconfig/20250723-141353-fceratto.json [production]
14:05 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:04 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:04 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:03 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash2030.codfw.wmnet with reason: host reimage [production]
14:03 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:59 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:58 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2224', diff saved to https://phabricator.wikimedia.org/P79745 and previous config saved to /var/cache/conftool/dbconfig/20250723-135846-fceratto.json [production]
13:58 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash2030.codfw.wmnet with reason: host reimage [production]
13:45 <mszabo@deploy1003> Finished scap sync-world: Backport for [[gerrit:1172016|Enable wgWikimediaEventsCreateAccountInstrumentation (T394744)]] (duration: 09m 31s) [production]
13:43 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2224', diff saved to https://phabricator.wikimedia.org/P79743 and previous config saved to /var/cache/conftool/dbconfig/20250723-134338-fceratto.json [production]
13:40 <mszabo@deploy1003> mszabo: Continuing with sync [production]
13:39 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash2030.codfw.wmnet with OS bookworm [production]
13:38 <mszabo@deploy1003> mszabo: Backport for [[gerrit:1172016|Enable wgWikimediaEventsCreateAccountInstrumentation (T394744)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:36 <mszabo@deploy1003> Started scap sync-world: Backport for [[gerrit:1172016|Enable wgWikimediaEventsCreateAccountInstrumentation (T394744)]] [production]
13:35 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:35 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ssw1-d1-eqiad mgmt - ayounsi@cumin1003" [production]
13:35 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ssw1-d1-eqiad mgmt - ayounsi@cumin1003" [production]
13:28 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2224 (T399728)', diff saved to https://phabricator.wikimedia.org/P79741 and previous config saved to /var/cache/conftool/dbconfig/20250723-132831-fceratto.json [production]
13:27 <ayounsi@cumin1003> START - Cookbook sre.dns.netbox [production]
13:25 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2224 (T399728)', diff saved to https://phabricator.wikimedia.org/P79740 and previous config saved to /var/cache/conftool/dbconfig/20250723-132548-fceratto.json [production]
13:25 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2224.codfw.wmnet with reason: Maintenance [production]
13:25 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217 (T399728)', diff saved to https://phabricator.wikimedia.org/P79739 and previous config saved to /var/cache/conftool/dbconfig/20250723-132525-fceratto.json [production]
13:10 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217', diff saved to https://phabricator.wikimedia.org/P79738 and previous config saved to /var/cache/conftool/dbconfig/20250723-131018-fceratto.json [production]
13:00 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:58 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1013.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:57 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-serve1012.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART [production]
12:57 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1012.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART [production]
12:55 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217', diff saved to https://phabricator.wikimedia.org/P79737 and previous config saved to /var/cache/conftool/dbconfig/20250723-125510-fceratto.json [production]
12:40 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217 (T399728)', diff saved to https://phabricator.wikimedia.org/P79736 and previous config saved to /var/cache/conftool/dbconfig/20250723-124003-fceratto.json [production]
12:37 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2217 (T399728)', diff saved to https://phabricator.wikimedia.org/P79735 and previous config saved to /var/cache/conftool/dbconfig/20250723-123722-fceratto.json [production]
12:37 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2217.codfw.wmnet with reason: Maintenance [production]
12:37 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2214 (T399728)', diff saved to https://phabricator.wikimedia.org/P79734 and previous config saved to /var/cache/conftool/dbconfig/20250723-123659-fceratto.json [production]
12:32 <gmodena@deploy1003> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:31 <gmodena@deploy1003> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:29 <gmodena@deploy1003> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:29 <gmodena@deploy1003> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:28 <gmodena@deploy1003> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:28 <gmodena@deploy1003> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:28 <gmodena@deploy1003> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]