1601-1650 of 10000 results (26ms)
2025-07-28 ยง
08:54 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:54 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2171 (T399728)', diff saved to https://phabricator.wikimedia.org/P80035 and previous config saved to /var/cache/conftool/dbconfig/20250728-085359-fceratto.json [production]
08:52 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P80034 and previous config saved to /var/cache/conftool/dbconfig/20250728-085231-marostegui.json [production]
08:52 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1015.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:50 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2171 (T399728)', diff saved to https://phabricator.wikimedia.org/P80033 and previous config saved to /var/cache/conftool/dbconfig/20250728-085004-fceratto.json [production]
08:49 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2171.codfw.wmnet with reason: Maintenance [production]
08:49 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157 (T399728)', diff saved to https://phabricator.wikimedia.org/P80032 and previous config saved to /var/cache/conftool/dbconfig/20250728-084941-fceratto.json [production]
08:49 <hashar@deploy1003> Finished deploy [integration/docroot@827d626]: build: Updating brace-expansion to 1.1.12, 2.0.2 (duration: 00m 13s) [production]
08:48 <hashar@deploy1003> Started deploy [integration/docroot@827d626]: build: Updating brace-expansion to 1.1.12, 2.0.2 [production]
08:47 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:46 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:38 <gmodena@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
08:38 <gmodena@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
08:37 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P80031 and previous config saved to /var/cache/conftool/dbconfig/20250728-083724-marostegui.json [production]
08:34 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P80030 and previous config saved to /var/cache/conftool/dbconfig/20250728-083433-fceratto.json [production]
08:29 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
08:22 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177 (T399249)', diff saved to https://phabricator.wikimedia.org/P80029 and previous config saved to /var/cache/conftool/dbconfig/20250728-082216-marostegui.json [production]
08:20 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1177 (T399249)', diff saved to https://phabricator.wikimedia.org/P80028 and previous config saved to /var/cache/conftool/dbconfig/20250728-082002-marostegui.json [production]
08:19 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1177.eqiad.wmnet with reason: Maintenance [production]
08:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T399249)', diff saved to https://phabricator.wikimedia.org/P80027 and previous config saved to /var/cache/conftool/dbconfig/20250728-081939-marostegui.json [production]
08:19 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P80026 and previous config saved to /var/cache/conftool/dbconfig/20250728-081926-fceratto.json [production]
08:19 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
08:09 <marostegui@cumin1002> dbctl commit (dc=all): 'db2220 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P80025 and previous config saved to /var/cache/conftool/dbconfig/20250728-080940-root.json [production]
08:04 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P80024 and previous config saved to /var/cache/conftool/dbconfig/20250728-080432-marostegui.json [production]
08:04 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157 (T399728)', diff saved to https://phabricator.wikimedia.org/P80023 and previous config saved to /var/cache/conftool/dbconfig/20250728-080418-fceratto.json [production]
08:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2157 (T399728)', diff saved to https://phabricator.wikimedia.org/P80022 and previous config saved to /var/cache/conftool/dbconfig/20250728-080026-fceratto.json [production]
08:00 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2157.codfw.wmnet with reason: Maintenance [production]
07:54 <marostegui@cumin1002> dbctl commit (dc=all): 'db2220 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P80021 and previous config saved to /var/cache/conftool/dbconfig/20250728-075435-root.json [production]
07:52 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Secondary switchover s7 T400591 [production]
07:49 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P80020 and previous config saved to /var/cache/conftool/dbconfig/20250728-074924-marostegui.json [production]
07:48 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
07:39 <marostegui@cumin1002> dbctl commit (dc=all): 'db2220 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P80019 and previous config saved to /var/cache/conftool/dbconfig/20250728-073929-root.json [production]
07:38 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
07:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T399249)', diff saved to https://phabricator.wikimedia.org/P80018 and previous config saved to /var/cache/conftool/dbconfig/20250728-073417-marostegui.json [production]
07:32 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1172 (T399249)', diff saved to https://phabricator.wikimedia.org/P80016 and previous config saved to /var/cache/conftool/dbconfig/20250728-073203-marostegui.json [production]
07:31 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1172.eqiad.wmnet with reason: Maintenance [production]
07:31 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
07:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167 (T399249)', diff saved to https://phabricator.wikimedia.org/P80015 and previous config saved to /var/cache/conftool/dbconfig/20250728-073119-marostegui.json [production]
07:24 <marostegui@cumin1002> dbctl commit (dc=all): 'db2220 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P80014 and previous config saved to /var/cache/conftool/dbconfig/20250728-072423-root.json [production]
07:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2220 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P80013 and previous config saved to /var/cache/conftool/dbconfig/20250728-071643-marostegui.json [production]
07:16 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2220.codfw.wmnet with reason: Maintenance [production]
07:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P80012 and previous config saved to /var/cache/conftool/dbconfig/20250728-071611-marostegui.json [production]
07:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P80011 and previous config saved to /var/cache/conftool/dbconfig/20250728-070103-marostegui.json [production]
06:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167 (T399249)', diff saved to https://phabricator.wikimedia.org/P80010 and previous config saved to /var/cache/conftool/dbconfig/20250728-064556-marostegui.json [production]
06:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1167 (T399249)', diff saved to https://phabricator.wikimedia.org/P80009 and previous config saved to /var/cache/conftool/dbconfig/20250728-064241-marostegui.json [production]
06:42 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
06:42 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
06:40 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2165.codfw.wmnet with reason: Maintenance [production]
06:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es2038 T400436', diff saved to https://phabricator.wikimedia.org/P80008 and previous config saved to /var/cache/conftool/dbconfig/20250728-063039-root.json [production]
06:30 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es2038.codfw.wmnet with reason: Maintenance [production]