1501-1550 of 10000 results (117ms)
2025-10-22 ยง
12:17 <marostegui@cumin1003> dbctl commit (dc=all): 'db1184 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P84254 and previous config saved to /var/cache/conftool/dbconfig/20251022-121707-root.json [production]
12:11 <jelto@cumin1003> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Upgrade GitLab [production]
12:08 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db1184 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P84253 and previous config saved to /var/cache/conftool/dbconfig/20251022-120853-marostegui.json [production]
12:08 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1184.eqiad.wmnet with reason: Maintenance [production]
12:05 <marostegui@cumin1003> dbctl commit (dc=all): 'db1196 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P84252 and previous config saved to /var/cache/conftool/dbconfig/20251022-120533-root.json [production]
12:03 <cmooney@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ssw1-d1-eqiad [production]
12:03 <cmooney@cumin1003> START - Cookbook sre.hosts.remove-downtime for ssw1-d1-eqiad [production]
12:02 <marostegui@cumin1003> dbctl commit (dc=all): 'db1263 (re)pooling @ 25%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84251 and previous config saved to /var/cache/conftool/dbconfig/20251022-120256-root.json [production]
11:50 <marostegui@cumin1003> dbctl commit (dc=all): 'db1196 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84249 and previous config saved to /var/cache/conftool/dbconfig/20251022-115027-root.json [production]
11:48 <cmooney@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
11:47 <marostegui@cumin1003> dbctl commit (dc=all): 'db1263 (re)pooling @ 20%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84248 and previous config saved to /var/cache/conftool/dbconfig/20251022-114749-root.json [production]
11:46 <cmooney@cumin1003> START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
11:46 <marostegui@cumin1003> dbctl commit (dc=all): 'db2146 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P84247 and previous config saved to /var/cache/conftool/dbconfig/20251022-114629-root.json [production]
11:40 <mvernon@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on ms-be[1089-1090].eqiad.wmnet with reason: awaiting controller swap [production]
11:35 <marostegui@cumin1003> dbctl commit (dc=all): 'db1196 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P84246 and previous config saved to /var/cache/conftool/dbconfig/20251022-113521-root.json [production]
11:32 <marostegui@cumin1003> dbctl commit (dc=all): 'db1263 (re)pooling @ 10%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84245 and previous config saved to /var/cache/conftool/dbconfig/20251022-113243-root.json [production]
11:31 <marostegui@cumin1003> dbctl commit (dc=all): 'db2146 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P84244 and previous config saved to /var/cache/conftool/dbconfig/20251022-113123-root.json [production]
11:30 <mvolz@deploy1003> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
11:30 <mvolz@deploy1003> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
11:30 <mvolz@deploy1003> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
11:29 <mvolz@deploy1003> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
11:28 <mvolz@deploy1003> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:27 <mvolz@deploy1003> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:27 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db1196 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P84243 and previous config saved to /var/cache/conftool/dbconfig/20251022-112732-marostegui.json [production]
11:27 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1196.eqiad.wmnet with reason: Maintenance [production]
11:26 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
11:26 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
11:26 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 14 hosts with reason: Upgrading [production]
11:25 <cmooney@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host sretest1006 [production]
11:25 <cmooney@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host sretest1006 [production]
11:25 <dreamyjazz@deploy2002> Finished scap sync-world: Backport for [[gerrit:1198021|EventStreamConfig: Don't collect user-agent for suggested_investigations_interaction (T404177)]] (duration: 08m 48s) [production]
11:24 <mvolz@deploy1003> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:24 <mvolz@deploy1003> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:20 <dreamyjazz@deploy2002> kharlan, dreamyjazz: Continuing with sync [production]
11:20 <dreamyjazz@deploy2002> kharlan, dreamyjazz: Backport for [[gerrit:1198021|EventStreamConfig: Don't collect user-agent for suggested_investigations_interaction (T404177)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
11:20 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
11:19 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
11:18 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
11:17 <marostegui@cumin1003> dbctl commit (dc=all): 'db1263 (re)pooling @ 7%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P84242 and previous config saved to /var/cache/conftool/dbconfig/20251022-111736-root.json [production]
11:16 <cmooney@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host sretest1006 [production]
11:16 <cmooney@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host sretest1006 [production]
11:16 <marostegui@cumin1003> dbctl commit (dc=all): 'db2146 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84241 and previous config saved to /var/cache/conftool/dbconfig/20251022-111617-root.json [production]
11:16 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1198021|EventStreamConfig: Don't collect user-agent for suggested_investigations_interaction (T404177)]] [production]
11:16 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
11:15 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
11:15 <cmooney@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
11:15 <cmooney@cumin1003> START - Cookbook sre.hosts.provision for host sretest1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
11:15 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
11:14 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
11:14 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]