951-1000 of 10000 results (32ms)
2025-12-02 ยง
16:47 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm afb538cb-a128-450b-a02f-4fee25183588 (cluster eqiad1) [admin]
16:47 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 8b546fd2-137d-4b91-86f3-b50fa515c98c (cluster eqiad1) [admin]
16:46 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 8b546fd2-137d-4b91-86f3-b50fa515c98c (cluster eqiad1) [admin]
16:46 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm c7b1311c-ee8b-4118-b907-ad0382644350 (cluster eqiad1) [admin]
16:46 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm c7b1311c-ee8b-4118-b907-ad0382644350 (cluster eqiad1) [admin]
16:45 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 60d1c96e-0c3c-47e1-86d6-cd30527d5066 (cluster eqiad1) [admin]
16:45 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' [admin]
16:45 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 60d1c96e-0c3c-47e1-86d6-cd30527d5066 (cluster eqiad1) [admin]
16:45 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 028c29da-adcb-4239-bcb4-6e80516e6fbb (cluster eqiad1) [admin]
16:44 <dancy> Pausing WMCS Gitlab runners [releng]
16:44 <ihurbain@deploy2002> Finished scap sync-world: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] (duration: 09m 21s) [production]
16:44 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 028c29da-adcb-4239-bcb4-6e80516e6fbb (cluster eqiad1) [admin]
16:43 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:43 <inflatador> bking@wmf3062 restart WDQS codfw to resolve lag/possible deadlocks [production]
16:40 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' [admin]
16:39 <ihurbain@deploy2002> ihurbain: Continuing with sync [production]
16:39 <jhathaway@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:38 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply [production]
16:38 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply [production]
16:37 <ihurbain@deploy2002> ihurbain: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
16:36 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2193 (T410589)', diff saved to https://phabricator.wikimedia.org/P86319 and previous config saved to /var/cache/conftool/dbconfig/20251202-163612-ladsgroup.json [production]
16:36 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1251 gradually with 4 steps - Pool db1251.eqiad.wmnet in after cloning [production]
16:35 <ihurbain@deploy2002> Started scap sync-world: Backport for [[gerrit:1214069|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]], [[gerrit:1214070|Bump parsoid to v0.23.0-a7.1 on wmf.4 (T411238 T410960)]] [production]
16:34 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm e6796dbd-2511-4bf6-bdee-4a14a7414d5f (cluster eqiad1) [admin]
16:33 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 5e5c3bad-f1c7-49e5-b846-edaf111af83c (cluster eqiad1) [admin]
16:33 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm e6796dbd-2511-4bf6-bdee-4a14a7414d5f (cluster eqiad1) [admin]
16:33 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 5e5c3bad-f1c7-49e5-b846-edaf111af83c (cluster eqiad1) [admin]
16:32 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 1d74fc9a-0ddd-41d6-a0fd-5bba5e455c32 (cluster eqiad1) [admin]
16:32 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.instance.stop_start (exit_code=0) vm 55ed5d49-43db-4f62-8c40-5cb0431dfce2 (cluster eqiad1) [admin]
16:31 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 1d74fc9a-0ddd-41d6-a0fd-5bba5e455c32 (cluster eqiad1) [admin]
16:31 <andrew@cloudcumin1001> START - Cookbook wmcs.vps.instance.stop_start vm 55ed5d49-43db-4f62-8c40-5cb0431dfce2 (cluster eqiad1) [admin]
16:30 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:27 <brett> import varnish 7.1.1-2~bpo13+wmf2 into trixie-wikimedia - T401832 [production]
16:24 <jhathaway@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:23 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:20 <jhathaway@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:19 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:18 <swfrench-wmf> restarted navtiming on webperf1003 - T352245 [production]
16:14 <swfrench-wmf> begin rolling restarts of eqiad-associated confds - T352245 [production]
16:12 <moritzm> installing nodejs security updates [production]
16:12 <swfrench@deploy2002> Unlocked for deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 (duration: 03m 45s) [production]
16:12 <jhathaway@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:10 <jhathaway@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm [production]
16:08 <swfrench@deploy2002> Locking from deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 [production]
16:08 <swfrench-wmf> migrating etcd to PKI certs on conf1008 - T352245 [production]
16:08 <jhathaway@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
16:02 <moritzm> installing libsndfile security updates [production]
16:01 <jhathaway@cumin1003> START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
16:00 <gehel> restarting wdqs@codfw - system overloaded [production]
15:58 <jhathaway@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on sretest1005.eqiad.wmnet with reason: ipxe [production]