2026-06-16
20:41 <jdlrobson@deploy1003> Started scap sync-world: Backport for [[gerrit:1302890|Guard round function with a supports query (T424596)]], [[gerrit:1302935|Add wprov parameter to home link (T429268)]] [production]
20:40 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=dns5004.* [production]
20:33 <brett@dns1004> END - running authdns-update [production]
20:31 <brett@dns1004> START - running authdns-update [production]
20:30 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dns5004.wikimedia.org with OS bookworm [production]
20:30 <brett@dns5004> FAIL - running authdns-update [production]
20:29 <brett@dns5004> START - running authdns-update [production]
20:28 <brett@dns5004> FAIL - running authdns-update [production]
20:27 <kemayo@deploy1003> Finished scap sync-world: Backport for [[gerrit:1302320|EditChecks: Namespace tracking object for seen/shown/used checks]] (duration: 09m 50s) [production]
20:26 <brett@dns5004> START - running authdns-update [production]
20:26 <brett@dns5004> START - running authdns-update [production]
20:25 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=dns5004.*,service=authdns-update [production]
20:23 <kemayo@deploy1003> kemayo: Continuing with deployment [production]
20:19 <kemayo@deploy1003> kemayo: Backport for [[gerrit:1302320|EditChecks: Namespace tracking object for seen/shown/used checks]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:18 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
20:17 <kemayo@deploy1003> Started scap sync-world: Backport for [[gerrit:1302320|EditChecks: Namespace tracking object for seen/shown/used checks]] [production]
20:09 <jasmine@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
20:00 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-wdqs1001.eqiad.wmnet with reason: host reimage [production]
19:56 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-wdqs1001.eqiad.wmnet with reason: host reimage [production]
19:55 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dse-k8s-wdqs2001.codfw.wmnet with OS bookworm [production]
19:55 <jasmine@cumin2002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:54 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:47 <jasmine@cumin2002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:46 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:45 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-wdqs2001.codfw.wmnet with OS bookworm [production]
19:45 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-wdqs1001.eqiad.wmnet with OS bookworm [production]
19:39 <jasmine@cumin2002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:35 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-haproxy rolling restart of HAProxy on A:cp - OpenSSL update () [production]
19:34 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:31 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
19:30 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-haproxy rolling restart of HAProxy on A:cp - OpenSSL update () [production]
19:27 <jasmine@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:18 <topranks> restarting grpc server on eqiad SR-Linux switches to recover from problem of no free threads T429242 [production]
19:08 <jasmine@cumin2002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:08 <robh@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:02 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:00 <krinkle@deploy1003> Finished scap sync-world: Backport for [[gerrit:1302274|Disable ShortUrl on hiwiki, hiwikiversity, maiwiki, knwiki, knwikisource, tcywiki (T107188)]] (duration: 11m 18s) [production]
18:58 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
18:56 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
18:56 <jasmine@cumin2002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:55 <krinkle@deploy1003> krinkle: Continuing with deployment [production]
18:52 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:51 <krinkle@deploy1003> krinkle: Backport for [[gerrit:1302274|Disable ShortUrl on hiwiki, hiwikiversity, maiwiki, knwiki, knwikisource, tcywiki (T107188)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
18:48 <krinkle@deploy1003> Started scap sync-world: Backport for [[gerrit:1302274|Disable ShortUrl on hiwiki, hiwikiversity, maiwiki, knwiki, knwikisource, tcywiki (T107188)]] [production]
18:45 <jasmine@cumin2002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:44 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dns5004.wikimedia.org with reason: host reimage [production]
18:41 <eevans@deploy1003> helmfile [codfw] DONE helmfile.d/services/data-gateway: apply [production]
18:41 <eevans@deploy1003> helmfile [codfw] START helmfile.d/services/data-gateway: apply [production]
18:41 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on dns5004.wikimedia.org with reason: host reimage [production]
18:40 <eevans@deploy1003> helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply [production]