2025-09-25 §
18:24 <brennen@deploy2002> Started scap sync-world: Backport for [[gerrit:1191334|fix: provide a eventType fallback for already scheduled jobs (T405514)]], [[gerrit:1191414|fix: prevent type-error from outdated serialization (T405511)]] [production]
18:11 <jhancock@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS bookworm [production]
18:08 <jhancock@cumin1002> START - Cookbook sre.hosts.reimage for host dse-k8s-worker2003.codfw.wmnet with OS bookworm [production]
17:57 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply [production]
17:56 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
17:32 <jasmine@deploy2002> Finished scap sync-world: Test deployment to validate deployment server switchover - T399891. (duration: 39m 28s) [production]
17:25 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
17:25 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
17:03 <ejegg|food> donorwiki upgraded from df2482ce to 52104fab [production]
17:02 <pt1979@cumin2002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device fasw1-f5b-codfw [production]
17:02 <pt1979@cumin2002> START - Cookbook sre.network.tls for network device fasw1-f5b-codfw [production]
16:54 <pt1979@cumin2002> END (PASS) - Cookbook sre.network.provision (exit_code=0) for device fasw1-f5b-codfw.mgmt.codfw.wmnet [production]
16:52 <jasmine@deploy2002> Started scap sync-world: Test deployment to validate deployment server switchover - T399891. [production]
16:46 <jhancock@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
16:41 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
16:40 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:27 <jasmine@dns1004> END - running authdns-update [production]
16:26 <sukhe@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host durum7003.magru.wmnet with OS bookworm [production]
16:25 <jasmine@dns1004> START - running authdns-update [production]
16:23 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:23 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add management record for fasw1-f5b-codfw - pt1979@cumin2002" [production]
16:23 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add management record for fasw1-f5b-codfw - pt1979@cumin2002" [production]
16:22 <jasmine@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on releases2003.codfw.wmnet,releases1003.eqiad.wmnet with reason: Deployment server switchover [production]
16:15 <jasmine_> stopped spiderpig-apiserver, spiderpig-jobrunner on deploy1003 [production]
16:13 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
16:13 <pt1979@cumin2002> START - Cookbook sre.network.provision for device fasw1-f5b-codfw.mgmt.codfw.wmnet [production]
16:13 <pt1979@cumin2002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device fasw1-f5a-codfw [production]
16:13 <pt1979@cumin2002> START - Cookbook sre.network.tls for network device fasw1-f5a-codfw [production]
16:09 <pt1979@cumin2002> END (PASS) - Cookbook sre.network.provision (exit_code=0) for device fasw1-f5a-codfw.mgmt.codfw.wmnet [production]
16:06 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dse-k8s-worker2003.codfw.wmnet with OS bookworm [production]
15:54 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
15:44 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
15:41 <sukhe@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum5001.eqsin.wmnet with OS trixie [production]
15:38 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:38 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add management record for fasw1-f5a-codfw - pt1979@cumin2002" [production]
15:38 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add management record for fasw1-f5a-codfw - pt1979@cumin2002" [production]
15:35 <sukhe@cumin1003> START - Cookbook sre.hosts.reimage for host durum7003.magru.wmnet with OS bookworm [production]
15:34 <sukhe> sudo puppet node deactivate durum7003.magru.wmnet: stuck after reimage with failed puppet run [production]
15:33 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
15:33 <pt1979@cumin2002> START - Cookbook sre.network.provision for device fasw1-f5a-codfw.mgmt.codfw.wmnet [production]
15:32 <tchin@deploy1003> helmfile [codfw] DONE helmfile.d/services/eventgate-logging-external: apply [production]
15:31 <tchin@deploy1003> helmfile [codfw] START helmfile.d/services/eventgate-logging-external: apply [production]
15:24 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1045.eqiad.wmnet [production]
15:23 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1045.eqiad.wmnet [production]
15:22 <tchin@deploy1003> helmfile [staging] DONE helmfile.d/services/eventgate-logging-external: apply [production]
15:22 <tchin@deploy1003> helmfile [staging] START helmfile.d/services/eventgate-logging-external: apply [production]
15:19 <sukhe@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum5001.eqsin.wmnet with reason: host reimage [production]
15:18 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1045.eqiad.wmnet [production]
15:15 <sukhe@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on durum5001.eqsin.wmnet with reason: host reimage [production]