2025-06-06
17:08 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: T383811 - bking@cumin2002 [production]
17:06 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: T383811 - bking@cumin2002 [production]
17:00 <sukhe> forced agent run on O:alerting_host to reload vopsbot to add cdobbins [production]
16:57 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host db2244.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:57 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db2244.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:56 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host db2244.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:55 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host db2244 [production]
16:55 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host db2244 [production]
16:44 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:42 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
16:08 <sbassett> Deployed security update to fix T396111 [production]
15:41 <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2066.codfw.wmnet [production]
15:34 <eevans@cumin1002> START - Cookbook sre.hosts.reboot-single for host ms-be2066.codfw.wmnet [production]
15:24 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) config_reloading P{lvs1013.eqiad.wmnet} and A:liberica [production]
15:23 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin config_reloading P{lvs1013.eqiad.wmnet} and A:liberica [production]
15:20 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-5 [tools]
15:19 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs1013.eqiad.wmnet [production]
15:19 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs1013.eqiad.wmnet [production]
15:14 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-5 [tools]
14:53 <akosiaris@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
14:42 <akosiaris@deploy1003> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
14:37 <jnuche> Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/merge_requests/79 [releng]
14:24 <sukhe@dns1004> END - running authdns-update [production]
14:23 <sukhe@dns1004> START - running authdns-update [production]
14:23 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns2004*} and (A:dnsbox) [production]
14:23 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns2004.wikimedia.org [production]
14:22 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns1005*} and (A:dnsbox) [production]
14:22 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns1005.wikimedia.org [production]
14:11 <taavi> deleting old generated SMW tarballs from TarballDownloader/build to free up disk space. tool appears to be abandoned at least since grid shutdown. # T396220 [tools.nyandata]
14:10 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot begin reboot of dns1005.wikimedia.org [production]
14:10 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on P{dns1005*} and (A:dnsbox) [production]
14:10 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot begin reboot of dns2004.wikimedia.org [production]
14:10 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on P{dns2004*} and (A:dnsbox) [production]
13:51 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns2006*} and (A:dnsbox) [production]
13:51 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns2006.wikimedia.org [production]
13:49 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns1006*} and (A:dnsbox) [production]
13:49 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns1006.wikimedia.org [production]
13:40 <vgutierrez@cumin1003> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) config_reloading P{lvs1013.eqiad.wmnet} and A:liberica [production]
13:40 <vgutierrez@cumin1003> START - Cookbook sre.loadbalancer.admin config_reloading P{lvs1013.eqiad.wmnet} and A:liberica [production]
13:40 <vgutierrez@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on lvs1013.eqiad.wmnet with reason: switching to katran [production]
13:35 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot begin reboot of dns2006.wikimedia.org [production]
13:35 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on P{dns2006*} and (A:dnsbox) [production]
13:34 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot begin reboot of dns1006.wikimedia.org [production]
13:34 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on P{dns1006*} and (A:dnsbox) [production]
13:32 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns3003*} and (A:dnsbox) [production]
13:32 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns3003.wikimedia.org [production]
13:31 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns6001*} and (A:dnsbox) [production]
13:31 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns6001.wikimedia.org [production]
13:21 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:21 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add back entry for mistakenly deleted ssw1-a8-codfw IP - cmooney@cumin1003" [production]