production SAL

2024-08-06
18:28	<sukhe>	sudo cumin "lvs6001*" 'disable-puppet "rebooting" && systemctl stop pybal.service'	[production]
18:18	<brett>	stop pybal on lvs5005 for server reboot	[production]
18:13	<dancy@deploy1003>	Started scap sync-world: testing T370934	[production]
17:53	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs5004.eqsin.wmnet	[production]
17:50	<sukhe@cumin1002>	START - Cookbook sre.hosts.reboot-single for host lvs5004.eqsin.wmnet	[production]
17:47	<sukhe>	stop pybal on lvs5004 for server reboot	[production]
17:40	<mutante>	CI - adding a new SSH key to jenkins - in the same file without removing the old key yet - this is expected to have no effect, but if CI breaks will revert - T177826	[production]
17:01	<fnegri@cumin1002>	conftool action : set/pooled=yes; selector: name=clouddb1020.eqiad.wmnet,service=s5	[production]
17:01	<fnegri@cumin1002>	conftool action : set/pooled=yes; selector: name=clouddb1020.eqiad.wmnet,service=s8	[production]
16:56	<fnegri@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host clouddb1020.eqiad.wmnet with OS bookworm	[production]
16:44	<ryankemper@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1023.eqiad.wmnet with OS bullseye	[production]
16:39	<jhancock@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
16:39	<jhancock@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding payments200 to codfw - jhancock@cumin2002"	[production]
16:39	<jhancock@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding payments200 to codfw - jhancock@cumin2002"	[production]
16:35	<jhancock@cumin2002>	START - Cookbook sre.dns.netbox	[production]
16:23	<fnegri@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on clouddb1020.eqiad.wmnet with reason: host reimage	[production]
16:21	<fnegri@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on clouddb1020.eqiad.wmnet with reason: host reimage	[production]
16:08	<fnegri@cumin1002>	START - Cookbook sre.hosts.reimage for host clouddb1020.eqiad.wmnet with OS bookworm	[production]
16:08	<sukhe>	sudo cumin "A:dnsbox" "run-puppet-agent --enable 'upgrading anycast-hc'": finish anycast-hc upgrade: T370068	[production]
16:08	<sukhe>	sudo cumin "A:dnsbox" "run-puppet-agent --enable 'upgrading anycast-hc'": finish anycast-hc upgrade	[production]
16:03	<fnegri@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on clouddb1020.eqiad.wmnet with reason: Reimaging clouddb1020 T365424	[production]
16:03	<fnegri@cumin1002>	START - Cookbook sre.hosts.downtime for 1:00:00 on clouddb1020.eqiad.wmnet with reason: Reimaging clouddb1020 T365424	[production]
15:46	<jhancock@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
15:46	<jhancock@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2011 to codfw - jhancock@cumin2002"	[production]
15:46	<jhancock@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2011 to codfw - jhancock@cumin2002"	[production]
15:41	<jhancock@cumin2002>	START - Cookbook sre.dns.netbox	[production]
15:39	<jhancock@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
15:39	<jhancock@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2010 to codfw - jhancock@cumin2002"	[production]
15:39	<jhancock@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2010 to codfw - jhancock@cumin2002"	[production]
15:35	<jhancock@cumin2002>	START - Cookbook sre.dns.netbox	[production]
15:30	<dcausse@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
15:30	<dcausse@deploy1003>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
15:26	<dcausse@deploy1003>	helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
15:26	<dcausse@deploy1003>	helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
15:25	<sukhe@cumin1002>	conftool action : set/pooled=yes; selector: name=dns1006.wikimedia.org [reason: [done] anycast-healthchecker 0.9.8 upgrade]	[production]
15:25	<elukey@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker2035.mgmt.codfw.wmnet with reboot policy GRACEFUL	[production]
15:23	<ryankemper@cumin2002>	START - Cookbook sre.hosts.reimage for host wdqs1023.eqiad.wmnet with OS bullseye	[production]
15:23	<elukey@cumin1002>	START - Cookbook sre.hosts.provision for host wikikube-worker2035.mgmt.codfw.wmnet with reboot policy GRACEFUL	[production]
15:23	<sukhe@cumin1002>	conftool action : set/pooled=no; selector: name=dns1006.wikimedia.org [reason: anycast-healthchecker 0.9.8 upgrade]	[production]
15:21	<sukhe@cumin1002>	conftool action : set/pooled=yes; selector: name=dns1005.wikimedia.org [reason: [done] anycast-healthchecker 0.9.8 upgrade]	[production]
15:20	<elukey@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2002.mgmt.codfw.wmnet with reboot policy FORCED	[production]
15:18	<sukhe@cumin1002>	conftool action : set/pooled=no; selector: name=dns1005.wikimedia.org [reason: anycast-healthchecker 0.9.8 upgrade]	[production]
15:16	<sukhe@cumin1002>	conftool action : set/pooled=yes; selector: name=dns1004.wikimedia.org [reason: [done] anycast-healthchecker 0.9.8 upgrade]	[production]
15:14	<sukhe@cumin1002>	conftool action : set/pooled=no; selector: name=dns1004.wikimedia.org [reason: anycast-healthchecker 0.9.8 upgrade]	[production]
15:12	<elukey@cumin1002>	START - Cookbook sre.hosts.provision for host sretest2002.mgmt.codfw.wmnet with reboot policy FORCED	[production]
15:11	<elukey@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest1001.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
15:10	<elukey@cumin1002>	START - Cookbook sre.hosts.provision for host sretest1001.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
15:10	<cdanis>	re-enabling puppet on cp nodes to deploy https://gerrit.wikimedia.org/r/1059126	[production]
15:02	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker1296.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:01	<cdanis>	disabling puppet on cp nodes to deploy https://gerrit.wikimedia.org/r/1059126	[production]