production SAL

8051-8100 of 10000 results (106ms)

2023-07-07 §
15:45	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
15:43	<aborrero@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudlb1001.eqiad.wmnet with OS bullseye	[production]
15:33	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
15:30	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
15:05	<bking@deploy1002>	Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 50s)	[production]
15:04	<bking@deploy1002>	Started deploy [wdqs/wdqs@dff41b7]: 0.3.124	[production]
14:58	<bking@deploy1002>	Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 49s)	[production]
14:57	<aborrero@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudlb1001.eqiad.wmnet with OS bullseye	[production]
14:57	<bking@deploy1002>	Started deploy [wdqs/wdqs@dff41b7]: 0.3.124	[production]
14:50	<bking@deploy1002>	Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 05s)	[production]
14:50	<bking@deploy1002>	Started deploy [wdqs/wdqs@dff41b7]: 0.3.124	[production]
14:49	<bking@deploy1002>	Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 05s)	[production]
14:49	<bking@deploy1002>	Started deploy [wdqs/wdqs@dff41b7]: 0.3.124	[production]
14:47	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
14:26	<bking@cumin1001>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0)	[production]
13:59	<bking@deploy1002>	Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 07s)	[production]
13:59	<bking@deploy1002>	Started deploy [wdqs/wdqs@dff41b7]: 0.3.124	[production]
13:58	<bking@deploy1002>	Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 05s)	[production]
13:58	<bking@deploy1002>	Started deploy [wdqs/wdqs@dff41b7]: 0.3.124	[production]
12:50	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
12:17	<hashar>	Re-enabled zuul-merger on contint2001 and removed the Icinga maintenance window	[production]
12:02	<aborrero@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
12:02	<aborrero@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikimediacloud - aborrero@cumin1001"	[production]
12:01	<aborrero@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikimediacloud - aborrero@cumin1001"	[production]
11:58	<aborrero@cumin1001>	START - Cookbook sre.dns.netbox	[production]
11:48	<aborrero@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
11:48	<aborrero@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikimediacloud - aborrero@cumin1001"	[production]
11:47	<aborrero@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikimediacloud - aborrero@cumin1001"	[production]
11:45	<aborrero@cumin1001>	START - Cookbook sre.dns.netbox	[production]
11:42	<hashar>	Enabled zuul-merger contint1002, disabled it on contint2001 and marked that host as under maintenance in Icinga for the next two hours	[production]
11:27	<hashar>	Stopped zuul-merger contint1002	[production]
11:17	<aborrero@cumin1001>	START - Cookbook sre.dns.netbox	[production]
11:05	<aborrero@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
11:05	<aborrero@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikimediacloud - aborrero@cumin1001"	[production]
11:04	<aborrero@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikimediacloud - aborrero@cumin1001"	[production]
11:02	<aborrero@cumin1001>	START - Cookbook sre.dns.netbox	[production]
10:13	<moritzm>	rebooting puppetdb1003	[production]
10:09	<moritzm>	rebooting puppetserver1001	[production]
10:06	<jmm@cumin2002>	END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host puppetdb2003.codfw.wmnet	[production]
10:05	<moritzm>	rebooting puppetserver2001	[production]
10:05	<jiji@deploy1002>	helmfile [staging] DONE helmfile.d/services/ipoid: apply	[production]
10:03	<jiji@deploy1002>	helmfile [staging] START helmfile.d/services/ipoid: apply	[production]
09:59	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1002.eqiad.wmnet	[production]
09:55	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host puppetdb2003.codfw.wmnet	[production]
09:55	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host netflow1002.eqiad.wmnet	[production]
09:52	<jmm@cumin2002>	END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host debmonitor2003.codfw.wmnet	[production]
09:52	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow2003.codfw.wmnet	[production]
09:46	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host netflow2003.codfw.wmnet	[production]
09:46	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host debmonitor2003.codfw.wmnet	[production]
09:45	<stevemunene@cumin1001>	END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) restart masters for Hadoop analytics cluster: Restart of jvm daemons.	[production]