production SAL

2051-2100 of 10000 results (166ms)

2025-02-19 §
13:45	<fabfur@cumin1002>	START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_esams and A:cp	[production]
13:42	<dcausse@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:42	<dcausse@deploy2002>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:41	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cloudgw1002.eqiad.wmnet	[production]
13:41	<andrew@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
13:41	<andrew@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudgw1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002"	[production]
13:41	<dcausse@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:41	<moritzm>	installing libtasn1-6 security updates	[production]
13:41	<dcausse@deploy2002>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:41	<andrew@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudgw1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002"	[production]
13:37	<andrew@cumin1002>	START - Cookbook sre.dns.netbox	[production]
13:32	<andrew@cumin1002>	START - Cookbook sre.hosts.decommission for hosts cloudgw1002.eqiad.wmnet	[production]
13:31	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1025.eqiad.wmnet	[production]
13:31	<andrew@cumin1002>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cloudgw1001.eqiad.wmnet	[production]
13:31	<andrew@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
13:31	<andrew@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudgw1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002"	[production]
13:30	<andrew@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudgw1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002"	[production]
13:26	<andrew@cumin1002>	START - Cookbook sre.dns.netbox	[production]
13:13	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1025.eqiad.wmnet with OS bookworm	[production]
13:11	<aborrero@cumin1002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) vlan1120.cloudgw1003.eqiad1.wikimediacloud.org on all recursors	[production]
13:11	<aborrero@cumin1002>	START - Cookbook sre.dns.wipe-cache vlan1120.cloudgw1003.eqiad1.wikimediacloud.org on all recursors	[production]
13:09	<andrew@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw1003.eqiad.wmnet with OS bullseye	[production]
12:54	<aborrero@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
12:54	<aborrero@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudgw updates - aborrero@cumin1002"	[production]
12:54	<aborrero@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudgw updates - aborrero@cumin1002"	[production]
12:54	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti1025.eqiad.wmnet with reason: host reimage	[production]
12:51	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1025.eqiad.wmnet with reason: host reimage	[production]
12:50	<aborrero@cumin1002>	START - Cookbook sre.dns.netbox	[production]
12:50	<andrew@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1003.eqiad.wmnet with reason: host reimage	[production]
12:46	<andrew@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1003.eqiad.wmnet with reason: host reimage	[production]
12:46	<fabfur@cumin1002>	END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_esams and A:cp	[production]
12:31	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS bookworm	[production]
12:30	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudgw1003.eqiad.wmnet with OS bullseye	[production]
12:29	<arnaudb@dns1004>	END - running authdns-update	[production]
12:28	<andrew@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw1001.eqiad.wmnet with OS bookworm	[production]
12:27	<arnaudb@dns1004>	START - running authdns-update	[production]
12:27	<andrew@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudgw1003.eqiad.wmnet with OS bullseye	[production]
12:22	<fabfur@cumin1002>	START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_esams and A:cp	[production]
12:12	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1036.eqiad.wmnet	[production]
12:10	<fabfur@cumin1002>	END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_eqiad and A:cp	[production]
12:10	<andrew@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
12:10	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1004.eqiad.wmnet to plain	[production]
12:09	<jmm@cumin2002>	START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1004.eqiad.wmnet to plain	[production]
12:07	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1036.eqiad.wmnet	[production]
12:07	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1036.eqiad.wmnet	[production]
12:06	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1004.eqiad.wmnet to drbd	[production]
12:06	<andrew@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
12:01	<kartik@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply	[production]
11:59	<jmm@cumin2002>	END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ganeti1025.eqiad.wmnet	[production]
11:59	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1025.eqiad.wmnet	[production]