production SAL

3001-3050 of 10000 results (81ms)

2024-02-13 §
11:34	<brouberol@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage2001.codfw.wmnet with reason: host reimage	[production]
11:33	<cgoubert@deploy2002>	helmfile [codfw] DONE helmfile.d/services/mw-debug: apply	[production]
11:32	<cgoubert@deploy2002>	helmfile [codfw] START helmfile.d/services/mw-debug: apply	[production]
11:32	<cgoubert@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply	[production]
11:31	<brouberol@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage2001.codfw.wmnet with reason: host reimage	[production]
11:31	<cgoubert@deploy2002>	helmfile [eqiad] START helmfile.d/services/mw-debug: apply	[production]
11:27	<hnowlan@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye	[production]
11:24	<cgoubert@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply	[production]
11:24	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
11:24	<cgoubert@deploy2002>	helmfile [eqiad] START helmfile.d/services/mw-debug: apply	[production]
11:24	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
11:24	<claime>	Change default maxUnavailable for mw-on-k8s to 10%	[production]
11:21	<brouberol@cumin1002>	START - Cookbook sre.hosts.reimage for host apifeatureusage2001.codfw.wmnet with OS bullseye	[production]
11:20	<brouberol@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host apifeatureusage1001.eqiad.wmnet with OS bullseye	[production]
11:14	<gmodena@deploy2002>	helmfile [codfw] DONE helmfile.d/services/eventstreams: apply	[production]
11:14	<gmodena@deploy2002>	helmfile [codfw] START helmfile.d/services/eventstreams: apply	[production]
11:14	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye	[production]
11:13	<hnowlan@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye	[production]
11:12	<gmodena@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/eventstreams: apply	[production]
11:11	<gmodena@deploy2002>	helmfile [eqiad] START helmfile.d/services/eventstreams: apply	[production]
11:10	<gmodena@deploy2002>	helmfile [staging] DONE helmfile.d/services/eventstreams: apply	[production]
11:10	<gmodena@deploy2002>	helmfile [staging] START helmfile.d/services/eventstreams: apply	[production]
11:04	<brouberol@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage	[production]
11:01	<brouberol@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage	[production]
11:01	<cgoubert@cumin2002>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2388.codfw.wmnet	[production]
11:01	<cgoubert@cumin2002>	START - Cookbook sre.hosts.remove-downtime for mw2388.codfw.wmnet	[production]
10:57	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye	[production]
10:49	<brouberol@cumin1002>	START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bullseye	[production]
10:41	<brouberol@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apifeatureusage1001.eqiad.wmnet with OS bookworm	[production]
10:39	<brouberol@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage	[production]
10:36	<brouberol@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage	[production]
10:25	<brouberol@cumin1002>	START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bookworm	[production]
10:23	<brouberol@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apifeatureusage1001.eqiad.wmnet with OS bookworm	[production]
10:23	<kharlan@deploy2002>	helmfile [codfw] DONE helmfile.d/services/ipoid: apply	[production]
10:23	<kharlan@deploy2002>	helmfile [codfw] START helmfile.d/services/ipoid: apply	[production]
10:22	<kharlan@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/ipoid: apply	[production]
10:22	<kharlan@deploy2002>	helmfile [eqiad] START helmfile.d/services/ipoid: apply	[production]
10:22	<kharlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/ipoid: apply	[production]
10:22	<kharlan@deploy2002>	helmfile [staging] START helmfile.d/services/ipoid: apply	[production]
10:09	<brouberol@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage	[production]
10:06	<brouberol@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage	[production]
10:05	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb2002-dev.codfw.wmnet	[production]
09:58	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host clouddb2002-dev.codfw.wmnet	[production]
09:57	<brouberol@cumin1002>	START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bookworm	[production]
09:23	<stran@deploy2002>	helmfile [codfw] DONE helmfile.d/services/ipoid: apply	[production]
09:22	<stran@deploy2002>	helmfile [codfw] START helmfile.d/services/ipoid: apply	[production]
09:22	<stran@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/ipoid: apply	[production]
09:22	<akosiaris>	delete sessionstore pod to force rescheduling	[production]
09:21	<stran@deploy2002>	helmfile [eqiad] START helmfile.d/services/ipoid: apply	[production]
09:20	<stran@deploy2002>	helmfile [staging] DONE helmfile.d/services/ipoid: apply	[production]