production SAL

2751-2800 of 10000 results (138ms)

2025-09-22 §
17:55	<andrew@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage	[production]
17:38	<andrew@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1024.eqiad.wmnet with OS bookworm	[production]
17:36	<andrew@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1023.eqiad.wmnet with OS bookworm	[production]
17:24	<sfaci@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
17:23	<sfaci@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
17:18	<andrew@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage	[production]
17:11	<andrew@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage	[production]
16:54	<andrew@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1023.eqiad.wmnet with OS bookworm	[production]
16:48	<andrew@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm	[production]
16:45	<andrew@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1020.eqiad.wmnet with OS bookworm	[production]
16:43	<andrew@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm	[production]
16:41	<andrew@cumin2002>	END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet']	[production]
16:32	<andrew@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet']	[production]
16:31	<andrew@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm	[production]
16:28	<andrew@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage	[production]
16:22	<andrew@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage	[production]
16:12	<andrew@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm	[production]
16:10	<jhathaway@cumin2002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on sretest2001.codfw.wmnet with reason: T383173	[production]
16:05	<andrew@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1020.eqiad.wmnet with OS bookworm	[production]
16:01	<andrew@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1019.eqiad.wmnet with OS bookworm	[production]
15:45	<toyofuku@deploy1003>	Finished scap sync-world: Backport for [[gerrit:1187052\|Enable search recommendation on Wikipedia (T402048)]] (duration: 11m 35s)	[production]
15:43	<andrew@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage	[production]
15:40	<toyofuku@deploy1003>	jdlrobson, toyofuku: Continuing with sync	[production]
15:39	<andrew@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage	[production]
15:37	<toyofuku@deploy1003>	jdlrobson, toyofuku: Backport for [[gerrit:1187052\|Enable search recommendation on Wikipedia (T402048)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
15:33	<toyofuku@deploy1003>	Started scap sync-world: Backport for [[gerrit:1187052\|Enable search recommendation on Wikipedia (T402048)]]	[production]
15:22	<andrew@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudcephosd1019.eqiad.wmnet with OS bookworm	[production]
15:19	<pt1979@cumin2002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on fasw2-c8a-codfw,fasw2-c8b-codfw with reason: pfw1-codfw relocation	[production]
15:17	<pt1979@cumin2002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pfw1-codfw with reason: pfw1-codfw relocation	[production]
15:15	<moritzm>	installing clamav security updates	[production]
15:11	<pt1979@cumin2002>	DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on ‘pfw1-codfw’ with reason: ‘pfw1	[production]
14:32	<dcausse@deploy1003>	helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
14:32	<dcausse@deploy1003>	helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
14:31	<dcausse@deploy1003>	helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply	[production]
14:31	<dcausse@deploy1003>	helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply	[production]
14:22	<brouberol@deploy1003>	helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:21	<brouberol@deploy1003>	helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'.	[production]
14:20	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:20	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:12	<brouberol@deploy1003>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
14:11	<brouberol@deploy1003>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
14:10	<brouberol@deploy1003>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:09	<brouberol@deploy1003>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
14:08	<brouberol@deploy1003>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:07	<brouberol@deploy1003>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
13:58	<sukhe>	delete list: sectrainings@lists.wikimedia.org [no archives, project obsolete since 2022]	[production]
13:54	<phuedx@deploy1003>	Finished scap sync-world: Backport for [[gerrit:1190280\|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]] (duration: 12m 25s)	[production]
13:49	<phuedx@deploy1003>	phuedx: Continuing with sync	[production]
13:48	<phuedx@deploy1003>	phuedx: Backport for [[gerrit:1190280\|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
13:42	<phuedx@deploy1003>	Started scap sync-world: Backport for [[gerrit:1190280\|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]]	[production]