production SAL


2024-06-03
11:51	<ladsgroup@deploy1002>	Started scap: Backport for [[gerrit:1037942|Enable numeric sorting for Persian (T329440)]]	[production]
11:35	<effie>	restart memcached on mc1050 and mc2050	[production]
11:34	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1167 (T352010)', diff saved to https://phabricator.wikimedia.org/P63927 and previous config saved to /var/cache/conftool/dbconfig/20240603-113447-ladsgroup.json	[production]
11:34	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
11:34	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
11:34	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance	[production]
11:34	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance	[production]
11:27	<jynus@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on backup2011.codfw.wmnet with reason: remount filesystem	[production]
11:26	<jynus@cumin1002>	START - Cookbook sre.hosts.downtime for 1:00:00 on backup2011.codfw.wmnet with reason: remount filesystem	[production]
11:24	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1037.eqiad.wmnet with OS bookworm	[production]
11:07	<jmm@cumin2002>	END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host snapshot1013.eqiad.wmnet	[production]
11:07	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1037.eqiad.wmnet with reason: host reimage	[production]
11:04	<jiji@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc1037.eqiad.wmnet with reason: host reimage	[production]
10:54	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1199 (T364069)', diff saved to https://phabricator.wikimedia.org/P63926 and previous config saved to /var/cache/conftool/dbconfig/20240603-105416-marostegui.json	[production]
10:54	<jmm@cumin2002>	START - Cookbook sre.puppet.migrate-host for host snapshot1013.eqiad.wmnet	[production]
10:54	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1199.eqiad.wmnet with reason: Maintenance	[production]
10:53	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1199.eqiad.wmnet with reason: Maintenance	[production]
10:53	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1190 (T364069)', diff saved to https://phabricator.wikimedia.org/P63925 and previous config saved to /var/cache/conftool/dbconfig/20240603-105352-marostegui.json	[production]
10:50	<jiji@cumin1002>	START - Cookbook sre.hosts.reimage for host mc1037.eqiad.wmnet with OS bookworm	[production]
10:41	<moritzm>	installing linux 5.10.218 security updates	[production]
10:40	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1038.eqiad.wmnet with OS bookworm	[production]
10:38	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P63924 and previous config saved to /var/cache/conftool/dbconfig/20240603-103844-marostegui.json	[production]
10:29	<btullis@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host snapshot1013.eqiad.wmnet with OS bullseye	[production]
10:23	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P63923 and previous config saved to /var/cache/conftool/dbconfig/20240603-102335-marostegui.json	[production]
10:21	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1038.eqiad.wmnet with reason: host reimage	[production]
10:18	<jiji@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc1038.eqiad.wmnet with reason: host reimage	[production]
10:08	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1190 (T364069)', diff saved to https://phabricator.wikimedia.org/P63922 and previous config saved to /var/cache/conftool/dbconfig/20240603-100827-marostegui.json	[production]
10:03	<jiji@cumin1002>	START - Cookbook sre.hosts.reimage for host mc1038.eqiad.wmnet with OS bookworm	[production]
10:02	<btullis@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on snapshot1013.eqiad.wmnet with reason: host reimage	[production]
09:58	<ladsgroup@deploy1002>	Finished scap: Backport for [[gerrit:1038243|Stop writing to the old pagelinks columns in s8 (T352010)]] (duration: 18m 39s)	[production]
09:57	<Dreamy_Jazz>	Restarting MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration	[production]
09:56	<btullis@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1013.eqiad.wmnet with reason: host reimage	[production]
09:49	<jiji@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host mc-gp2001.codfw.wmnet with OS bookworm	[production]
09:45	<ladsgroup@deploy1002>	ladsgroup: Continuing with sync	[production]
09:43	<btullis@cumin1002>	START - Cookbook sre.hosts.reimage for host snapshot1013.eqiad.wmnet with OS bullseye	[production]
09:42	<ladsgroup@deploy1002>	ladsgroup: Backport for [[gerrit:1038243|Stop writing to the old pagelinks columns in s8 (T352010)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
09:41	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1039.eqiad.wmnet with OS bookworm	[production]
09:40	<ladsgroup@deploy1002>	Started scap: Backport for [[gerrit:1038243|Stop writing to the old pagelinks columns in s8 (T352010)]]	[production]
09:31	<jiji@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc-gp2001.codfw.wmnet with reason: host reimage	[production]
09:29	<jiji@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc-gp2001.codfw.wmnet with reason: host reimage	[production]
09:25	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1039.eqiad.wmnet with reason: host reimage	[production]
09:22	<jiji@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc1039.eqiad.wmnet with reason: host reimage	[production]
09:10	<jiji@cumin2002>	START - Cookbook sre.hosts.reimage for host mc-gp2001.codfw.wmnet with OS bookworm	[production]
09:10	<jiji@cumin1002>	START - Cookbook sre.hosts.reimage for host mc1039.eqiad.wmnet with OS bookworm	[production]
09:08	<logmsgbot>	@deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
09:08	<logmsgbot>	@deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
09:08	<jiji@cumin1002>	END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['mc1039.eqiad.wmnet']	[production]
08:49	<jiji@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-gp2002.codfw.wmnet with OS bookworm	[production]
08:45	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-gp1003.eqiad.wmnet with OS bookworm	[production]
08:15	<hashar@deploy1002>	Finished deploy [gerrit/gerrit@c93e47d]: Revert Gerrit back to 3.8.6 - T354887 (duration: 00m 05s)	[production]