production SAL

2022-05-02
13:50	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host aqs2004.codfw.wmnet with OS bullseye	[production]
13:49	<vgutierrez>	rolling upgrade of HAProxy in codfw	[production]
13:48	<elukey@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 18s)	[production]
13:48	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
13:41	<dcaro@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudbackup2001.codfw.wmnet	[production]
13:13	<godog>	start removal of 'tegola-swift-container' and its objects - T307184	[production]
12:58	<kormat@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 9 hosts with reason: Deploying schema change to s2@codfw T303603	[production]
12:58	<kormat@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on 9 hosts with reason: Deploying schema change to s2@codfw T303603	[production]
12:57	<vgutierrez>	rolling upgrade of HAProxy in drmrs	[production]
12:55	<kormat>	dbmaint Deploying schema change to s2@codfw (T303603)	[production]
12:48	<volans>	swapped /srv/deployment directory on deploy1002 with the one from the latest backup - T307349	[production]
12:45	<kormat>	dbmaint Deploying schema change to s2 (T303603)	[production]
12:10	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T298563)', diff saved to https://phabricator.wikimedia.org/P27349 and previous config saved to /var/cache/conftool/dbconfig/20220502-121018-ladsgroup.json	[production]
11:55	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27347 and previous config saved to /var/cache/conftool/dbconfig/20220502-115513-ladsgroup.json	[production]
11:40	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27346 and previous config saved to /var/cache/conftool/dbconfig/20220502-114007-ladsgroup.json	[production]
11:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T298563)', diff saved to https://phabricator.wikimedia.org/P27345 and previous config saved to /var/cache/conftool/dbconfig/20220502-112502-ladsgroup.json	[production]
11:11	<vgutierrez>	rolling upgrade of HAProxy in ulsfo	[production]
11:07	<elukey@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 45s)	[production]
11:06	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
11:04	<elukey@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 05s)	[production]
11:04	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
11:01	<elukey@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 19s)	[production]
11:00	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1168 (T298563)', diff saved to https://phabricator.wikimedia.org/P27344 and previous config saved to /var/cache/conftool/dbconfig/20220502-110041-ladsgroup.json	[production]
11:00	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
11:00	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1168.eqiad.wmnet with reason: Maintenance	[production]
11:00	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db1168.eqiad.wmnet with reason: Maintenance	[production]
11:00	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180 (T298563)', diff saved to https://phabricator.wikimedia.org/P27343 and previous config saved to /var/cache/conftool/dbconfig/20220502-110033-ladsgroup.json	[production]
10:59	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
10:58	<elukey@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 40s)	[production]
10:57	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
10:57	<elukey@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 06s)	[production]
10:57	<elukey@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
10:49	<klausman@deploy1002>	Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 05s)	[production]
10:48	<klausman@deploy1002>	Started deploy [ores/deploy@98a1b2e]: (no justification provided)	[production]
10:47	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1002.eqiad.wmnet	[production]
10:45	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
10:45	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27342 and previous config saved to /var/cache/conftool/dbconfig/20220502-104528-ladsgroup.json	[production]
10:44	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
10:44	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
10:43	<jynus@cumin1001>	START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS bullseye	[production]
10:43	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet	[production]
10:42	<jynus@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1002.eqiad.wmnet with OS bullseye	[production]
10:41	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
10:38	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1002.eqiad.wmnet	[production]
10:34	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet	[production]
10:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27341 and previous config saved to /var/cache/conftool/dbconfig/20220502-103023-ladsgroup.json	[production]
10:24	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1169 (T306560)', diff saved to https://phabricator.wikimedia.org/P27340 and previous config saved to /var/cache/conftool/dbconfig/20220502-102402-ladsgroup.json	[production]
10:19	<klausman@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ores2002.codfw.wmnet with OS buster	[production]
10:18	<klausman@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ores2002.codfw.wmnet with reason: host reimage	[production]
10:15	<klausman@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on ores2002.codfw.wmnet with reason: host reimage	[production]