production SAL

701-750 of 10000 results (44ms)

2022-03-22 §
16:07	<andrew@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
16:00	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
15:59	<moritzm>	imported jvmquake 1.0.1 for stretch/buster (JDK8) and bullseye (JDK11)	[production]
15:58	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P22970 and previous config saved to /var/cache/conftool/dbconfig/20220322-155854-marostegui.json	[production]
15:56	<btullis@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
15:54	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
15:43	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22969 and previous config saved to /var/cache/conftool/dbconfig/20220322-154349-marostegui.json	[production]
15:33	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
15:29	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
15:25	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22968 and previous config saved to /var/cache/conftool/dbconfig/20220322-152508-marostegui.json	[production]
15:25	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
15:25	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
15:22	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1174 (re)pooling @ 10%: After reboot', diff saved to https://phabricator.wikimedia.org/P22967 and previous config saved to /var/cache/conftool/dbconfig/20220322-152247-root.json	[production]
15:17	<hashar>	Gerrit 3.3.10 up and running T304226	[production]
15:14	<hashar>	Stopping Gerrit for security update T304226	[production]
15:13	<hashar@deploy1002>	Finished deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit1001 T304226 (duration: 00m 10s)	[production]
15:13	<hashar@deploy1002>	Started deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit1001 T304226	[production]
15:10	<hashar>	Upgrading and starting Gerrit on gerrit2001 (replica)	[production]
15:06	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
15:06	<hashar@deploy1002>	Finished deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit2001 T304226 (duration: 00m 12s)	[production]
15:06	<hashar@deploy1002>	Started deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit2001 T304226	[production]
14:48	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22965 and previous config saved to /var/cache/conftool/dbconfig/20220322-144855-marostegui.json	[production]
14:48	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
14:48	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
14:48	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181 (T298557)', diff saved to https://phabricator.wikimedia.org/P22964 and previous config saved to /var/cache/conftool/dbconfig/20220322-144847-marostegui.json	[production]
14:33	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P22963 and previous config saved to /var/cache/conftool/dbconfig/20220322-143341-marostegui.json	[production]
14:18	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P22962 and previous config saved to /var/cache/conftool/dbconfig/20220322-141836-marostegui.json	[production]
13:52	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudgw1002.eqiad.wmnet	[production]
13:46	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudgw1001.eqiad.wmnet	[production]
13:44	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudmetrics1004.eqiad.wmnet	[production]
13:43	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
13:42	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
13:42	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
13:41	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1181 (T298557)', diff saved to https://phabricator.wikimedia.org/P22960 and previous config saved to /var/cache/conftool/dbconfig/20220322-134148-marostegui.json	[production]
13:41	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance	[production]
13:41	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1181.eqiad.wmnet with reason: Maintenance	[production]
13:41	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
13:40	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudgw1001.eqiad.wmnet	[production]
13:39	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: all wikis to 1.39.0-wmf.3 refs T300203	[production]
13:36	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudmetrics1004.eqiad.wmnet	[production]
13:35	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudmetrics1003.eqiad.wmnet	[production]
13:33	<aborrero@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudgw2001-dev.codfw.wmnet	[production]
13:31	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
13:30	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
13:30	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
13:29	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
13:27	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudmetrics1003.eqiad.wmnet	[production]
13:27	<jnuche@deploy1002>	Synchronized php: group1 wikis to 1.39.0-wmf.3 refs T300203 (duration: 00m 52s)	[production]
13:26	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: group1 wikis to 1.39.0-wmf.3 refs T300203	[production]
13:26	<aborrero@cumin2002>	START - Cookbook sre.hosts.reboot-single for host cloudgw2001-dev.codfw.wmnet	[production]