production SAL

401-450 of 10000 results (42ms)

2022-03-22 §
16:29	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1158.eqiad.wmnet with reason: Maintenance	[production]
16:29	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1158.eqiad.wmnet with reason: Maintenance	[production]
16:29	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22972 and previous config saved to /var/cache/conftool/dbconfig/20220322-162904-marostegui.json	[production]
16:27	<razzi@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on karapace1001.eqiad.wmnet with reason: Setting up karapace for the first time	[production]
16:27	<razzi@cumin1001>	START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on karapace1001.eqiad.wmnet with reason: Setting up karapace for the first time	[production]
16:23	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
16:18	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
16:18	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
16:17	<jayme@deploy1002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
16:17	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
16:16	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
16:16	<jayme@deploy1002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
16:13	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P22971 and previous config saved to /var/cache/conftool/dbconfig/20220322-161359-marostegui.json	[production]
16:13	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
16:13	<jayme@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
16:11	<jayme@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
16:09	<btullis@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
16:07	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
16:07	<andrew@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
16:00	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
15:59	<moritzm>	imported jvmquake 1.0.1 for stretch/buster (JDK8) and bullseye (JDK11)	[production]
15:58	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P22970 and previous config saved to /var/cache/conftool/dbconfig/20220322-155854-marostegui.json	[production]
15:56	<btullis@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
15:54	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
15:43	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22969 and previous config saved to /var/cache/conftool/dbconfig/20220322-154349-marostegui.json	[production]
15:33	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
15:29	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet1003.eqiad.wmnet with reason: host reimage	[production]
15:25	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22968 and previous config saved to /var/cache/conftool/dbconfig/20220322-152508-marostegui.json	[production]
15:25	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
15:25	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
15:22	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1174 (re)pooling @ 10%: After reboot', diff saved to https://phabricator.wikimedia.org/P22967 and previous config saved to /var/cache/conftool/dbconfig/20220322-152247-root.json	[production]
15:17	<hashar>	Gerrit 3.3.10 up and running T304226	[production]
15:14	<hashar>	Stopping Gerrit for security update T304226	[production]
15:13	<hashar@deploy1002>	Finished deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit1001 T304226 (duration: 00m 10s)	[production]
15:13	<hashar@deploy1002>	Started deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit1001 T304226	[production]
15:10	<hashar>	Upgrading and starting Gerrit on gerrit2001 (replica)	[production]
15:06	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1003.eqiad.wmnet with OS bullseye	[production]
15:06	<hashar@deploy1002>	Finished deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit2001 T304226 (duration: 00m 12s)	[production]
15:06	<hashar@deploy1002>	Started deploy [gerrit/gerrit@967b0d7]: Gerrit to 3.3.10 on gerrit2001 T304226	[production]
14:48	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1174 (T298557)', diff saved to https://phabricator.wikimedia.org/P22965 and previous config saved to /var/cache/conftool/dbconfig/20220322-144855-marostegui.json	[production]
14:48	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
14:48	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
14:48	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181 (T298557)', diff saved to https://phabricator.wikimedia.org/P22964 and previous config saved to /var/cache/conftool/dbconfig/20220322-144847-marostegui.json	[production]
14:33	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P22963 and previous config saved to /var/cache/conftool/dbconfig/20220322-143341-marostegui.json	[production]
14:18	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P22962 and previous config saved to /var/cache/conftool/dbconfig/20220322-141836-marostegui.json	[production]
13:52	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudgw1002.eqiad.wmnet	[production]
13:46	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudgw1001.eqiad.wmnet	[production]
13:44	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudmetrics1004.eqiad.wmnet	[production]
13:43	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
13:42	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]