production SAL

2024-05-29
16:51	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1194.eqiad.wmnet with reason: Maintenance	[production]
16:51	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1194.eqiad.wmnet with reason: Maintenance	[production]
16:50	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1191 (T366123)', diff saved to https://phabricator.wikimedia.org/P63575 and previous config saved to /var/cache/conftool/dbconfig/20240529-165057-marostegui.json	[production]
16:50	<dancy@deploy1002>	Started scap: Backport for [[gerrit:1037018|Revert "Wrap tables with JS" (T330527)]]	[production]
16:40	<jiji@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc2045.codfw.wmnet with OS bookworm	[production]
16:35	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P63574 and previous config saved to /var/cache/conftool/dbconfig/20240529-163549-marostegui.json	[production]
16:35	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main1009.eqiad.wmnet with reason: host reimage	[production]
16:34	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1045.eqiad.wmnet with OS bookworm	[production]
16:32	<sukhe>	restart pybal on lvs1019	[production]
16:32	<jclark@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main1009.eqiad.wmnet with reason: host reimage	[production]
16:29	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host kafka-main1009.eqiad.wmnet with OS bullseye	[production]
16:28	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host kafka-main1010.eqiad.wmnet with OS bullseye	[production]
16:27	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
16:22	<jiji@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2045.codfw.wmnet with reason: host reimage	[production]
16:20	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P63573 and previous config saved to /var/cache/conftool/dbconfig/20240529-162040-marostegui.json	[production]
16:19	<jiji@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc2045.codfw.wmnet with reason: host reimage	[production]
16:18	<ChrisDobbins901_>	sudo cumin -b1 -s60 'A:cp and A:drmrs' 'run-puppet-agent --enable "merging CR 1037089"'	[production]
16:17	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1045.eqiad.wmnet with reason: host reimage	[production]
16:15	<arnaudb@cumin1002>	dbctl commit (dc=all): 'db1163 (re)pooling @ 100%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63572 and previous config saved to /var/cache/conftool/dbconfig/20240529-161522-arnaudb.json	[production]
16:15	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
16:14	<jiji@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc1045.eqiad.wmnet with reason: host reimage	[production]
16:09	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
16:05	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1191 (T366123)', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20240529-160528-marostegui.json	[production]
16:04	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
16:04	<ChrisDobbins901_>	sudo cumin 'A:cp and A:drmrs' 'disable-puppet "merging CR 1037089"'	[production]
16:01	<jiji@cumin1002>	START - Cookbook sre.hosts.reimage for host mc1045.eqiad.wmnet with OS bookworm	[production]
16:01	<jiji@cumin2002>	START - Cookbook sre.hosts.reimage for host mc2045.codfw.wmnet with OS bookworm	[production]
16:00	<arnaudb@cumin1002>	dbctl commit (dc=all): 'db1163 (re)pooling @ 75%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63570 and previous config saved to /var/cache/conftool/dbconfig/20240529-160016-arnaudb.json	[production]
16:00	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance	[production]
16:00	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance	[production]
15:59	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1181 (T366123)', diff saved to https://phabricator.wikimedia.org/P63569 and previous config saved to /var/cache/conftool/dbconfig/20240529-155954-marostegui.json	[production]
15:59	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:56	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:55	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:55	<andrew@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudvirt1041.eqiad.wmnet with OS bookworm	[production]
15:55	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-main1009.eqiad.wmnet with OS bullseye	[production]
15:53	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db2159 (T364299)', diff saved to https://phabricator.wikimedia.org/P63568 and previous config saved to /var/cache/conftool/dbconfig/20240529-155349-marostegui.json	[production]
15:53	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2187.codfw.wmnet with reason: Maintenance	[production]
15:53	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 12:00:00 on db2187.codfw.wmnet with reason: Maintenance	[production]
15:53	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2159.codfw.wmnet with reason: Maintenance	[production]
15:53	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 6:00:00 on db2159.codfw.wmnet with reason: Maintenance	[production]
15:53	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2150 (T364299)', diff saved to https://phabricator.wikimedia.org/P63567 and previous config saved to /var/cache/conftool/dbconfig/20240529-155321-marostegui.json	[production]
15:52	<robh@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cloudvirt1041']	[production]
15:49	<jynus@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on dbprov2003.codfw.wmnet with reason: upgrade to 10.6	[production]
15:49	<jynus@cumin1002>	START - Cookbook sre.hosts.downtime for 5:00:00 on dbprov2003.codfw.wmnet with reason: upgrade to 10.6	[production]
15:49	<jynus@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on dbprov1003.eqiad.wmnet with reason: upgrade to 10.6	[production]
15:49	<jynus@cumin1002>	START - Cookbook sre.hosts.downtime for 5:00:00 on dbprov1003.eqiad.wmnet with reason: upgrade to 10.6	[production]
15:48	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
15:48	<jynus@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2141.codfw.wmnet with reason: upgrade to 10.6	[production]
15:48	<jynus@cumin1002>	START - Cookbook sre.hosts.downtime for 5:00:00 on db2141.codfw.wmnet with reason: upgrade to 10.6	[production]