__all__ SAL

3601-3650 of 10000 results (76ms)

2022-05-06 §
09:40	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1006.eqiad.wmnet	[production]
09:38	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM install4001.wikimedia.org	[production]
09:34	<jmm@cumin2002>	START - Cookbook sre.ganeti.reboot-vm for VM install4001.wikimedia.org	[production]
09:33	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve1006.eqiad.wmnet	[production]
09:33	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM bast4003.wikimedia.org	[production]
09:31	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1005.eqiad.wmnet	[production]
09:29	<jmm@cumin2002>	START - Cookbook sre.ganeti.reboot-vm for VM bast4003.wikimedia.org	[production]
09:27	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2057.codfw.wmnet with OS bullseye	[production]
09:25	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve1005.eqiad.wmnet	[production]
09:23	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1004.eqiad.wmnet	[production]
09:17	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve1004.eqiad.wmnet	[production]
09:11	<joal>	kill cassandra-monthly-wf-local_group_default_T_mediarequest_top_files-2022-4 again	[analytics]
09:08	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1003.eqiad.wmnet	[production]
09:03	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2057.codfw.wmnet with reason: host reimage	[production]
09:02	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve1003.eqiad.wmnet	[production]
09:00	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1002.eqiad.wmnet	[production]
09:00	<mvernon@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2057.codfw.wmnet with reason: host reimage	[production]
08:54	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve1002.eqiad.wmnet	[production]
08:52	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1001.eqiad.wmnet	[production]
08:45	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve1001.eqiad.wmnet	[production]
08:44	<joal>	Rerun cassandra-monthly-wf-local_group_default_T_mediarequest_top_files-2022-4 with SRE watching network	[analytics]
08:29	<joal>	kill cassandra-monthly-wf-local_group_default_T_mediarequest_top_files-2022-4 as it was probably saturating network	[analytics]
08:16	<mvernon@cumin1001>	START - Cookbook sre.hosts.reimage for host ms-be2057.codfw.wmnet with OS bullseye	[production]
07:49	<mvernon@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-be2057.codfw.wmnet with OS bullseye	[production]
07:42	<mvernon@cumin1001>	START - Cookbook sre.hosts.reimage for host ms-be2057.codfw.wmnet with OS bullseye	[production]
07:41	<mvernon@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-be2057.codfw.wmnet with OS bullseye	[production]
07:31	<mvernon@cumin1001>	START - Cookbook sre.hosts.reimage for host ms-be2057.codfw.wmnet with OS bullseye	[production]
07:20	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetboard1002.eqiad.wmnet	[production]
07:19	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host puppetboard1002.eqiad.wmnet	[production]
07:14	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetboard2002.codfw.wmnet	[production]
07:13	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host puppetboard2002.codfw.wmnet	[production]
07:11	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dumpsdata1007.eqiad.wmnet	[production]
07:06	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host dumpsdata1007.eqiad.wmnet	[production]
07:06	<wm-bot>	<samwilson> Updating to version 0.1.0	[tools.docs]
01:51	<dzahn@cumin2002>	conftool action : set/pooled=no; selector: dc=eqiad,name=mw1415.eqiad.wmnet	[production]
01:50	<dzahn@cumin2002>	conftool action : set/pooled=no; selector: dc=codfw,name=mw1415.eqiad.wmnet	[production]
00:46	<rook@cumin1001>	END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host cloudvirt1016.eqiad.wmnet	[production]
00:46	<rook@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudvirt1016.eqiad.wmnet	[production]
2022-05-05 §
22:57	<dduvall>	Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789723	[releng]
22:31	<dduvall>	Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789721	[releng]
22:28	<dduvall>	created 2 new jobs to deploy https://gerrit.wikimedia.org/r/789720	[releng]
22:24	<dduvall>	Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789718	[releng]
22:21	<dduvall>	created 4 new jobs to deploy https://gerrit.wikimedia.org/r/789717	[releng]
22:15	<dduvall>	Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789714	[releng]
22:13	<dduvall>	created 2 new jobs to deploy https://gerrit.wikimedia.org/r/789713	[releng]
22:09	<dduvall>	Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789711	[releng]
22:07	<dduvall>	created 2 new jobs to deploy https://gerrit.wikimedia.org/r/789710	[releng]
22:06	<razzi@cumin1001>	END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka main-eqiad cluster: Reboot kafka nodes	[production]
22:01	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
22:00	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]