production SAL

3501-3550 of 10000 results (83ms)

2023-02-27 §
17:38	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
17:38	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
17:37	<zabe@deploy1002>	Finished scap: create gucwiki T321880 (duration: 11m 05s)	[production]
17:37	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
17:37	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
17:37	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
17:36	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
17:36	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
17:36	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
17:36	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
17:36	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
17:35	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
17:35	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
17:35	<elukey@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
17:33	<bking@cumin1001>	END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99)	[production]
17:29	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
17:29	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
17:28	<zabe@deploy1002>	zabe: create gucwiki T321880 synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet	[production]
17:26	<zabe@deploy1002>	Started scap: create gucwiki T321880	[production]
17:22	<zabe>	create Wikipedia Wayuu # T321880	[production]
17:12	<dcaro@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1004.eqiad.wmnet with reason: host reimage	[production]
17:09	<dcaro@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1004.eqiad.wmnet with reason: host reimage	[production]
16:54	<dcaro@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudcephosd1004.eqiad.wmnet with OS bullseye	[production]
16:54	<dcaro@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1004.eqiad.wmnet with OS bullseye	[production]
16:47	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker1005.eqiad.wmnet with OS bullseye	[production]
16:46	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2022.codfw.wmnet with OS bullseye	[production]
16:46	<pt1979@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
16:33	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2015.codfw.wmnet with OS bullseye	[production]
16:33	<pt1979@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
16:32	<pt1979@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
16:31	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1005.eqiad.wmnet with reason: host reimage	[production]
16:28	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1005.eqiad.wmnet with reason: host reimage	[production]
16:25	<jgleeson>	payments-wiki updated from c13b8d26 to 871c4e5c	[production]
16:25	<bking@cumin1001>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0)	[production]
16:21	<pt1979@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
16:16	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2022.codfw.wmnet with reason: host reimage	[production]
16:13	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2022.codfw.wmnet with reason: host reimage	[production]
16:08	<dcaro@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudcephosd1004.eqiad.wmnet with OS bullseye	[production]
16:08	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1008.eqiad.wmnet with reason: host reimage	[production]
16:06	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2015.codfw.wmnet with reason: host reimage	[production]
16:06	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1008.eqiad.wmnet with reason: host reimage	[production]
16:02	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2015.codfw.wmnet with reason: host reimage	[production]
16:02	<hashar@deploy1002>	Finished deploy [integration/docroot@cd7c263]: build: Pin PHPUnit to 9.5.28 like in other repos (duration: 00m 12s)	[production]
16:02	<hashar@deploy1002>	Started deploy [integration/docroot@cd7c263]: build: Pin PHPUnit to 9.5.28 like in other repos	[production]
15:58	<root@cumin1001>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['cloudcephosd1004']	[production]
15:56	<elukey@cumin1001>	END (ERROR) - Cookbook sre.ganeti.reimage (exit_code=97) for host ml-etcd2001.codfw.wmnet with OS bullseye	[production]
15:52	<root@cumin1001>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1004']	[production]
15:52	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host wdqs2022.codfw.wmnet with OS bullseye	[production]
15:52	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on ml-etcd2001.codfw.wmnet with reason: etcd cluster upgrade failed, waiting for k8s upgrade	[production]
15:52	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on ml-etcd2001.codfw.wmnet with reason: etcd cluster upgrade failed, waiting for k8s upgrade	[production]