production SAL

301-350 of 10000 results (104ms)

2024-08-07 §
15:15	<kevinbazira@deploy1003>	helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' .	[production]
14:58	<sukhe>	start pybal on lvs3008	[production]
14:53	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs3008.esams.wmnet	[production]
14:50	<sukhe@cumin1002>	START - Cookbook sre.hosts.reboot-single for host lvs3008.esams.wmnet	[production]
14:33	<elukey@cumin1002>	START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-eqiad: Openjdk upgrade - elukey@cumin1002	[production]
14:26	<jnuche@deploy1003>	Finished deploy [releng/jenkins-deploy@9b733de] (releasing): (no justification provided) (duration: 01m 12s)	[production]
14:25	<jnuche@deploy1003>	Started deploy [releng/jenkins-deploy@9b733de] (releasing): (no justification provided)	[production]
14:24	<sukhe>	sudo cumin "lvs3008*" 'disable-puppet "rebooting" && systemctl stop pybal.service'	[production]
14:22	<jnuche@deploy1003>	Finished deploy [releng/jenkins-deploy@9b733de] (releasing): (no justification provided) (duration: 00m 53s)	[production]
14:21	<jnuche@deploy1003>	Started deploy [releng/jenkins-deploy@9b733de] (releasing): (no justification provided)	[production]
14:04	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:03	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:01	<elukey>	import Jenkins 2.462.1 on bullseye-wikimedia:thirdparty/ci	[production]
13:55	<sukhe>	start pybal on lvs3009	[production]
13:54	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs3009.esams.wmnet	[production]
13:51	<sukhe@cumin1002>	START - Cookbook sre.hosts.reboot-single for host lvs3009.esams.wmnet	[production]
13:46	<dcaro@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1037.eqiad.wmnet with OS bullseye	[production]
13:43	<hnowlan@deploy1003>	Finished scap: sync to test mw-jobrunner resource increase (duration: 02m 22s)	[production]
13:42	<hnowlan@deploy1003>	Started scap sync-world: sync to test mw-jobrunner resource increase	[production]
13:39	<filippo@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-jobrunner: apply	[production]
13:39	<filippo@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-jobrunner: apply	[production]
13:39	<filippo@deploy1003>	helmfile [codfw] DONE helmfile.d/services/mw-jobrunner: apply	[production]
13:38	<filippo@deploy1003>	helmfile [codfw] START helmfile.d/services/mw-jobrunner: apply	[production]
13:31	<hashar>	UTC afternoon backport window is completed	[production]
13:28	<dcaro@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1037.eqiad.wmnet with reason: host reimage	[production]
13:28	<hashar@deploy1003>	Finished scap: Backport for [[gerrit:1060415\|Turn on Parsoid support for Kartographer on Wikivoyage (T371823)]] (duration: 17m 26s)	[production]
13:27	<sukhe>	sudo cumin "lvs3009*" 'disable-puppet "rebooting" && systemctl stop pybal.service'	[production]
13:26	<dcaro@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1037.eqiad.wmnet with reason: host reimage	[production]
13:24	<elukey@cumin1002>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons.	[production]
13:23	<hashar@deploy1003>	cscott, hashar: Continuing with sync	[production]
13:22	<sukhe@cumin1002>	END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on P{cp3073*} and A:cp for 9.2.5-1wm2	[production]
13:18	<sukhe@cumin1002>	START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on P{cp3073*} and A:cp for 9.2.5-1wm2	[production]
13:18	<hashar>	stashbot got restarted since it was not processing anything	[production]
13:17	<elukey@cumin1002>	START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons.	[production]
13:15	<elukey@cumin1002>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons.	[production]
13:15	<hashar>	Restarted CI Jenkins	[production]
08:31	<elukey>	openjdk-11 upgrades for bullseye rolled out to prod	[production]
08:18	<jnuche@deploy1003>	rebuilt and synchronized wikiversions files: group1 to 1.43.0-wmf.17 refs T366962	[production]
07:44	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "add role to mgmt devices - ayounsi@cumin1002"	[production]
07:43	<ayounsi@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "add role to mgmt devices - ayounsi@cumin1002"	[production]
03:02	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1296.eqiad.wmnet with OS bullseye	[production]
02:21	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wikikube-worker1296.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
02:21	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host wikikube-worker1296.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
02:06	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wikikube-worker1285.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
02:04	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host wikikube-worker1285.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
02:02	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1288.eqiad.wmnet with OS bullseye	[production]
02:02	<jclark@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
02:02	<jclark@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
01:57	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1294.eqiad.wmnet with OS bullseye	[production]
01:57	<jclark@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]