production SAL

5101-5150 of 10000 results (128ms)

2024-06-28 §
19:35	<jclark@cumin1002>	START - Cookbook sre.network.configure-switch-interfaces for host dbproxy1029	[production]
19:34	<jclark@cumin1002>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dbproxy1028	[production]
19:33	<jclark@cumin1002>	START - Cookbook sre.network.configure-switch-interfaces for host dbproxy1028	[production]
19:31	<jclark@cumin1002>	START - Cookbook sre.dns.netbox	[production]
19:31	<jclark@cumin1002>	END (ERROR) - Cookbook sre.dns.netbox (exit_code=97)	[production]
19:31	<sukhe>	sudo cumin -b10 "A:cp-text" "run-puppet-agent --enable 'dont enable'": T368645	[production]
19:30	<jclark@cumin1002>	START - Cookbook sre.dns.netbox	[production]
18:22	<sukhe>	disable puppet on A:cp-text	[production]
18:16	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply	[production]
18:16	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply	[production]
18:05	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
18:05	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:43	<hnowlan@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host wikikube-worker2026.codfw.wmnet with OS bullseye	[production]
16:36	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
16:36	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
16:33	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
16:23	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
16:19	<aikochou@deploy1002>	helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' .	[production]
16:03	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on mw2300.codfw.wmnet with reason: Reimaging issues	[production]
16:03	<hnowlan@cumin1002>	START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on mw2300.codfw.wmnet with reason: Reimaging issues	[production]
15:45	<hnowlan@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker2026.codfw.wmnet with OS bullseye	[production]
15:43	<cmooney@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
15:43	<cmooney@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix entries for wikikube-worker2026 - cmooney@cumin1002"	[production]
15:35	<cmooney@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix entries for wikikube-worker2026 - cmooney@cumin1002"	[production]
15:32	<cmooney@cumin1002>	START - Cookbook sre.dns.netbox	[production]
15:25	<hnowlan@cumin1002>	conftool action : set/weight=10:pooled=yes; selector: name=(wikikube-worker2025.codfw.wmnet\|wikikube-worker2027.codfw.wmnet\|wikikube-worker2028.codfw.wmnet\|wikikube-worker2029.codfw.wmnet),cluster=kubernetes,service=kubesvc	[production]
15:22	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
15:21	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/commons-impact-analytics: apply	[production]
15:21	<andrewbogott>	upgraded wikitech-static to 1_42 and php 8.3	[production]
15:14	<hnowlan>	homer 'crcodfw' commit 'T351074'	[production]
15:14	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2027.codfw.wmnet with OS bullseye	[production]
15:12	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
15:11	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/commons-impact-analytics: apply	[production]
15:11	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
15:10	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2028.codfw.wmnet with OS bullseye	[production]
15:10	<cgoubert@cumin1002>	conftool action : set/weight=10:pooled=yes; selector: name=(wikikube-worker1027.eqiad.wmnet\|wikikube-worker1028.eqiad.wmnet\|wikikube-worker1029.eqiad.wmnet\|wikikube-worker1030.eqiad.wmnet\|wikikube-worker1031.eqiad.wmnet),cluster=kubernetes,service=kubesvc	[production]
15:10	<claime>	Pooling and uncordoning wikikube-worker1027.eqiad.wmnet,wikikube-worker1028.eqiad.wmnet,wikikube-worker1029.eqiad.wmnet,wikikube-worker1030.eqiad.wmnet,wikikube-worker1031.eqiad.wmnet - T351074	[production]
15:09	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2029.codfw.wmnet with OS bullseye	[production]
15:07	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2025.codfw.wmnet with OS bullseye	[production]
15:06	<jhathaway>	mx-in1001 postfix mx testing complete	[production]
15:04	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/commons-impact-analytics: apply	[production]
15:00	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
15:00	<claime>	homer 'creqiad' commit 'T351074'	[production]
14:56	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2027.codfw.wmnet with reason: host reimage	[production]
14:54	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/commons-impact-analytics: apply	[production]
14:52	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2028.codfw.wmnet with reason: host reimage	[production]
14:50	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2029.codfw.wmnet with reason: host reimage	[production]
14:47	<hnowlan@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2025.codfw.wmnet with reason: host reimage	[production]
14:46	<hnowlan@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2029.codfw.wmnet with reason: host reimage	[production]
14:46	<hnowlan@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2028.codfw.wmnet with reason: host reimage	[production]