production SAL

4601-4650 of 10000 results (98ms)

2023-04-13 §
15:46	<andrew@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - andrew@cumin1001"	[production]
15:46	<brett>	Disable Puppet/PyBal on lvs2008 in preparation for reimaging - T321309	[production]
15:44	<andrew@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - andrew@cumin1001"	[production]
15:42	<SandraEbele>	paused Oozie pageview-druid-hourly job.	[production]
15:41	<ebysans@deploy2002>	Started deploy [analytics/refinery@4e8f1ac]: Update druid pageview hourly and daily tables [analytics/refinery@4e8f1ac]	[production]
15:36	<sukhe@cumin2002>	END (FAIL) - Cookbook sre.network.configure-switch-interfaces (exit_code=99) for host lvs2007	[production]
15:36	<sukhe@cumin2002>	START - Cookbook sre.network.configure-switch-interfaces for host lvs2007	[production]
15:33	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirtlocal1002.eqiad.wmnet with reason: host reimage	[production]
15:31	<hnowlan@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/thumbor: apply	[production]
15:31	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirtlocal1001.eqiad.wmnet with reason: host reimage	[production]
15:30	<stevemunene@cumin1001>	START - Cookbook sre.hosts.reimage for host an-worker1132.eqiad.wmnet with OS buster	[production]
15:29	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirtlocal1003.eqiad.wmnet with reason: host reimage	[production]
15:29	<SandraEbele>	deploying analytics refinery-update pageview druid table	[production]
15:25	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirtlocal1001.eqiad.wmnet with reason: host reimage	[production]
15:25	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirtlocal1002.eqiad.wmnet with reason: host reimage	[production]
15:25	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirtlocal1003.eqiad.wmnet with reason: host reimage	[production]
15:25	<hnowlan@deploy2002>	helmfile [eqiad] START helmfile.d/services/thumbor: apply	[production]
15:24	<otto@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply	[production]
15:24	<otto@deploy2002>	helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply	[production]
15:23	<stevemunene@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1132.eqiad.wmnet with OS buster	[production]
15:22	<otto@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply	[production]
15:22	<otto@deploy2002>	helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply	[production]
15:19	<otto@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply	[production]
15:19	<otto@deploy2002>	helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply	[production]
15:17	<claime>	cxserver migrated to mw-api-int on kubernetes, take three - T334204	[production]
15:14	<cgoubert@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/cxserver: apply	[production]
15:13	<cgoubert@deploy2002>	helmfile [eqiad] START helmfile.d/services/cxserver: apply	[production]
15:13	<cgoubert@deploy2002>	helmfile [codfw] DONE helmfile.d/services/cxserver: apply	[production]
15:13	<kamila@deploy2002>	helmfile [codfw] DONE helmfile.d/services/thumbor: apply	[production]
15:13	<moritzm>	remove runc packages installed on mw1349-mw1436, these were once used for a load test with dragonfly and are no longer needed	[production]
15:12	<cgoubert@deploy2002>	helmfile [codfw] START helmfile.d/services/cxserver: apply	[production]
15:11	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
15:10	<claime>	Migrating cxserver to mw-api-int on kubernetes, take three - T334204	[production]
15:10	<kamila@deploy2002>	helmfile [codfw] START helmfile.d/services/thumbor: apply	[production]
15:09	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
15:09	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1003.eqiad.wmnet with OS bullseye	[production]
15:07	<cgoubert@deploy2002>	helmfile [staging] DONE helmfile.d/services/cxserver: apply	[production]
15:06	<cgoubert@deploy2002>	helmfile [staging] START helmfile.d/services/cxserver: apply	[production]
15:06	<cgoubert@deploy2002>	helmfile [staging] DONE helmfile.d/services/cxserver: apply	[production]
15:05	<cgoubert@deploy2002>	helmfile [staging] START helmfile.d/services/cxserver: apply	[production]
15:04	<moritzm>	installing unbound security updates on buster	[production]
15:03	<cgoubert@deploy2002>	helmfile [staging] DONE helmfile.d/services/cxserver: apply	[production]
15:03	<cgoubert@deploy2002>	helmfile [staging] START helmfile.d/services/cxserver: apply	[production]
15:00	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirtlocal1003.eqiad.wmnet with OS bullseye	[production]
14:49	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
14:41	<cgoubert@deploy2002>	helmfile [staging] DONE helmfile.d/services/cxserver: apply	[production]
14:39	<cgoubert@deploy2002>	helmfile [staging] START helmfile.d/services/cxserver: apply	[production]
14:36	<cgoubert@deploy2002>	helmfile [staging] DONE helmfile.d/services/cxserver: apply	[production]
14:36	<cgoubert@deploy2002>	helmfile [staging] START helmfile.d/services/cxserver: apply	[production]
14:26	<sukhe>	restart pybal on lvs2007 to pick up bgp-med change CR 908552	[production]