production SAL

51-100 of 10000 results (64ms)

2023-09-15 §
15:24	<eevans@cumin1001>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching restbase10[18,25-27,33].eqiad.wmnet: Maybe pickup missed topology changes — T331713 - eevans@cumin1001	[production]
15:03	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply	[production]
15:03	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply	[production]
14:58	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:57	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:38	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:38	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:35	<eevans@cumin1001>	START - Cookbook sre.cassandra.roll-restart for nodes matching restbase10[18,25-27,33].eqiad.wmnet: Maybe pickup missed topology changes — T331713 - eevans@cumin1001	[production]
14:35	<urandom>	rolling Cassandra restart, RESTBase/eqiad/row-D — T331713	[production]
14:34	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:33	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:33	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:33	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2006-dev.codfw.wmnet with OS bookworm	[production]
14:32	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:32	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
14:31	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2005-dev.codfw.wmnet with OS bookworm	[production]
14:29	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt2004-dev.codfw.wmnet with OS bookworm	[production]
14:27	<aborrero@cumin1001>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudvirt2006-dev	[production]
14:27	<aborrero@cumin1001>	START - Cookbook sre.network.configure-switch-interfaces for host cloudvirt2006-dev	[production]
14:26	<aborrero@cumin1001>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudvirt2005-dev	[production]
14:26	<aborrero@cumin1001>	START - Cookbook sre.network.configure-switch-interfaces for host cloudvirt2005-dev	[production]
14:25	<aborrero@cumin1001>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudvirt2004-dev	[production]
14:24	<aborrero@cumin1001>	START - Cookbook sre.network.configure-switch-interfaces for host cloudvirt2004-dev	[production]
14:06	<cgoubert@cumin1001>	conftool action : set/pooled=yes; selector: name=mw2444.codfw.wmnet	[production]
14:05	<claime>	repooling mw2444.codfw.wmnet - T345884	[production]
13:51	<slyngshede@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idm-test1001.wikimedia.org	[production]
13:47	<slyngshede@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM idm-test1001.wikimedia.org	[production]
13:46	<slyngshede@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idm-test1001.wikimedia.org	[production]
13:41	<slyngshede@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM idm-test1001.wikimedia.org	[production]
13:41	<slyngshede@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host idm-test1001.wikimedia.org with OS bookworm	[production]
13:40	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage	[production]
13:38	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt2006-dev.codfw.wmnet with reason: host reimage	[production]
13:38	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cloudvirt2005-dev.codfw.wmnet with reason: host reimage	[production]
13:35	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2005-dev.codfw.wmnet with reason: host reimage	[production]
13:35	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2004-dev.codfw.wmnet with reason: host reimage	[production]
13:35	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt2006-dev.codfw.wmnet with reason: host reimage	[production]
13:19	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt2004-dev.codfw.wmnet with OS bookworm	[production]
13:19	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt2005-dev.codfw.wmnet with OS bookworm	[production]
13:19	<slyngshede@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on idm-test1001.wikimedia.org with reason: host reimage	[production]
13:19	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt2006-dev.codfw.wmnet with OS bookworm	[production]
13:16	<slyngshede@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on idm-test1001.wikimedia.org with reason: host reimage	[production]
13:03	<slyngshede@cumin1001>	START - Cookbook sre.hosts.reimage for host idm-test1001.wikimedia.org with OS bookworm	[production]
13:01	<akosiaris@deploy1002>	Synchronized docroot: (no justification provided) (duration: 08m 20s)	[production]
12:50	<topranks>	changing ECMP hasing algorithm on drmrs, esams and cloud switches T339852	[production]
12:27	<topranks>	changing ECMP hasing algorithm on asw1-b12-drmrs T339852	[production]
11:54	<_joe_>	updated etcd-mirror to 0.0.10 everywhere	[production]
11:37	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1138.eqiad.wmnet with OS bullseye	[production]
11:12	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1138.eqiad.wmnet with reason: host reimage	[production]
11:09	<stevemunene@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1138.eqiad.wmnet with reason: host reimage	[production]
10:56	<stevemunene@cumin1001>	START - Cookbook sre.hosts.reimage for host an-worker1138.eqiad.wmnet with OS bullseye	[production]