production SAL

2025-06-26
15:34	<fabfur@cumin1002>	conftool action : set/pooled=yes; selector: name=cp7001.magru.wmnet	[production]
15:34	<fabfur>	repooling cp7001 (T397917)	[production]
15:31	<urandom>	bootstrapping Cassandra/sessionstore2004-a — T390514	[production]
15:30	<mvernon@cumin1002>	START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f3	[production]
15:30	<mvernon@cumin1002>	END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f2	[production]
15:29	<sukhe>	sudo cumin 'A:cp' "disable-puppet 'merging CR 1163843'"	[production]
15:27	<eevans@cumin1003>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore2004.codfw.wmnet with OS bullseye	[production]
15:19	<jhancock@cumin1003>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
15:17	<mvernon@cumin1002>	START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f2	[production]
15:17	<mvernon@cumin1002>	END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f1	[production]
15:16	<jhancock@cumin1003>	START - Cookbook sre.dns.netbox	[production]
15:15	<akosiaris@cumin1003>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
15:15	<akosiaris@cumin1003>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Removal of old mw-wikifunctions PTR records - akosiaris@cumin1003"	[production]
15:14	<fabfur@cumin1002>	conftool action : set/pooled=no; selector: name=cp7001.magru.wmnet	[production]
15:12	<fabfur>	temporary disable puppet on cp7001 (T397917)	[production]
15:09	<akosiaris@cumin1003>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Removal of old mw-wikifunctions PTR records - akosiaris@cumin1003"	[production]
15:06	<akosiaris@cumin1003>	START - Cookbook sre.dns.netbox	[production]
15:05	<eevans@cumin1003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore2004.codfw.wmnet with reason: host reimage	[production]
15:05	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply	[production]
15:04	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/services/mobileapps: apply	[production]
15:03	<mvernon@cumin1002>	START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f1	[production]
15:03	<mvernon@cumin1002>	END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f0	[production]
15:01	<eevans@cumin1003>	START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore2004.codfw.wmnet with reason: host reimage	[production]
14:55	<volans>	uploaded debmonitor-server,python3-debmonitor_0.6.1 to apt.wikimedia.org bookworm-wikimedia	[production]
14:49	<dancy@deploy1003>	Installation of scap version "4.182.0" completed for 2 hosts	[production]
14:48	<mvernon@cumin1002>	START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f0	[production]
14:48	<mvernon@cumin1002>	END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.ef	[production]
14:47	<dancy@deploy1003>	Installing scap version "4.182.0" for 2 host(s)	[production]
14:42	<eevans@cumin1003>	START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye	[production]
14:42	<eevans@cumin1003>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2004.codfw.wmnet with OS bullseye	[production]
14:33	<mvernon@cumin1002>	START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.ef	[production]
14:32	<mvernon@cumin1002>	END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.ee	[production]
14:31	<eevans@cumin1003>	START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye	[production]
14:31	<jiji@cumin1003>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc-gp1005.eqiad.wmnet	[production]
14:27	<eevans@cumin1003>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2004.codfw.wmnet with OS bullseye	[production]
14:25	<jiji@cumin1003>	START - Cookbook sre.hosts.reboot-single for host mc-gp1005.eqiad.wmnet	[production]
14:24	<fceratto@cumin1002>	END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db2240 gradually with 4 steps - Pooling in	[production]
14:24	<effie>	restart memcached on mc2038 and mc2039	[production]
14:22	<jmm@cumin1003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on ganeti2022.codfw.wmnet with reason: remove for decom	[production]
14:22	<jiji@deploy1003>	helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply	[production]
14:21	<dancy@deploy1003>	Installation of scap version "4.183.0" completed for 2 hosts	[production]
14:20	<jmm@cumin1003>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2022.codfw.wmnet	[production]
14:19	<dancy@deploy1003>	Installing scap version "4.183.0" for 2 host(s)	[production]
14:18	<jiji@deploy1003>	helmfile [codfw] START helmfile.d/services/mw-experimental: apply	[production]
14:17	<jiji@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply	[production]
14:17	<mvernon@cumin1002>	START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.ee	[production]
14:17	<mvernon@cumin1002>	END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.ed	[production]
14:16	<jiji@deploy1003>	helmfile [eqiad] START helmfile.d/services/mw-experimental: apply	[production]
14:14	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply	[production]
14:14	<jiji@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply	[production]