production SAL

2024-04-30
18:12	<sukhe@cumin1002>	START - Cookbook sre.hosts.reimage for host cp7014.magru.wmnet with OS bullseye	[production]
18:11	<sukhe@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7014.magru.wmnet with OS bullseye	[production]
18:11	<sukhe@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dns7001.wikimedia.org with OS bookworm	[production]
18:11	<sukhe@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7015.magru.wmnet with OS bullseye	[production]
18:09	<sukhe>	running cookbook -d sre.dns.netbox "test"	[production]
18:06	<sukhe@cumin1002>	START - Cookbook sre.hosts.reimage for host cp7015.magru.wmnet with OS bullseye	[production]
18:04	<sukhe@cumin1002>	START - Cookbook sre.hosts.reimage for host cp7014.magru.wmnet with OS bullseye	[production]
18:03	<sukhe@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "running manually for cp7013 - sukhe@cumin1002"	[production]
18:03	<sukhe@cumin1002>	START - Cookbook sre.hosts.reimage for host dns7001.wikimedia.org with OS bookworm	[production]
18:02	<sukhe@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "running manually for cp7013 - sukhe@cumin1002"	[production]
17:57	<xcollazo@deploy1002>	Started deploy [analytics/refinery@4836095]: Regular analytics weekly train [analytics/refinery@4836095f]	[production]
17:56	<swfrench@deploy1002>	Unlocked for deployment [ALL REPOSITORIES]: etcd replication maintenance - T358636 (duration: 55m 11s)	[production]
17:54	<swfrench-wmf>	putting etcd back in read-write mode for T358636	[production]
17:09	<swfrench-wmf>	disabling etcd replication into conf2005 for T358636	[production]
17:03	<swfrench-wmf>	putting etcd in read-only mode for T358636	[production]
17:01	<swfrench@deploy1002>	Locking from deployment [ALL REPOSITORIES]: etcd replication maintenance - T358636	[production]
16:56	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7004.magru.wmnet with OS bullseye	[production]
16:55	<ejegg>	payments-wiki upgraded from c7ab847d to c4f43931	[production]
16:53	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7013.magru.wmnet with OS bullseye	[production]
16:53	<sukhe@cumin1002>	END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002"	[production]
16:53	<sukhe@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002"	[production]
16:52	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7005.magru.wmnet with OS bullseye	[production]
16:52	<fabfur@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - fabfur@cumin1002"	[production]
16:51	<fabfur@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - fabfur@cumin1002"	[production]
16:36	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7002.magru.wmnet with OS bullseye	[production]
16:33	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7012.magru.wmnet with OS bullseye	[production]
16:33	<sukhe@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002"	[production]
16:30	<sukhe@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002"	[production]
16:30	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7013.magru.wmnet with reason: host reimage	[production]
16:27	<fabfur@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp7003.magru.wmnet with OS bullseye	[production]
16:27	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7004.magru.wmnet with reason: host reimage	[production]
16:25	<sukhe@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp7013.magru.wmnet with reason: host reimage	[production]
16:25	<fabfur@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp7004.magru.wmnet with reason: host reimage	[production]
16:24	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7005.magru.wmnet with reason: host reimage	[production]
16:22	<btullis@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cephosd1003.eqiad.wmnet with OS bullseye	[production]
16:22	<fabfur@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp7005.magru.wmnet with reason: host reimage	[production]
16:18	<stevemunene@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
16:18	<stevemunene@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
16:16	<elukey@puppetmaster1001>	conftool action : set/pooled=true; selector: dnsdisc=inference,name=codfw	[production]
16:16	<elukey@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore1*.eqiad.wmnet: Move to PKI Truststore - elukey@cumin1002	[production]
16:12	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7003.magru.wmnet with reason: host reimage	[production]
16:10	<fabfur@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7002.magru.wmnet with reason: host reimage	[production]
16:09	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
16:08	<sukhe@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host dns7001.wikimedia.org with OS bullseye	[production]
16:07	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7012.magru.wmnet with reason: host reimage	[production]
16:06	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: sync on main	[production]
16:05	<fabfur@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp7003.magru.wmnet with reason: host reimage	[production]
16:05	<fabfur@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp7002.magru.wmnet with reason: host reimage	[production]
16:04	<sukhe@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp7012.magru.wmnet with reason: host reimage	[production]
16:02	<btullis@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cephosd1003.eqiad.wmnet with reason: host reimage	[production]