production SAL

3351-3400 of 10000 results (80ms)

2023-11-15 §
19:07	<mutante>	aphlict2001 - restart aphlict service after puppet 7 upgrade	[production]
19:05	<jbond@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::virt_ceph	[production]
19:01	<taavi@cumin1001>	START - Cookbook sre.puppet.migrate-host for host cloudlb2001-dev.codfw.wmnet	[production]
19:00	<taavi@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudgw2003-dev.codfw.wmnet	[production]
18:59	<jbond@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::services	[production]
18:59	<dzahn@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host aphlict2001.codfw.wmnet	[production]
18:59	<jbond@cumin1001>	START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::virt_ceph	[production]
18:58	<jbond@cumin1001>	END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: wmcs::openstack::codfw1dev::virt_ceph	[production]
18:56	<cmooney@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS bullseye	[production]
18:54	<jbond@cumin1001>	START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::virt_ceph	[production]
18:54	<jbond@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::net	[production]
18:54	<dzahn@cumin1001>	START - Cookbook sre.puppet.migrate-host for host aphlict2001.codfw.wmnet	[production]
18:54	<taavi@cumin1001>	START - Cookbook sre.puppet.migrate-host for host cloudgw2003-dev.codfw.wmnet	[production]
18:51	<jbond@cumin1001>	START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::services	[production]
18:49	<taavi@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudgw2002-dev.codfw.wmnet	[production]
18:45	<jbond@cumin1001>	START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::net	[production]
18:42	<topranks>	Reset BGP to lvs4010 from cr3-ulsfo to validate new config T350488	[production]
18:41	<taavi@cumin1001>	START - Cookbook sre.puppet.migrate-host for host cloudgw2002-dev.codfw.wmnet	[production]
18:36	<topranks>	remove TTL setting on server-facing BGP peerings on cr3-ulsfo T350488	[production]
18:25	<jbond@cumin1001>	END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::db	[production]
18:16	<bking@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudelastic1010.wikimedia.org with OS bullseye	[production]
18:15	<bking@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudelastic1009.wikimedia.org with OS bullseye	[production]
18:14	<jbond@cumin1001>	START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::db	[production]
18:12	<bking@cumin2002>	START - Cookbook sre.hosts.reimage for host cloudelastic1008.wikimedia.org with OS bullseye	[production]
18:05	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Depooling db1141 (T348183)', diff saved to https://phabricator.wikimedia.org/P53488 and previous config saved to /var/cache/conftool/dbconfig/20231115-180503-arnaudb.json	[production]
18:04	<arnaudb@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1141.eqiad.wmnet with reason: Maintenance	[production]
18:04	<arnaudb@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1141.eqiad.wmnet with reason: Maintenance	[production]
18:01	<jynus>	All restart_daemons were successful	[production]
18:01	<root@cumin2002>	END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw	[production]
17:57	<bking@cumin1001>	START - Cookbook sre.wdqs.data-reload	[production]
17:57	<bking@cumin1001>	END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97)	[production]
17:56	<bking@cumin1001>	START - Cookbook sre.wdqs.data-reload	[production]
17:56	<root@cumin2002>	START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-codfw	[production]
17:52	<inflatador>	bking@wdqs1024 reboot host to hopefully reduce data reload failures T349011	[production]
17:51	<bking@cumin1001>	END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97)	[production]
17:29	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on P{lvs1019,lvs2013} and A:lvs (T349796)	[production]
17:27	<hnowlan@cumin1001>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on P{lvs1019,lvs2013} and A:lvs (T349796)	[production]
17:26	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on P{lvs1020,lvs2014} and A:lvs (T349796)	[production]
17:23	<hnowlan@cumin1001>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on P{lvs1020,lvs2014} and A:lvs (T349796)	[production]
17:19	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on P{lvs1020,lvs2014} and A:lvs (T349796)	[production]
17:18	<hnowlan@cumin1001>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on P{lvs1020,lvs2014} and A:lvs (T349796)	[production]
16:52	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1102.eqiad.wmnet	[production]
16:52	<fabfur@cumin1001>	START - Cookbook sre.hosts.remove-downtime for cp1102.eqiad.wmnet	[production]
16:45	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp1102.eqiad.wmnet	[production]
16:36	<fabfur@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cp1102.eqiad.wmnet	[production]
16:35	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp1102.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
16:25	<fabfur@cumin1001>	START - Cookbook sre.hosts.provision for host cp1102.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
16:25	<elukey>	reload thanos-rule on titan[12]001 to pick up new pyrra generated configs	[production]
16:21	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp1102.eqiad.wmnet with reason: BIOS settings fix	[production]
16:21	<fabfur@cumin1001>	START - Cookbook sre.hosts.downtime for 4:00:00 on cp1102.eqiad.wmnet with reason: BIOS settings fix	[production]