admin SAL

2024-01-07 §
19:34	<andrewbogott>	removed cloudvirt1063 from 'ceph' aggregate, added to 'maintenance' aggregate T353408	[admin]
19:34	<andrewbogott>	evacuating all VMs from cloudvirt1063. T353408	[admin]
2024-01-02 §
16:18	<andrew@cloudcumin1001>	END (FAIL) - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (exit_code=99) (T353408)	[admin]
16:18	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (T353408)	[admin]
10:22	<wm-bot2>	fran@wmf3169 END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99)	[admin]
10:22	<wm-bot2>	fran@wmf3169 START - Cookbook wmcs.openstack.cloudvirt.vm_console	[admin]
2023-12-31 §
21:39	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
21:35	<andrewbogott>	running openstack service restart cookbook in eqiad1 in response to a bunch of service down alerts	[admin]
21:34	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
2023-12-21 §
16:51	<dhinus>	puppet node deactivate cloudvirt1063.eqiad.wmnet T353406	[admin]
03:01	<andrewbogott>	restarting mariadb on cloudcontrol1005, hoping to get Galera back in sync	[admin]
2023-12-20 §
19:13	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
19:08	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
2023-12-18 §
17:29	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
17:23	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
17:23	<andrewbogott>	restarting all eqiad1 openstack services after a rabbitmq upgrade/rebuild for T353646	[admin]
15:29	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
15:25	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
2023-12-15 §
13:13	<dcaro>	restarted nova-fullstack on codfw as it was stuck (and alerting through stale prometheus file)	[admin]
2023-12-14 §
00:27	<andrew@cloudcumin1001>	END (FAIL) - Cookbook wmcs.openstack.cloudvirt.set_maintenance (exit_code=99)	[admin]
00:26	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.cloudvirt.set_maintenance	[admin]
00:25	<andrewbogott>	evacuating hosts from cloudvirt1063 and depooling. T353406	[admin]
2023-12-13 §
16:38	<taavi@cloudcumin1001>	END (FAIL) - Cookbook wmcs.toolforge.scale_grid_exec (exit_code=99)	[admin]
2023-12-12 §
21:01	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
20:55	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
17:45	<taavi@cloudcumin1001>	END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker role in the tools cluster	[admin]
17:45	<taavi@cloudcumin1001>	Added a new k8s worker tools-k8s-worker-100.tools.eqiad1.wikimedia.cloud to the cluster	[admin]
16:11	<taavi@cloudcumin1001>	END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker role in the tools cluster	[admin]
16:11	<taavi@cloudcumin1001>	Added a new k8s worker tools-k8s-worker-99.tools.eqiad1.wikimedia.cloud to the cluster	[admin]
15:49	<taavi@cloudcumin1001>	END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker role in the tools cluster	[admin]
15:49	<taavi@cloudcumin1001>	Added a new k8s worker tools-k8s-worker-98.tools.eqiad1.wikimedia.cloud to the cluster	[admin]
2023-12-10 §
18:37	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
18:33	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
18:30	<andrew@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
18:27	<andrewbogott>	restarting all openstack API servers, hoping to make things a bit more responsive	[admin]
18:25	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
2023-12-08 §
12:00	<wm-bot2>	dcaro@urcuchillay END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on etcd-discovery-1.cloudinfra-codfw1dev.codfw1dev.wikimedia.cloud (T353055)	[admin]
11:58	<wm-bot2>	dcaro@urcuchillay START - Cookbook wmcs.vps.refresh_puppet_certs on etcd-discovery-1.cloudinfra-codfw1dev.codfw1dev.wikimedia.cloud (T353055)	[admin]
11:57	<wm-bot2>	dcaro@urcuchillay END (ERROR) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=97) on etcd-discovery-1.cloudinfra-codfw1dev.codfw1dev.wikimedia.cloud	[admin]
11:57	<wm-bot2>	dcaro@urcuchillay START - Cookbook wmcs.vps.refresh_puppet_certs on etcd-discovery-1.cloudinfra-codfw1dev.codfw1dev.wikimedia.cloud	[admin]
09:38	<wm-bot2>	dcaro@urcuchillay END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) (T345084)	[admin]
09:32	<dcaro>	restarting nova and keystone as they are getting too slow (T345084)	[admin]
09:32	<wm-bot2>	dcaro@urcuchillay START - Cookbook wmcs.openstack.restart_openstack (T345084)	[admin]
09:32	<wm-bot2>	dcaro@urcuchillay END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) (T345084)	[admin]
09:31	<wm-bot2>	dcaro@urcuchillay START - Cookbook wmcs.openstack.restart_openstack (T345084)	[admin]
2023-12-07 §
13:12	<dcaro>	rebooting cloudcephosd1001 to make sure puppet7 migration went ok	[admin]
2023-12-04 §
00:07	<andrewbogott>	rebooting cloudcontrol1006 to recover from full disk error	[admin]
2023-12-03 §
09:05	<taavi@cloudcumin1001>	END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0)	[admin]
09:05	<taavi@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack	[admin]
2023-12-02 §
12:27	<taavi>	powercycle cloudvirt1063 T352595	[admin]