201-250 of 395 results (13ms)
2019-10-23 §
09:23 <arturo> cloudvirt1026 reboot ended OK [admin]
09:12 <arturo> rebooting cloudvirt1026 for kernel upgrade [admin]
09:09 <arturo> cloudvirt1025 reboot ended OK [admin]
09:00 <arturo> rebooting cloudvirt1025 for kernel upgrade [admin]
08:51 <arturo> icinga downtime cloudvirt1025/1026 for reboots [admin]
2019-10-18 §
16:01 <arturo> created the `eqiad1.wikimedia.cloud` DNS zone (T235846) [admin]
14:27 <andrewbogott> deleted a bunch of leaked VMS from earlier today from the admin-monitoring project. Fullstack leaks due to an api outage, maybe? [admin]
10:44 <arturo> double max_message_size from 40KB to 80KB in the cloud-admin mailing list. A simple email with a couple of quotes can go over the 40KB limit. [admin]
2019-10-16 §
21:59 <jeh> resync wiki replica tool and user accounts T235697 [admin]
09:40 <arturo> reboot of cloudvirt1030 went fine [admin]
09:28 <arturo> reboot of cloudvirt1029 went fine [admin]
09:28 <arturo> rebooting cloudvirt1030 for kernel updates [admin]
09:12 <arturo> rebooting cloudvirt1029 for kernel updates [admin]
09:11 <arturo> reboot of cloudvirt1028 went fine [admin]
09:00 <arturo> rebooting cloudvirt1028 for kernel updates [admin]
08:56 <arturo> icinga downtime cloudvirt[1028-1030].eqiad.wmnet for 1h for reboots [admin]
2019-10-15 §
13:30 <jeh> creating indexes and views for banwiki T234770 [admin]
2019-10-10 §
18:55 <bd808> Created indexes and views for nqowiki (T230543) [admin]
11:59 <arturo> network switch hardware is down affecting cloudvirt1025/1026 (T227536) VMs are supposed to be online but unreachable [admin]
2019-10-09 §
10:44 <arturo> cloudvirt1013 rebooted well [admin]
10:32 <arturo> cloudvirt1013 is rebooting [admin]
10:32 <arturo> cloudvirt1012 rebooted just fine (very slow, 35 VMs) [admin]
10:20 <arturo> cloudvirt1012 is rebooting [admin]
10:19 <arturo> cloudvirt1009 rebooted just fine (very slow though) [admin]
10:06 <arturo> cloudvirt1009 is rebooting [admin]
10:06 <arturo> cloudvirt1008 rebooted just fine (very slow though) [admin]
09:58 <arturo> cloudvirt1008 is rebooting [admin]
09:52 <arturo> icinga downtime toolschecker, paws, etc for 2h, because cloudvirt reboots [admin]
2019-10-07 §
14:07 <arturo> horizon is disabled for maintenance (T212302) [admin]
14:00 <arturo> starting scheduled maintenance: upgrading eqiad1 from openstack mitaka to newton [admin]
2019-10-02 §
15:23 <arturo> codfw1dev renaming net/subnet objects to a more modern naming scheme T233665 [admin]
12:49 <arturo> codfw1dev delete all floating ip allocations in the deployment for mangling the network config for testing T233665 [admin]
12:47 <arturo> codfw1dev deleting all VMs in the deployment for mangling the network config for testing T233665 [admin]
11:08 <arturo> codfw1dev rebooting cloudnet2002-dev and cloudnet2003-dev for testing T233665 [admin]
10:31 <arturo> codfw1dev: add cloudinstances2b-gw router to the l3 agent in cloudnet2003-dev [admin]
09:59 <arturo> codfw1dev: cleanup leftover "HA port tenant admin" in neutron (ports from missing servers) [admin]
09:46 <arturo> codfw1dev: cleanup leftover neutron agents [admin]
2019-09-30 §
10:21 <arturo> we installed ferm in every VM by mistake. Deleting it and forcing a puppet agent run to try to go back to a clean state. [admin]
09:38 <arturo> downtime toolschecker for 24h [admin]
09:33 <arturo> force update ferm cloud-wide (in all VMs) for T153468 [admin]
2019-08-18 §
10:39 <arturo> rebooting cloudvirt1023 for new interface names configuration [admin]
10:34 <arturo> downtimed cloudvirt1023 for 2 days [admin]
2019-08-05 §
17:17 <bd808> Set downtime on gridengine and kubernetes webservice checks in icinga until 2019-09-02 (flaky tests) [admin]
2019-07-29 §
20:14 <bd808> Restarted maintain-kubeusers on tools-k8s-master-01 (T194859) [admin]
2019-07-25 §
12:32 <arturo> eqiad1/glance: debian-9.9-stretch image deprecates debian-9.8-stretch (T228983) [admin]
09:59 <arturo> (codfw1dev) drop missing glance images (T228972) [admin]
09:32 <arturo> (codfw1dev) deleting a bunch of VMs that were running in now missing hypervisors [admin]
09:31 <arturo> (codfw1dev) deleting a bunch of VMs in ERROR and SHUTDOWN state [admin]
09:27 <arturo> last log entry refers to the codfw1dev deployment [admin]
09:27 <arturo> cleanup `nova service-list` from old hypervisors (labtest*) [admin]