2020-11-09 §
12:42 <arturo> restarted neutron l3 agent in cloudnet1003 bc it still had the old default route (T265288) [admin]
12:41 <arturo> `root@cloudcontrol1005:~# neutron subnet-delete dcbb0f98-5e9d-4a93-8dfc-4e3ec3c44dcc` (T265288) [admin]
12:40 <arturo> `root@cloudcontrol1005:~# neutron router-gateway-set --fixed-ip subnet_id=7c6bcc12-212f-44c2-9954-5c55002ee371,ip_address=185.15.56.244 cloudinstances2b-gw wan-transport-eqiad` (T265288) [admin]
12:19 <arturo> subnet 185.15.56.240/29 has id 7c6bcc12-212f-44c2-9954-5c55002ee371 in neutron (T265288) [admin]
12:19 <arturo> `root@cloudcontrol1005:~# neutron subnet-create --gateway 185.15.56.241 --name cloud-instances-transport1-b-eqiad1 --ip-version 4 --disable-dhcp wan-transport-eqiad 185.15.56.240/29` (T265288) [admin]
12:15 <arturo> icinga-downtime toolschecker for 2h (T265288) [admin]
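Note on the 2020-11-09 entries: the new transport subnet is 185.15.56.240/29, with gateway 185.15.56.241 and router address 185.15.56.244. The sketch below only checks that address arithmetic with Python's standard `ipaddress` module. The variable names are illustrative and are not part of any neutron tooling.

```python
import ipaddress

# Values copied from the log entries for T265288.
subnet = ipaddress.ip_network("185.15.56.240/29")
gateway = ipaddress.ip_address("185.15.56.241")
router_ip = ipaddress.ip_address("185.15.56.244")

# hosts() excludes the network and broadcast addresses of a /29.
usable = list(subnet.hosts())

print("gateway in subnet:", gateway in subnet)
print("router IP in subnet:", router_ip in subnet)
print("usable range:", usable[0], "to", usable[-1])
```

Both addresses fall inside the subnet's usable range, which is what `--fixed-ip subnet_id=...,ip_address=185.15.56.244` relies on.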
2020-11-02 §
13:36 <arturo> (typo: dcaro) [admin]
13:35 <arturo> added dcar as projectadmin & user (T266068) [admin]
2020-10-29 §
16:57 <bstorm> silenced deployment-prep project alerts for 60 days since the downtime expired [admin]
08:12 <arturo> force-powercycling cloudcephosd1006 [admin]
2020-10-25 §
16:20 <andrewbogott> adding cloudvirt1038 to the 'ceph' aggregate and removing from the 'spare' aggregate. We need this space while waiting on network upgrades for empty cloudvirts (T216195) [admin]
2020-10-23 §
11:30 <arturo> [codfw1dev] openstack --os-project-id cloudinfra-codfw1dev recordset create --type PTR --record nat.cloudgw.codfw1dev.wikimediacloud.org. --description "created by hand" 0-29.57.15.185.in-addr.arpa. 1.0-29.57.15.185.in-addr.arpa. (T261724) [admin]
10:09 <arturo> [codfw1dev] doing DNS changes for the cloudgw PoC, including designate and https://gerrit.wikimedia.org/r/c/operations/dns/+/635965 (T261724) [admin]
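Note on the 11:30 PTR entry: the zone name `0-29.57.15.185.in-addr.arpa.` looks like the classless reverse-delegation naming style (RFC 2317) for 185.15.57.0/29. The sketch below shows how such names can be derived, assuming that convention. The helper names `classless_zone` and `classless_ptr_name` are hypothetical, and the assumption that the record refers to 185.15.57.1 is inferred from the record name only.

```python
import ipaddress


def classless_zone(network: str) -> str:
    """Build an RFC 2317-style zone name such as 0-29.57.15.185.in-addr.arpa."""
    net = ipaddress.ip_network(network)
    o = str(net.network_address).split(".")
    # "<first host octet>-<prefix length>" followed by the reversed /24 octets.
    return f"{o[3]}-{net.prefixlen}.{o[2]}.{o[1]}.{o[0]}.in-addr.arpa."


def classless_ptr_name(address: str, network: str) -> str:
    """Build the PTR owner name for one address inside the classless zone."""
    addr = ipaddress.ip_address(address)
    if addr not in ipaddress.ip_network(network):
        raise ValueError(f"{address} is not inside {network}")
    last_octet = str(addr).split(".")[-1]
    return f"{last_octet}.{classless_zone(network)}"


print(classless_zone("185.15.57.0/29"))
print(classless_ptr_name("185.15.57.1", "185.15.57.0/29"))
```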
2020-10-22 §
10:46 <arturo> [codfw1dev] rebooting cloudinfra-internal-puppetmaster-01.cloudinfra-codfw1dev.codfw1dev.wikimedia.cloud to try fixing some DNS weirdness [admin]
09:43 <arturo> enabling puppet in cloudcontrol1003 (message said "please re-enable after 2020-10-22 06:00UTC") [admin]
2020-10-21 §
14:36 <andrewbogott> running apt-get update && apt-get install -y facter on all cloud-vps instances [admin]
10:31 <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) to test puppet code (T261724) [admin]
08:56 <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) to test puppet code (T261724) [admin]
2020-10-20 §
15:47 <arturo> changing DNS recursor ACLs (https://gerrit.wikimedia.org/r/c/operations/puppet/+/635314) this can be reverted any time if it causes problems (T261724) [admin]
14:49 <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) to test puppet code (T261724) [admin]
2020-10-19 §
01:41 <andrewbogott> deleting all Precise base images [admin]
01:36 <andrewbogott> deleting all unused Jessie base images [admin]
2020-10-18 §
23:26 <andrewbogott> deleting all Trusty base images [admin]
21:50 <andrewbogott> migrating all currently used ceph images to rbd [admin]
2020-10-16 §
09:29 <arturo> [codfw1dev] still some DNS weirdness, investigating [admin]
09:25 <arturo> [codfw1dev] hard-rebooting bastion-codfw1dev-02, seems in bad shape, doesn't even wake up in the virsh console [admin]
09:18 <arturo> [codfw1dev] live-hacked cloudservices2002-dev /etc/powerdns/recursor.conf file to include cloud-codfw1dev-floating CIDR (185.15.57.0/29) while https://gerrit.wikimedia.org/r/c/operations/puppet/+/634050 is in review, so VMs with a floating IP can query the DNS recursor (T261724) [admin]
09:01 <arturo> [codfw1dev] basic network connectivity seems stable after cleaning up everything related to address scopes (T261724) [admin]
2020-10-15 §
15:17 <arturo> [codfw1dev] try cleaning up anything related to address scopes in the neutron database (T261724) [admin]
13:56 <arturo> [codfw1dev] drop neutron l3 agent hacks in cloudnet2002/2003-dev (T261724) [admin]
2020-10-13 §
17:54 <andrewbogott> rebuilding cloudvirt1021 for backy support [admin]
15:22 <andrewbogott> draining cloudvirt1021 so I can rebuild it with backy support [admin]
14:19 <andrewbogott> rebuilding cloudvirt1022 with backy support [admin]
14:03 <andrewbogott> draining cloudvirt1022 so I can rebuild it with backy support [admin]
11:19 <arturo> [codfw1dev] rebooting labtestvirt2003 [admin]
2020-10-09 §
10:15 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack router set --disable-snat cloudinstances2b-gw --external-gateway wan-transport-codfw (T261724) [admin]
09:22 <arturo> [codfw1dev] rebooting cloudnet boxes for bridge and vlan changes (T261724) [admin]
09:12 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet delete 31214392-9ca5-4256-bff5-1e19a35661de (cloud-instances-transport1-b-codfw - 208.80.153.184/29) (T261724) [admin]
09:10 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack router set --external-gateway wan-transport-codfw --fixed-ip subnet=cloud-gw-transport-codfw,ip-address=185.15.57.10 cloudinstances2b-gw (T261724) [admin]
08:49 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet create --network wan-transport-codfw --gateway 185.15.57.9 --no-dhcp --subnet-range 185.15.57.8/30 cloud-gw-transport-codfw (T261724) [admin]
08:47 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet delete a5ab5362-4ffb-4059-9ff7-391e22dcf3bc (T261724) [admin]
2020-10-08 §
16:17 <arturo> [codfw1dev] `root@cloudcontrol2001-dev:~# openstack subnet create --network wan-transport-codfw --gateway 185.15.57.8 --no-dhcp --subnet-range 185.15.57.8/31 cloud-gw-transport-codfw` (with a hack -- see task) (T263622) [admin]
16:03 <arturo> [codfw1dev] briefly live-hacked python3-neutron source code in all 3 cloudcontrol2xxx-dev servers to workaround /31 network definition issue (T263622) [admin]
10:28 <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) (T261724) [admin]
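Note on the 2020-10-08 and 2020-10-09 entries: the transport subnet was first created as 185.15.57.8/31 (gateway .8) and then recreated as 185.15.57.8/30 (gateway .9, router address .10). The sketch below only compares the address layout of the two prefixes using Python's standard `ipaddress` module. It says nothing about the neutron issue itself, which is described in T263622.

```python
import ipaddress

# Prefixes copied from the log entries.
p2p = ipaddress.ip_network("185.15.57.8/31")
transport = ipaddress.ip_network("185.15.57.8/30")

# A /31 has only two addresses; ipaddress treats both as usable hosts
# (point-to-point link style), so there is no separate broadcast address.
p2p_hosts = [str(h) for h in p2p.hosts()]

# A /30 reserves the network and broadcast addresses, leaving two hosts.
transport_hosts = [str(h) for h in transport.hosts()]

print("/31 hosts:", p2p_hosts)
print("/30 hosts:", transport_hosts)
```

With the /30, the gateway (185.15.57.9) and the router address (185.15.57.10) are exactly the two usable hosts.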
2020-10-06 §
21:30 <andrewbogott> moved cloudvirt1013 out of the 'ceph' aggregate and into the 'maintenance' aggregate for T243414 [admin]
21:29 <andrewbogott> draining cloudvirt1013 for upgrade to 10G networking [admin]
14:45 <arturo> icinga downtime every cloud* lab* host for 60 minutes for keystone maintenance [admin]
2020-10-05 §
17:40 <bd808> `service uwsgi-labspuppetbackend restart` on cloud-puppetmaster-03 (T264649) [admin]
2020-10-02 §
11:05 <arturo> [codfw1dev] restarting rabbitmq-server in all 3 control nodes, the l3 agent was misbehaving [admin]
09:16 <arturo> [codfw1dev] trying the labtestvirt2003 (cloudgw) reimage again (T261724) [admin]