2020-02-18 §
21:02 <andrewbogott> adding .eqiad1.wikimedia.cloud records to all existing eqiad1 VMs, updating all eqiad1 internal pointer records to reference the new eqiad1.wikimedia.cloud fqdns. [admin]
09:44 <arturo> deleted DNS zone wmcloud.org and trying to re-create it [admin]
2020-02-14 §
10:35 <arturo> running `root@cloudcontrol2001-dev:~# designate server-create --name ns1.openstack.codfw1dev.wikimediacloud.org.` (T243766) [admin]
10:32 <arturo> running `root@cloudcontrol1004:~# designate server-create --name ns1.openstack.eqiad1.wikimediacloud.org.` (T243766) [admin]
10:32 <arturo> running `root@cloudcontrol1004:~# designate server-create --name ns0.openstack.eqiad1.wikimediacloud.org.` (T243766) [admin]
2020-02-12 §
13:38 <arturo> [codfw1dev] add reference to subnetpool to the instance subnet `MariaDB [neutron]> update subnets set subnetpool_id='d129650d-d4be-4fe1-b13e-6edb5565cb4a' where id = '7adfcebe-b3d0-4315-92fe-e8365cc80668';` (T244851) [admin]
2020-02-11 §
13:46 <arturo> [codfw1dev] creating some neutron objects to investigate T244851 (subnets, subnet pools, address scopes, ...) [admin]
12:40 <arturo> [codfw1dev] delete unknown address scope 'wmcs-v4-scope': `root@cloudcontrol2001-dev:~# openstack address scope delete 078cfd71-117b-4aac-9197-6ebbbb7dd3de` (T244851) [admin]
12:40 <arturo> [codfw1dev] delete unknown subnet pool 'cloudinstancesb-v4-pool0': `root@cloudcontrol2001-dev:~# openstack subnet pool delete d23a9b88-5c3d-4a53-ab88-053233a75365` (T244851) [admin]
2020-02-07 §
18:11 <jeh> shutdown cloudvirt1016 for hardware maintenance T241882 [admin]
2020-02-06 §
14:44 <jeh> update apt packages on cloudvirt1015 T220853 [admin]
14:28 <jeh> run hardware tests on cloudvirt1015 T220853 [admin]
2020-01-28 §
17:24 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# designate server-create --name ns0.openstack.codfw1dev.wikimediacloud.org. (T243766) [admin]
10:18 <arturo> [codfw1dev] created DNS record `bastion-codfw1dev-01.codfw1dev.wmcloud.org A 185.15.57.2` (T242976, T229441) [admin]
10:13 <arturo> [codfw1dev] the zone `codfw1dev.wmcloud.org` belongs now to the `cloudinfra-codfw1dev` project (T242976) [admin]
10:11 <arturo> [codfw1dev] `root@cloudcontrol2001-dev:~# openstack zone create --description "main DNS domain for public addresses" --email "root@wmflabs.org" --type PRIMARY --ttl 3600 codfw1dev.wmcloud.org.` (T242976 and T243766) [admin]
09:53 <arturo> restart apache2 in labweb1001/1002 because of horizon errors [admin]
09:47 <arturo> created DNS zone wmcloud.org in eqiad1 and transferred it to the cloudinfra project (T242976); right now its only use is to delegate the codfw1dev.wmcloud.org subdomain to designate in the other deployment [admin]
2020-01-27 §
12:45 <arturo> [codfw1dev] manually move the new domain to the `cloudinfra-codfw1dev` project clouddb2001-dev: `[designate]> update zones set tenant_id='cloudinfra-codfw1dev' where id = '4c75410017904858a5839de93c9e8b3d';` T243556 [admin]
12:44 <arturo> [codfw1dev] `root@cloudcontrol2001-dev:~# openstack zone create --description "main DNS domain for VMs" --email "root@wmflabs.org" --type PRIMARY --ttl 3600 codfw1dev.wikimedia.cloud.` T243556 [admin]
2020-01-24 §
15:10 <jeh> remove icinga downtime for cloudvirt1013 T241313 [admin]
12:52 <arturo> repooling cloudvirt1013 after HW got fixed (T241313) [admin]
2020-01-21 §
17:43 <bstorm_> remounting /mnt/nfs/dumps-labstore1007.wikimedia.org/ on all dumps-mounting projects [admin]
10:24 <arturo> running `sudo systemctl restart apache2.service` in both labweb servers to try mitigating T240852 [admin]
2020-01-15 §
16:59 <bd808> Changed the config for cloud-announce mailing list so that list admins do not get bounce unsubscribe notices [admin]
2020-01-14 §
14:03 <arturo> icinga downtime all cloudvirts for another 2h for fixing some icinga checks [admin]
12:04 <arturo> icinga downtime toolchecker for 2 hours for openstack upgrades T241347 [admin]
12:02 <arturo> icinga downtime cloud* labs* hosts for 2 hours for openstack upgrades T241347 [admin]
04:26 <andrewbogott> upgrading designate on cloudservices1003/1004 [admin]
2020-01-13 §
13:34 <arturo> [codfw1dev] prevent neutron from allocating floating IPs from the wrong subnet by doing `neutron subnet-update --allocation-pool start=208.80.153.190,end=208.80.153.190 cloud-instances-transport1-b-codfw` (T242594) [admin]
2020-01-10 §
13:27 <arturo> cloudvirt1009: virsh undefine i-000069b6. This is tools-elastic-01 which is running on cloudvirt1008 (so, leaked on cloudvirt1009) [admin]
2020-01-09 §
11:12 <arturo> running `MariaDB [nova_eqiad1]> update quota_usages set in_use='0' where project_id='etytree';` (T242332) [admin]
11:11 <arturo> running `MariaDB [nova_eqiad1]> select * from quota_usages where project_id = 'etytree';` (T242332) [admin]
10:32 <arturo> ran `root@cloudcontrol1004:~# nova-manage project quota_usage_refresh --project etytree` [admin]
2020-01-08 §
10:53 <arturo> icinga downtime all cloudvirts for 30 minutes to re-create all canary VMs [admin]
2020-01-07 §
11:12 <arturo> icinga-downtime everything cloud* for 30 minutes to merge nova scheduler changes [admin]
10:02 <arturo> icinga downtime cloudvirt1009 for 30 minutes to re-create canary VM (T242078) [admin]
2020-01-06 §
13:45 <andrewbogott> restarting nova-api and nova-conductor on cloudcontrol1003 and 1004 [admin]
2020-01-04 §
16:34 <arturo> icinga downtime cloudvirt1024 for 2 months because of hardware errors (T241884) [admin]
2019-12-31 §
11:46 <andrewbogott> I couldn't! [admin]
11:39 <andrewbogott> restarting cloudservices2002-dev to see if I can reproduce an issue I saw earlier [admin]
2019-12-25 §
10:13 <arturo> icinga downtime for 30 minutes the whole cloud* lab* fleet to merge https://gerrit.wikimedia.org/r/c/operations/puppet/+/560575 (will restart some openstack components) [admin]
2019-12-24 §
15:13 <arturo> icinga downtime all the lab* fleet for nova password change for 1h [admin]
14:39 <arturo> icinga downtime all the cloud* fleet for nova password change for 1h [admin]
2019-12-23 §
11:13 <arturo> enable puppet in cloudcontrol1003/1004 [admin]
10:40 <arturo> disable puppet in cloudcontrol1003/1004 while doing changes related to python-ldap [admin]
2019-12-22 §
23:48 <andrewbogott> restarting nova-conductor and nova-api on cloudcontrol1003 and 1004 [admin]
09:45 <arturo> cloudvirt1013 is back (did it alone) T241313 [admin]
09:37 <arturo> cloudvirt1013 is down for good. Apparently powered off. I can't even reach it via iLO [admin]
2019-12-20 §
12:43 <arturo> icinga downtime cloudmetrics1001 for 128 hours [admin]