1001-1050 of 10000 results (18ms)
2020-10-09 §
12:20 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
12:20 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:16 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:15 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
12:13 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
11:38 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
11:16 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
11:15 <elukey> bootstrap the Analytics Hadoop test cluster [analytics]
11:13 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
11:13 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . [production]
10:52 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'mathoid' for release 'staging' . [production]
10:41 <gehel@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
10:17 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
10:17 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
10:16 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
10:15 <arturo> [codfwd1ev] root@cloudcontrol2001-dev:~# openstack router set --disable-snat cloudinstances2b-gw --external-gateway wan-transport-codfw (T261724) [admin]
10:11 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
10:11 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
09:55 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
09:53 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
09:47 <elukey> roll restart of hadoop-yarn-nodemanager on all hadoop workers to pick up new settings [analytics]
09:47 <elukey> roll restart of hadoop-yarn-nodemanager on all hadoop workers to pick up new settings [production]
09:38 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
09:38 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' . [production]
09:32 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:32 <kormat@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:22 <arturo> [codfwd1dev] rebooting cloudnet boxes for bridge and vlan changes (T261724) [admin]
09:12 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet delete 31214392-9ca5-4256-bff5-1e19a35661de (cloud-instances-transport1-b-codfw - 208.80.153.184/29) (T261724) [admin]
09:10 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack router set --external-gateway wan-transport-codfw --fixed-ip subnet=cloud-gw-transport-codfw,ip-address=185.15.57.10 cloudinstances2b-gw (T261724) [admin]
09:07 <XioNoX> remove user from all network devices [production]
08:49 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet create --network wan-transport-codfw --gateway 185.15.57.9 --no-dhcp --subnet-range 185.15.57.8/30 cloud-gw-transport-codfw (T261724) [admin]
08:47 <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet delete a5ab5362-4ffb-4059-9ff7-391e22dcf3bc (T261724) [admin]
08:22 <marostegui> Restart dbstore1005 mysql to pick up new buffer pool sizes [production]
08:11 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:11 <filippo@cumin1001> START - Cookbook sre.hosts.downtime [production]
07:58 <elukey> decom analytics1044 from Hadoop [analytics]
07:36 <moritzm> installing xen security updates for buster (libs only) [production]
07:34 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
07:34 <filippo@cumin1001> START - Cookbook sre.hosts.downtime [production]
07:04 <elukey> failover from an-master1002 to 1001 for HDFS namenode (the namenode failed over hours ago, no logs to check) [analytics]
00:16 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
00:00 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
2020-10-08 §
23:42 <ryankemper> `cloudelastic1006` done. Writes thawed, maintenance window lifted; restarts are done for `cloudelastic` [production]
23:37 <ryankemper> `cloudelastic1005` done [production]
23:31 <ryankemper> `cloudelastic1004` done [production]
23:27 <ryankemper> `cloudelastic1003` done [production]
23:23 <ryankemper> `cloudelastic1002` done [production]
23:16 <tgr_> Evening deploys done [production]
23:16 <ryankemper> `cloudelastic1001` is done restarting and cluster is green again. Proceeding to `cloudelastic1002` [production]
23:16 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:632797|Enable logging of session cookie changes everywhere (T264793)]] (duration: 01m 01s) [production]