451-500 of 10000 results (18ms)
2020-10-09 §
13:31 <gehel@cumin1001> START - Cookbook sre.wdqs.data-reload [production]
13:29 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . [production]
13:23 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'wikifeeds' for release 'staging' . [production]
13:12 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'termbox' for release 'production' . [production]
12:55 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'termbox' for release 'production' . [production]
12:52 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
12:33 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:20 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
12:20 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:16 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:15 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
12:13 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
11:38 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
11:16 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
11:13 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
11:13 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . [production]
10:52 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'mathoid' for release 'staging' . [production]
10:41 <gehel@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
10:17 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
10:17 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
10:16 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
10:11 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
10:11 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
09:55 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
09:53 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
09:47 <elukey> roll restart of hadoop-yarn-nodemanager on all hadoop workers to pick up new settings [production]
09:38 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
09:38 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' . [production]
09:32 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:32 <kormat@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:07 <XioNoX> remove user from all network devices [production]
08:22 <marostegui> Restart dbstore1005 mysql to pick up new buffer pool sizes [production]
08:11 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:11 <filippo@cumin1001> START - Cookbook sre.hosts.downtime [production]
07:36 <moritzm> installing xen security updates for buster (libs only) [production]
07:34 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
07:34 <filippo@cumin1001> START - Cookbook sre.hosts.downtime [production]
00:16 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
00:00 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
2020-10-08 §
23:42 <ryankemper> `cloudelastic1006` done. Writes thawed, maintenance window lifted; restarts are done for `cloudelastic` [production]
23:37 <ryankemper> `cloudelastic1005` done [production]
23:31 <ryankemper> `cloudelastic1004` done [production]
23:27 <ryankemper> `cloudelastic1003` done [production]
23:23 <ryankemper> `cloudelastic1002` done [production]
23:16 <tgr_> Evening deploys done [production]
23:16 <ryankemper> `cloudelastic1001` is done restarting and cluster is green again. Proceeding to `cloudelastic1002` [production]
23:16 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:632797|Enable logging of session cookie changes everywhere (T264793)]] (duration: 01m 01s) [production]
23:04 <ryankemper> Beginning cluster restarts one server at a time. For each server, the process is depool->restart elasticsearch services->wait for services to restart and then pool->wait for cluster to return to green status before starting next server [production]
23:01 <ryankemper> Writes are frozen for `cloudelastic`: `/usr/local/bin/mwscript extensions/CirrusSearch/maintenance/FreezeWritesToCluster.php --wiki=enwiki --cluster=cloudelastic` on `mwmaint2001` => `Applied cluster-wide freeze` [production]
22:56 <ryankemper> `sudo apt policy wmf-elasticsearch-search-plugins` shows correct state: `Installed: 6.5.4-4~stretch` [production]