151-200 of 10000 results (38ms)
2020-10-09 §
12:52 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
12:33 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:20 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
12:20 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:16 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'proton' for release 'production' . [production]
12:15 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'push-notifications' for release 'main' . [production]
12:13 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
11:38 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
11:16 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . [production]
11:13 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'mathoid' for release 'production' . [production]
11:13 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . [production]
10:52 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'mathoid' for release 'staging' . [production]
10:41 <gehel@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
10:17 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
10:17 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
10:16 <jayme@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
10:11 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . [production]
10:11 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
09:55 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . [production]
09:53 <jayme@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
09:47 <elukey> roll restart of hadoop-yarn-nodemanager on all hadoop workers to pick up new settings [production]
09:38 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' . [production]
09:38 <jayme@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' . [production]
09:32 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:32 <kormat@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:07 <XioNoX> remove user from all network devices [production]
08:22 <marostegui> Restart dbstore1005 mysql to pick up new buffer pool sizes [production]
08:11 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:11 <filippo@cumin1001> START - Cookbook sre.hosts.downtime [production]
07:36 <moritzm> installing xen security updates for buster (libs only) [production]
07:34 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
07:34 <filippo@cumin1001> START - Cookbook sre.hosts.downtime [production]
00:16 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
00:00 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
2020-10-08 §
23:42 <ryankemper> `cloudelastic1006` done. Writes thawed, maintenance window lifted; restarts are done for `cloudelastic` [production]
23:37 <ryankemper> `cloudelastic1005` done [production]
23:31 <ryankemper> `cloudelastic1004` done [production]
23:27 <ryankemper> `cloudelastic1003` done [production]
23:23 <ryankemper> `cloudelastic1002` done [production]
23:16 <tgr_> Evening deploys done [production]
23:16 <ryankemper> `cloudelastic1001` is done restarting and cluster is green again. Proceeding to `cloudelastic1002` [production]
23:16 <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:632797|Enable logging of session cookie changes everywhere (T264793)]] (duration: 01m 01s) [production]
23:04 <ryankemper> Beginning cluster restarts one server at a time. For each server, the process is depool->restart elasticsearch services->wait for services to restart and then pool->wait for cluster to return to green status before starting next server [production]
23:01 <ryankemper> Writes are frozen for `cloudelastic`: `/usr/local/bin/mwscript extensions/CirrusSearch/maintenance/FreezeWritesToCluster.php --wiki=enwiki --cluster=cloudelastic` on `mwmaint2001` => `Applied cluster-wide freeze` [production]
22:56 <ryankemper> `sudo apt policy wmf-elasticsearch-search-plugins` shows correct state: `Installed: 6.5.4-4~stretch` [production]
22:56 <ryankemper> `sudo -E cumin -b 6 C:role::elasticsearch::cloudelastic 'DEBIAN_FRONTEND=noninteractive sudo apt-get -y -o Dpkg::Options::="--force-confdef" -o Dpkg::Options::="--force-confold" install wmf-elasticsearch-search-plugins'` [production]
22:54 <ryankemper> About to start plugin upgrade followed by restarts of `cloudelastic`. Maintenance window set for the next 2 hours on `cloudelastic100[1-6]` [production]
21:54 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@a923949]: search_satisfaction: update druid datasource to match previous data (duration: 01m 04s) [production]
21:53 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@a923949]: search_satisfaction: update druid datasource to match previous data [production]
21:52 <hashar@deploy1001> Synchronized php-1.36.0-wmf.10/includes/session/SessionBackend.php: Deduplicate SessionBackend::logPersistenceChange calls - T264793 (duration: 01m 01s) [production]