1-50 of 10000 results (19ms)
2020-10-05 ยง
23:11 <ejegg> updated payments staging from 52704ffe24 to db03677b2d [production]
22:31 <Amir1> deleted deployment-mailman01 (T257118) [releng]
22:29 <Amir1> deleted deployment-imagescaler01 and deployment-imagescaler02 (T257118) [releng]
22:29 <mutante> deleted the shinken module [monitoring]
22:27 <mutante> removing shinken puppet module and role [production]
22:01 <ebernhardson> restore wikidatawiki_content enwiki_content enwiki_general and commonswiki_file to default index.merge.policy.deletes_pct_allowed on eqiad cirrus cluster T264053 [production]
21:58 <bstorm> setting "mtail::from_component: true" on both mx-out servers to make puppet work again [cloudinfra]
21:01 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
21:00 <Urbanecm> Run keyholder arm at deployment-cumin [releng]
21:00 <Urbanecm> Run puppet at deployment-mwmaint01 and deployment-mediawiki-07 [releng]
20:59 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
20:45 <hauskatze> Fixed puppet for deployment-mediawiki-07: s/memcache/memcached/ [releng]
20:30 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
20:28 <andrew@cumin1001> START - Cookbook sre.hosts.downtime [production]
20:26 <ebernhardson> restart elasticsearch_6@production-search-codfw on elastic2051 to take reduced (32 sector, 16kB) readahead settings T264053 [production]
20:13 <ebernhardson> restart elasticsearch_6@production-search-codfw on elastic2051 to take reduced (64 sector, 32kB) readahead settings T264053 [production]
19:56 <ebernhardson> restart elasticsearch_6@production-search-codfw on elastic2050 to take reduced (128kB) readahead settings T264053 [production]
19:31 <mutante> ran sre.dns.netbox to push addition of an-worker1113 which was commited in prod repo but not in netbox data [production]
19:30 <dzahn@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:27 <dzahn@cumin1001> START - Cookbook sre.dns.netbox [production]
19:14 <mforns> restarted oozie coord unique_devices-per_domain-monthly after deployment [analytics]
19:05 <mforns> finished deploying refinery to unblock deletion of raw mediawiki_job and raw netflow data [analytics]
18:59 <mforns@deploy1001> Finished deploy [analytics/refinery@2c6c335] (thin): [THIN] Special deployment to unblock deletion jobs [analytics/refinery@2c6c335e61cecd0321ec6f066a153feaf2dbbc27] (duration: 00m 08s) [production]
18:59 <mforns@deploy1001> Started deploy [analytics/refinery@2c6c335] (thin): [THIN] Special deployment to unblock deletion jobs [analytics/refinery@2c6c335e61cecd0321ec6f066a153feaf2dbbc27] [production]
18:58 <mforns@deploy1001> Finished deploy [analytics/refinery@2c6c335]: Special deployment to unblock deletion jobs [analytics/refinery@2c6c335e61cecd0321ec6f066a153feaf2dbbc27] (duration: 12m 08s) [production]
18:46 <mforns@deploy1001> Started deploy [analytics/refinery@2c6c335]: Special deployment to unblock deletion jobs [analytics/refinery@2c6c335e61cecd0321ec6f066a153feaf2dbbc27] [production]
18:46 <mutante> marked project for deletion in 2020 purge [planet]
18:45 <mforns> deploying refinery to unblock deletion of raw mediawiki_job and raw netflow data [analytics]
18:44 <mutante> shutting down instance pk8s - not in use since 2019 [planet]
18:20 <elukey> manual creation of /opt/rocm -> /opt/rocm-3.3.0 on stat1008 to avoid failures in finding the lib dir [analytics]
18:17 <elukey@cumin1001> END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) [production]
18:17 <elukey@cumin1001> START - Cookbook sre.hadoop.init-hadoop-workers [production]
18:15 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) [production]
18:13 <elukey@cumin1001> START - Cookbook sre.hadoop.init-hadoop-workers [production]
18:11 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) [production]
18:10 <elukey@cumin1001> START - Cookbook sre.hadoop.init-hadoop-workers [production]
17:53 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
17:51 <elukey@cumin1001> START - Cookbook sre.hosts.downtime [production]
17:44 <wm-bot> <bd808> Purged cache of puppet roles for T264649 [tools.openstack-browser]
17:40 <bd808> `service uwsgi-labspuppetbackend restart` on cloud-puppetmaster-03 (T264649) [admin]
17:29 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
17:27 <elukey@cumin1001> START - Cookbook sre.hosts.downtime [production]
17:25 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
17:25 <hnowlan@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
17:11 <elukey> bootstrap an-worker[1115-1117] as hadoop workers [analytics]
17:00 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
17:00 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
16:59 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
16:59 <hnowlan@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
16:51 <hnowlan@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]