2020-10-06
|
10:48 |
<effie> |
set mw2279.codfw.wmnet as inactive T264698 |
[production] |
10:47 |
<jiji@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw2279.codfw.wmnet |
[production] |
10:45 |
<hnowlan@deploy1001> |
Finished deploy [restbase/deploy@4ad65b0]: Deploying restbase to new hosts (duration: 01m 19s) |
[production] |
10:44 |
<hnowlan@deploy1001> |
Started deploy [restbase/deploy@4ad65b0]: Deploying restbase to new hosts |
[production] |
10:43 |
<hnowlan@deploy1001> |
Finished deploy [restbase/deploy@4ad65b0]: Deploying restbase to new hosts (duration: 01m 19s) |
[production] |
10:41 |
<hnowlan@deploy1001> |
Started deploy [restbase/deploy@4ad65b0]: Deploying restbase to new hosts |
[production] |
10:37 |
<hnowlan@deploy1001> |
Finished deploy [restbase/deploy@4ad65b0]: Redeploying to depooled restbase2009 (duration: 00m 15s) |
[production] |
10:37 |
<hnowlan@deploy1001> |
Started deploy [restbase/deploy@4ad65b0]: Redeploying to depooled restbase2009 |
[production] |
10:36 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
10:33 |
<hnowlan@deploy1001> |
Finished deploy [restbase/deploy@4ad65b0]: (no justification provided) (duration: 03m 01s) |
[production] |
10:31 |
<volans@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
10:30 |
<hnowlan@deploy1001> |
Started deploy [restbase/deploy@4ad65b0]: (no justification provided) |
[production] |
10:01 |
<marostegui> |
Restart mysql on dbstore1004 to pick up new buffer pool sizes |
[production] |
09:59 |
<effie> |
enable puppet on mc20* |
[production] |
09:41 |
<effie> |
enable puppet on mc10* |
[production] |
09:38 |
<effie> |
disable puppet on mc* |
[production] |
09:27 |
<klausman@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:26 |
<klausman@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:08 |
<elukey> |
add an-worker1114 to the hadoop cluster |
[analytics] |
09:04 |
<klausman> |
Starting reimaging of stat1007 |
[analytics] |
08:57 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) |
[production] |
08:55 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers |
[production] |
08:33 |
<jayme> |
imported envoyproxy_1.15.1-1+deb9u1 to stretch-wikimedia |
[production] |
08:27 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:26 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:02 |
<volans> |
removing unused ms-fe and ms-fe-thumbs svc records from DNS (gerrit/628086) |
[production] |
07:53 |
<marostegui> |
Change innodb_change_buffering = inserts on db2087:3316 db2089:3316 db2076 db2097:3316 db2114 T263443 |
[production] |
07:39 |
<filippo@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . |
[production] |
07:35 |
<filippo@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . |
[production] |
07:32 |
<elukey> |
bootstrap an-worker111[13] as hadoop workers |
[analytics] |
07:31 |
<filippo@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'wikifeeds' for release 'staging' . |
[production] |
07:17 |
<marostegui> |
Remove es2015 and es2017 from tendril and zarcillo T264700 T264386 |
[production] |
07:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es2015 T264700 ', diff saved to https://phabricator.wikimedia.org/P12926 and previous config saved to /var/cache/conftool/dbconfig/20201006-071451-marostegui.json |
[production] |
07:05 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
06:59 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
06:01 |
<Matthew> |
Restarted bot completely; two instances refused to reconnect. |
[wm-bot] |
05:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove es2017 from dbctl T264386', diff saved to https://phabricator.wikimedia.org/P12925 and previous config saved to /var/cache/conftool/dbconfig/20201006-052849-marostegui.json |
[production] |
2020-10-05
|
23:11 |
<ejegg> |
updated payments staging from 52704ffe24 to db03677b2d |
[production] |
22:31 |
<Amir1> |
deleted deployment-mailman01 (T257118) |
[releng] |
22:29 |
<Amir1> |
deleted deployment-imagescaler01 and deployment-imagescaler02 (T257118) |
[releng] |
22:29 |
<mutante> |
deleted the shinken module |
[monitoring] |
22:27 |
<mutante> |
removing shinken puppet module and role |
[production] |
22:01 |
<ebernhardson> |
restore wikidatawiki_content enwiki_content enwiki_general and commonswiki_file to default index.merge.policy.deletes_pct_allowed on eqiad cirrus cluster T264053 |
[production] |
21:58 |
<bstorm> |
setting "mtail::from_component: true" on both mx-out servers to make puppet work again |
[cloudinfra] |
21:01 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
21:00 |
<Urbanecm> |
Run keyholder arm at deployment-cumin |
[releng] |
21:00 |
<Urbanecm> |
Run puppet at deployment-mwmaint01 and deployment-mediawiki-07 |
[releng] |
20:59 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
20:45 |
<hauskatze> |
Fixed puppet for deployment-mediawiki-07: s/memcache/memcached/ |
[releng] |
20:30 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |