2021-08-23
09:55 |
<mbsantos> |
start re-import OSM planet data into maps1009 eqiad master (T288400, T288897) |
[production] |
09:53 |
<urbanecm> |
Deploy security patch for T289408 |
[production] |
09:51 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
09:50 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
09:33 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=false; selector: dnsdisc=swift-ro,name=codfw |
[production] |
09:33 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=false; selector: dnsdisc=swift,name=codfw |
[production] |
09:02 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=true; selector: dnsdisc=swift-ro,name=eqiad |
[production] |
09:02 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=true; selector: dnsdisc=swift,name=eqiad |
[production] |
09:01 |
<godog> |
pooling swift in eqiad - T288458 |
[production] |
07:59 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
07:55 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
07:46 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
07:44 |
<ladsgroup@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:714322|Set request languages rdf output for wikidata to true (T285795)]] (duration: 00m 57s) |
[production] |
07:42 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
07:28 |
<Amir1> |
running FlaggedRevs/maintenance/pruneRevData.php on all flaggedrevs wikis |
[production] |
07:28 |
<ladsgroup@deploy1002> |
Synchronized php-1.37.0-wmf.19/extensions/FlaggedRevs/maintenance/pruneRevData.php: Backport: [[gerrit:714151|Avoid calling delete() with empty arrays in PruneFRIncludeData (T289249)]] (duration: 00m 59s) |
[production] |
07:20 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2023.codfw.wmnet with reason: REIMAGE |
[production] |
07:18 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2023.codfw.wmnet with reason: REIMAGE |
[production] |
2021-08-20
23:17 |
<legoktm> |
deployed patch for T289385 |
[production] |
17:03 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1141.eqiad.wmnet |
[production] |
17:01 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1141.eqiad.wmnet |
[production] |
16:58 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1140.eqiad.wmnet |
[production] |
16:56 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1140.eqiad.wmnet |
[production] |
16:56 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1139.eqiad.wmnet |
[production] |
16:54 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1139.eqiad.wmnet |
[production] |
16:45 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1134.eqiad.wmnet |
[production] |
16:43 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1134.eqiad.wmnet |
[production] |
16:38 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1133.eqiad.wmnet |
[production] |
16:36 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1133.eqiad.wmnet |
[production] |
15:37 |
<jayme> |
deleting various pods from staging to have them recreated with priorities - T289131 |
[production] |
15:25 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1129.eqiad.wmnet |
[production] |
15:23 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1129.eqiad.wmnet |
[production] |
15:14 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
14:41 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2021.codfw.wmnet with reason: REIMAGE |
[production] |
14:39 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2021.codfw.wmnet with reason: REIMAGE |
[production] |
13:54 |
<jelto@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
13:48 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
12:00 |
<jayme> |
enabled priority admission plugin on k8s staging, rolling restart all pods in kube-system namespace - T289131 |
[production] |
11:55 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
10:35 |
<dzahn@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
09:33 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts druid1001.eqiad.wmnet |
[production] |
09:32 |
<dzahn@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
09:23 |
<btullis@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts druid1001.eqiad.wmnet |
[production] |
08:48 |
<godog> |
roll depool/pool thanos-fe to apply swift change - T288815 |
[production] |
08:43 |
<godog> |
temp depool thanos-fe2003 to test https://gerrit.wikimedia.org/r/c/operations/puppet/+/713815 |
[production] |
08:43 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on druid1001.eqiad.wmnet with reason: decommissioning druid1001 |
[production] |
08:43 |
<btullis@cumin1001> |
START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on druid1001.eqiad.wmnet with reason: decommissioning druid1001 |
[production] |
07:14 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mc2019.codfw.wmnet with reason: REIMAGE |
[production] |