2021-01-19
|
21:51 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2312.codfw.wmnet |
[production] |
21:50 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2315.codfw.wmnet |
[production] |
21:46 |
<ottomata> |
wiping kafka-test cluster data and starting from scratch - T255973 |
[production] |
21:00 |
<Urbanecm> |
Start of `foreachwikiindblist group2 extensions/AbuseFilter/maintenance/MigrateAflFilter.php --batch-size=1000` (T269713) |
[production] |
20:09 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.36.0-wmf.27 |
[production] |
20:08 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2315.codfw.wmnet |
[production] |
20:07 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2314.codfw.wmnet |
[production] |
20:07 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2313.codfw.wmnet |
[production] |
20:07 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2312.codfw.wmnet |
[production] |
19:46 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
19:30 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
19:27 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 |
[production] |
19:22 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.stop-cluster for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 |
[production] |
18:58 |
<brennen@deploy1001> |
Pruned MediaWiki: 1.36.0-wmf.22 (duration: 03m 53s) |
[production] |
18:47 |
<mbsantos@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'push-notifications' for release 'main' . |
[production] |
18:43 |
<mbsantos@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'push-notifications' for release 'main' . |
[production] |
18:42 |
<brennen@deploy1001> |
Finished scap: testwikis wikis to 1.36.0-wmf.27 (duration: 41m 57s) |
[production] |
18:39 |
<mbsantos@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'push-notifications' for release 'main' . |
[production] |
18:01 |
<brennen@deploy1001> |
Started scap: testwikis wikis to 1.36.0-wmf.27 |
[production] |
17:59 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on restbase2009.codfw.wmnet with reason: REIMAGE |
[production] |
17:59 |
<brennen> |
starting deploy-promote to testwikis for 1.36.0-wmf.27 (T271341) |
[production] |
17:58 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on restbase2009.codfw.wmnet with reason: REIMAGE |
[production] |
17:30 |
<Urbanecm> |
Start of `foreachwikiindblist group1 extensions/AbuseFilter/maintenance/MigrateAflFilter.php --batch-size=1000` (T269713) |
[production] |
17:08 |
<Urbanecm> |
Run extensions/AbuseFilter/maintenance/MigrateAflFilter.php for all group0 wikis (T269713) |
[production] |
17:06 |
<Urbanecm> |
mwscript extensions/AbuseFilter/maintenance/MigrateAflFilter.php --wiki=test2wiki --batch-size=1000 # T269713 |
[production] |
17:04 |
<Urbanecm> |
mwscript extensions/AbuseFilter/maintenance/MigrateAflFilter.php --wiki=testwiki --batch-size=1000 # T269713 |
[production] |
16:51 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mw2314.codfw.wmnet with reason: new install on buster |
[production] |
16:51 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on mw2314.codfw.wmnet with reason: new install on buster |
[production] |
16:50 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2314.codfw.wmnet with reason: REIMAGE |
[production] |
16:48 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2313.codfw.wmnet with reason: REIMAGE |
[production] |
16:47 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'test' . |
[production] |
16:47 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'main' . |
[production] |
16:47 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'canary' . |
[production] |
16:47 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2315.codfw.wmnet with reason: REIMAGE |
[production] |
16:46 |
<brennen> |
1.36.0-wmf.27 was branched at fbb516d8e33924c6cb66c93bb6d42907558c31f3 for T271341 |
[production] |
16:45 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2312.codfw.wmnet with reason: REIMAGE |
[production] |
16:45 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2315.codfw.wmnet with reason: REIMAGE |
[production] |
16:45 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2314.codfw.wmnet with reason: REIMAGE |
[production] |
16:43 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2313.codfw.wmnet with reason: REIMAGE |
[production] |
16:43 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2312.codfw.wmnet with reason: REIMAGE |
[production] |
16:43 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'canary' . |
[production] |
16:43 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'test' . |
[production] |
16:43 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'main' . |
[production] |
16:41 |
<jmm@cumin2001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host ms-be1046.eqiad.wmnet |
[production] |
16:39 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'test' . |
[production] |
16:39 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'main' . |
[production] |
16:39 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'canary' . |
[production] |
16:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1112 (re)pooling @ 100%: After moving wikireplicas to another host', diff saved to https://phabricator.wikimedia.org/P13838 and previous config saved to /var/cache/conftool/dbconfig/20210119-163637-root.json |
[production] |
16:30 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'test' . |
[production] |
16:30 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'similar-users' for release 'canary' . |
[production] |