2020-07-23
§
|
09:27 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'mobileapps' for release 'nontls' . |
[production] |
09:25 |
<akosiaris@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . |
[production] |
09:24 |
<jayme@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'termbox' for release 'production' . |
[production] |
09:20 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'termbox' for release 'production' . |
[production] |
09:19 |
<akosiaris> |
lower replica count back to 80 for mobileapps. T218733 |
[production] |
09:19 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
09:19 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mobileapps' for release 'nontls' . |
[production] |
09:02 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'termbox' for release 'staging' . |
[production] |
08:59 |
<marostegui> |
transfer --type=xtrabackup from db1117:3322 to db1107 T257540 |
[production] |
08:45 |
<jayme@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
08:42 |
<godog> |
test librenms poller from netmon2001 |
[production] |
08:41 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:40 |
<XioNoX> |
remove pim-rp IPs from last routers - T257573 |
[production] |
08:40 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
08:39 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:29 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
08:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove db1107 from s1 T257540', diff saved to https://phabricator.wikimedia.org/P12025 and previous config saved to /var/cache/conftool/dbconfig/20200723-082647-marostegui.json |
[production] |
08:16 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1107 to move it to m2 T257540', diff saved to https://phabricator.wikimedia.org/P12024 and previous config saved to /var/cache/conftool/dbconfig/20200723-081650-marostegui.json |
[production] |
05:29 |
<marostegui> |
Restore labsdb1009's original weight |
[production] |
00:24 |
<legoktm@deploy1001> |
Synchronized php-1.35.0-wmf.41/includes/: T258664: Revert "Add a new type of database to the installer from extension" (2/2) (duration: 01m 08s) |
[production] |
00:22 |
<legoktm@deploy1001> |
Synchronized php-1.35.0-wmf.41/includes/libs/rdbms/database/Database.php: T258664: Revert "Add a new type of database to the installer from extension" (duration: 01m 05s) |
[production] |
00:20 |
<legoktm@deploy1001> |
Scap failed!: 9/9 canaries failed their endpoint checks(https://en.wikipedia.org) |
[production] |
00:16 |
<legoktm@deploy1001> |
Synchronized php-1.36.0-wmf.1/includes/: T258664: Revert "Add a new type of database to the installer from extension" (duration: 01m 09s) |
[production] |
00:11 |
<legoktm@deploy1001> |
scap failed: average error rate on 3/9 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/e474f13ffac6b8c3bf919c4aeafc8c9b for details) |
[production] |
2020-07-22
§
|
23:32 |
<bstorm> |
setting the default NFS version to 4.2 while excepting the two stretch servers T257945 |
[paws] |
23:24 |
<bstorm> |
created server group 'tools-k8s-worker' to create any new worker nodes in so that they have a low chance of being scheduled together by openstack unless it is necessary T258663 |
[tools] |
23:22 |
<bstorm> |
running puppet and NFS 4.2 remount on tools-k8s-worker-[56-60] T257945 |
[tools] |
23:17 |
<bstorm> |
running puppet and NFS 4.2 remount on tools-k8s-worker-[41-55] T257945 |
[tools] |
23:14 |
<bstorm> |
running puppet and NFS 4.2 remount on tools-k8s-worker-[21-40] T257945 |
[tools] |
23:11 |
<bstorm> |
running puppet and NFS remount on tools-k8s-worker-[1-15] T257945 |
[tools] |
23:07 |
<bstorm> |
disabling puppet on k8s workers to reduce the effect of changing the NFS mount version all at once T257945 |
[tools] |
22:28 |
<bstorm> |
setting tools-k8s-control prefix to mount NFS v4.2 T257945 |
[tools] |
22:26 |
<wm-bot> |
<lucaswerkmeister> deployed 9eb2aa216d (no edit region without regions) |
[tools.wd-image-positions] |
22:15 |
<bstorm> |
set the tools-k8s-control nodes to also use 800MBps to prevent issues with toolforge ingress and api system |
[tools] |
22:07 |
<cdanis> |
remove downtime on api.svc.codfw.wmnet T258614 |
[production] |
22:07 |
<bstorm> |
set the tools-k8s-haproxy-1 (main load balancer for toolforge) to have an egress limit of 800MB per sec instead of the same as all the other servers |
[tools] |
22:06 |
<wm-bot> |
<lucaswerkmeister> deployed aa97ea1589 (Esc for editing regions) |
[tools.wd-image-positions] |
21:36 |
<wm-bot> |
<lucaswerkmeister> deployed 6b34c5bb7b (editing regions) |
[tools.wd-image-positions] |
20:48 |
<brennen> |
restarted php7.2-fpm on deployment-mediawiki-{07,09} for T258628 |
[releng] |
20:10 |
<Urbanecm> |
tools.stewardbots@tools-sgebastion-07:~$ restart_stewardbot.sh |
[tools.stewardbots] |
19:26 |
<jhuneidi@deploy1001> |
Synchronized php: group1 wikis to 1.36.0-wmf.1 (duration: 01m 03s) |
[production] |
19:25 |
<jhuneidi@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.36.0-wmf.1 |
[production] |
19:15 |
<urbanecm@deploy1001> |
Finished scap: 9529cf8d2570bbf6dd1e919c966f5954e39dbd67: b66ec9143bd96cbf3a20b70f6aa3f2d6d7963bb5: OOUI backport; 93755a6a92923ae390e3a04b19421c8562568d2a: i18n changes for OAuth, removal of spam messages (duration: 42m 26s) |
[production] |
19:14 |
<ejegg> |
updated payments-wiki from bf91f8adff to 31a3de1130 |
[production] |
19:11 |
<mutante> |
mw2335 - mw2339 - scap pull |
[production] |
18:39 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw233[5-9].codfw.wmnet |
[production] |
18:38 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw233[6-9].codfw.wmnet |
[production] |
18:36 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw233[6-9].codfw.wmnet |
[production] |
18:33 |
<urbanecm@deploy1001> |
Started scap: 9529cf8d2570bbf6dd1e919c966f5954e39dbd67: b66ec9143bd96cbf3a20b70f6aa3f2d6d7963bb5: OOUI backport; 93755a6a92923ae390e3a04b19421c8562568d2a: i18n changes for OAuth, removal of spam messages |
[production] |
18:33 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2335.codfw.wmnet |
[production] |