2020-05-19
§
|
12:37 |
<jayme> |
imported helm 2.16.7-2 to main for buster-wikimedia, stretch-wikimedia, jessie-wikimedia |
[production] |
12:17 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) |
[production] |
11:51 |
<jynus> |
starting backups of es1, es2, es3 on eqiad into backup1002 |
[production] |
11:41 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Depool es1018, es1015, es1019', diff saved to https://phabricator.wikimedia.org/P11232 and previous config saved to /var/cache/conftool/dbconfig/20200519-114148-jynus.json |
[production] |
11:12 |
<marostegui> |
Deploy schema change on db2124 (frwiki, jawiki, ruwiki) T238966 |
[production] |
10:34 |
<mutante> |
releases2001 - restarted failed jenkins |
[production] |
10:33 |
<mutante> |
releases2001 - Failed to restart jenkins.service: The name org.freedesktop.PolicyKit1 was not provided by any .service files |
[production] |
10:32 |
<volans> |
flushed all Netbox caches (manage.py invalidate all) - T253091 |
[production] |
10:29 |
<volans> |
start Netbox restore - T253091 |
[production] |
10:18 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' . |
[production] |
10:13 |
<akosiaris> |
upgrade etherpad-lite to 1.8.4 on etherpad1002 |
[production] |
09:58 |
<hnowlan> |
roll-restart of eqiad restbase hosts for java security updates |
[production] |
09:58 |
<hnowlan@cumin1001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
09:55 |
<jayme@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
09:55 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'canary' . |
[production] |
09:55 |
<jayme@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'production' . |
[production] |
09:54 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' . |
[production] |
09:10 |
<godog> |
eqiad-prod: decom ms-be101[678] - T252008 |
[production] |
08:07 |
<XioNoX> |
Push 596597: BGP: standardize fixed part of IX4/IX6 groups - eqsin |
[production] |
08:04 |
<XioNoX> |
Push 596597: BGP: standardize fixed part of IX4/IX6 groups - esams |
[production] |
08:01 |
<XioNoX> |
Push 596597: BGP: standardize fixed part of IX4/IX6 groups - eqiad |
[production] |
07:55 |
<volker-e@deploy1001> |
Finished deploy [design/style-guide@37c67dd]: Deploy design/style-guide: (duration: 00m 06s) |
[production] |
07:54 |
<volker-e@deploy1001> |
Started deploy [design/style-guide@37c67dd]: Deploy design/style-guide: |
[production] |
07:52 |
<XioNoX> |
Push 596597: BGP: standardize fixed part of IX4/IX6 groups - *dfw |
[production] |
07:49 |
<XioNoX> |
Push 596597: BGP: standardize fixed part of IX4/IX6 groups - ulsfo |
[production] |
07:45 |
<vgutierrez> |
rolling upgrade to trafficserver 8.0.7-1wm10 with puppet disabled on cp hosts |
[production] |
07:09 |
<jynus> |
starting es4 & es5 eqiad backups with low concurrency |
[production] |
06:35 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) |
[production] |
06:29 |
<elukey@cumin1001> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper |
[production] |
06:24 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) |
[production] |
06:17 |
<elukey@cumin1001> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper |
[production] |
05:57 |
<volker-e@deploy1001> |
Finished deploy [design/style-guide@7bfbd2a]: Deploy design/style-guide: (duration: 00m 06s) |
[production] |
05:57 |
<volker-e@deploy1001> |
Started deploy [design/style-guide@7bfbd2a]: Deploy design/style-guide: |
[production] |
05:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set s2 and s8 as read-only=off for maintenance T251981', diff saved to https://phabricator.wikimedia.org/P11227 and previous config saved to /var/cache/conftool/dbconfig/20200519-050346-marostegui.json |
[production] |
05:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set s2 and s8 as read-only for maintenance T251981', diff saved to https://phabricator.wikimedia.org/P11226 and previous config saved to /var/cache/conftool/dbconfig/20200519-050043-marostegui.json |
[production] |
04:27 |
<marostegui> |
Repool labsdb1011 T249188 |
[production] |
03:29 |
<volker-e@deploy1001> |
Finished deploy [design/style-guide@4b4bc51]: Deploy design/style-guide: (duration: 00m 07s) |
[production] |
03:28 |
<volker-e@deploy1001> |
Started deploy [design/style-guide@4b4bc51]: Deploy design/style-guide: |
[production] |
2020-05-18
§
|
23:50 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
23:47 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
23:25 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
23:23 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
23:12 |
<ryankemper> |
Restarted `wdqs-updater` across all wdqs nodes and restarted `wdqs-categories` across all nodes except 1010 (test wdqs server) and 1009 (automated deployment server) |
[production] |
22:55 |
<Krinkle> |
Clear module_deps on dewiki (group2, old mw version, s5) to monitor regeneration |
[production] |
22:48 |
<Krinkle> |
Clear module_deps on group0 (mostly s3) to monitor regeneration |
[production] |
22:48 |
<Krinkle> |
Clear module_deps on commonswiki (group0, mostly s3) to monitor regeneration |
[production] |
22:35 |
<Krinkle> |
Clear module_deps on commonswiki (group1, s4) to monitor regeneration |
[production] |
22:33 |
<ryankemper@deploy1001> |
Finished deploy [wdqs/wdqs@4886dc3]: 0.3.32 (duration: 17m 12s) |
[production] |
22:19 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
22:18 |
<Krinkle> |
Clear module_deps on s2 wikis to monitor regeneration |
[production] |