2020-01-24
|
11:09 |
<moritzm> |
purged stale grafana package from grafana1001, caused systemd unit failure |
[production] |
11:04 |
<effie> |
restart php-fpm on mw1238-mw1239 |
[production] |
09:29 |
<akosiaris> |
disable and mask etherpad-lite on etherpad1002 to avoid corruption issues. T224580 |
[production] |
08:42 |
<marostegui> |
Remove wikiadmin2 user from pc2XXX codfw hosts T243512 |
[production] |
08:17 |
<moritzm> |
installing python-apt security updates |
[production] |
07:19 |
<_joe_> |
force run puppet on all esams cache nodes, for mitigation of T243313 |
[production] |
06:37 |
<marostegui> |
Stop replication on db1107 |
[production] |
06:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2085 after memory replacement T243148', diff saved to https://phabricator.wikimedia.org/P10256 and previous config saved to /var/cache/conftool/dbconfig/20200124-061228-marostegui.json |
[production] |
01:24 |
<mutante> |
running puppet on cp-text_ulsfo |
[production] |
00:46 |
<mutante> |
cp4032 - starting varnishmtail.service |
[production] |
00:36 |
<catrope@deploy1001> |
Synchronized php-1.35.0-wmf.16/extensions/CentralNotice/resources/ext.centralNotice.display/hide.js: T240802 (duration: 01m 05s) |
[production] |
00:34 |
<catrope@deploy1001> |
Synchronized php-1.35.0-wmf.15/extensions/CentralNotice/resources/ext.centralNotice.display/hide.js: T240802 (duration: 01m 07s) |
[production] |
00:33 |
<mutante> |
cp4032 - starting varnishmtail.service, which was in a failed state |
[production] |
00:32 |
<catrope@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Bump Parsoid/PHP cluster memory_limit again (T239806, T236833) (duration: 01m 05s) |
[production] |
2020-01-23
|
21:08 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventstreams' for release 'production' . |
[production] |
20:30 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: Revert "group2 wikis to 1.35.0-wmf.15" |
[production] |
20:29 |
<brennen> |
reverting group2 to 1.35.0-wmf.15 |
[production] |
20:10 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.35.0-wmf.16 |
[production] |
20:00 |
<Urbanecm> |
Morning SWAT done |
[production] |
19:56 |
<mlitn@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Add 3d-patents page to wgForceUIMsgAsContentMsg (duration: 01m 08s) |
[production] |
19:15 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 2d8f773: Use editeditorprotected for protecting pages for editors (T230103) (duration: 01m 05s) |
[production] |
19:10 |
<urbanecm@deploy1001> |
Synchronized php-1.35.0-wmf.16/extensions/WikimediaMessages/extension.json: SWAT: 23a6f8e: InukaPageView: update schema version (T238029) (duration: 01m 05s) |
[production] |
19:06 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 629b5fc: Add *.eso.org to the wgCopyUploadsDomains (T243423) (duration: 01m 06s) |
[production] |
19:03 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
18:59 |
<mutante> |
ganeti1003 - creating new VM etherpad1002.eqiad.wmnet with 1GB RAM and 10GB disk, row C, private link (T243475) |
[production] |
18:58 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
18:54 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventstreams' for release 'production' . |
[production] |
18:47 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventstreams' for release 'production' . |
[production] |
18:40 |
<jforrester@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Stop setting wgWikimediaMessagesPartialBlockBanner, never read T240300 (duration: 01m 06s) |
[production] |
18:35 |
<rlazarus> |
etcd main cluster switchover complete, eqiad is now read-write |
[production] |
18:28 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'eventstreams' for release 'production' . |
[production] |
18:27 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventstreams' for release 'production' . |
[production] |
18:22 |
<vgutierrez> |
pooling cp4032 running buster - T242093 |
[production] |
18:15 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventstreams' for release 'production' . |
[production] |
18:05 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:05 |
<robh@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:03 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:03 |
<robh@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:01 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:01 |
<robh@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
17:59 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
17:59 |
<robh@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
17:53 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
17:53 |
<robh@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
17:52 |
<_joe_> |
running systemctl reset-failed on conf1005 to clear useless alerts |
[production] |
17:33 |
<marostegui> |
Poweroff db2085:3311 and db2085:3318 for maintenance - T243148 |
[production] |
17:33 |
<jforrester@deploy1001> |
Synchronized static/images/project-logos: [trwiki] Tweak logo versions T242977 (duration: 01m 07s) |
[production] |
17:00 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
16:59 |
<akosiaris@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
16:58 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |