2019-05-15
§
|
11:27 |
<akosiaris@deploy1001> |
scap-helm citoid cluster staging completed |
[production] |
11:27 |
<akosiaris@deploy1001> |
scap-helm citoid upgrade -f citoid-staging-values.yaml staging stable/citoid [namespace: citoid, clusters: staging] |
[production] |
10:31 |
<elukey> |
superset.wikimedia.org moved to analytics-tool1004 (Buster + python 3.7 + Superset 0.32 upgrade) |
[production] |
10:27 |
<moritzm> |
installing linux 4.9.168-1+deb9u2 kernel on stretch hosts (no reboots, just installing the new package) |
[production] |
10:04 |
<elukey@deploy1001> |
Finished deploy [analytics/superset/deploy@9cdb9c5]: Superset 0.32 - update pyhive dependency (duration: 00m 26s) |
[production] |
10:04 |
<elukey@deploy1001> |
Started deploy [analytics/superset/deploy@9cdb9c5]: Superset 0.32 - update pyhive dependency |
[production] |
09:33 |
<hashar> |
Disable CI castor cache system since the instance is being migrated. Some / most CI jobs might have failed for the last 20 minutes or so T223148 |
[production] |
08:45 |
<elukey@deploy1001> |
Finished deploy [analytics/superset/deploy@31c2c30]: Superset 0.32 (duration: 00m 26s) |
[production] |
08:44 |
<elukey@deploy1001> |
Started deploy [analytics/superset/deploy@31c2c30]: Superset 0.32 |
[production] |
08:36 |
<elukey> |
stop superset on analytics-tool1003 as prep step for the migration to the new host - T212243 |
[production] |
08:31 |
<moritzm> |
rebooting mw2164 |
[production] |
07:33 |
<elukey> |
restart nutcracker on mw2245 to pick up config changes (removal of memcached config) |
[production] |
07:29 |
<elukey> |
powercycle an-worker1094 (OEM event occurred, checking if temporary) |
[production] |
07:21 |
<oblivian@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Remove the php7 beta feature T219128 (duration: 00m 59s) |
[production] |
06:24 |
<elukey> |
force remount of /mnt/hdfs on stat1007 - fuse hdfs stuck |
[production] |
01:40 |
<eileen> |
process control updated - omnigroupmember.load re-enabled |
[production] |
01:39 |
<eileen> |
civicrm revision changed from 5024c968ed to 4b6d569383, config revision is a099f13a55 |
[production] |
2019-05-14
§
|
20:44 |
<herron@deploy1001> |
Finished deploy [logstash/plugins@7fb8843]: adding logstash-filter-truncate plugin (duration: 00m 07s) |
[production] |
20:43 |
<herron@deploy1001> |
Started deploy [logstash/plugins@7fb8843]: adding logstash-filter-truncate plugin |
[production] |
20:41 |
<herron@deploy1001> |
Finished deploy [logstash/plugins@7fb8843]: (no justification provided) (duration: 00m 01s) |
[production] |
20:41 |
<herron@deploy1001> |
Started deploy [logstash/plugins@7fb8843]: (no justification provided) |
[production] |
20:13 |
<chaomodus> |
restarting gerrit on cobalt to pick up metrics export changes |
[production] |
19:37 |
<herron> |
adding logstash filter truncate plugin to prod logstash collectors |
[production] |
19:28 |
<gehel> |
shutting down elastic2038 for memory replacement - T217398 |
[production] |
19:25 |
<gehel> |
ban elastic2038 from elasticsearch cluster for memory replacement - T217398 |
[production] |
18:21 |
<mutante> |
mwmaint1002 - deleting /root/home-mwmaint2001 to save space - confirmed we have bacula backups of home on mwmaint2001 |
[production] |
17:55 |
<mutante> |
elastic2029 - enable puppet agent - was disabled without reason and nobody seems to have logged in recently |
[production] |
17:54 |
<mutante> |
elastic2038 - restart nagios-nrpe-server - attempt to fix "CHECK_NRPE STATE UNKNOWN" for a single check |
[production] |
17:32 |
<mutante> |
contint1001 - mkdir /srv/zuul-logs ; mv /var/log/zuul/debug.log* /srv/zuul-logs/ to prevent CI running out of disk again (T207707) |
[production] |
17:22 |
<mbsantos@deploy1001> |
Finished deploy [proton/deploy@881b22b]: Update chromium-render to 8cc96e7 make timeout handler more robust (T217724) (duration: 02m 23s) |
[production] |
17:20 |
<mbsantos@deploy1001> |
Started deploy [proton/deploy@881b22b]: Update chromium-render to 8cc96e7 make timeout handler more robust (T217724) |
[production] |
16:30 |
<jynus> |
stop replication and start table recompression on labsdb1009 T222978 |
[production] |
16:22 |
<godog> |
statsd_exporter 0.9 upgrade on thumbor - T220709 |
[production] |
16:04 |
<gilles@deploy1001> |
Finished deploy [performance/coal@5a32eb2]: T221401 (duration: 00m 06s) |
[production] |
16:04 |
<gilles@deploy1001> |
Started deploy [performance/coal@5a32eb2]: T221401 |
[production] |
15:56 |
<jforrester@deploy1001> |
Synchronized php-1.34.0-wmf.4/extensions/VisualEditor/includes/ApiVisualEditor.php: Hot-deploy VE unset variable fix T223281 (duration: 00m 55s) |
[production] |
15:51 |
<jforrester@deploy1001> |
Synchronized php-1.34.0-wmf.5/extensions/VisualEditor/includes/ApiVisualEditor.php: Hot-deploy VE unset variable fix T223281 (duration: 00m 57s) |
[production] |
15:49 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@81059c6]: Deploy new reqs for reports (duration: 00m 55s) |
[production] |
15:49 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@81059c6]: Deploy new reqs for reports |
[production] |
15:43 |
<jynus> |
reload haproxy config @ dbproxy1010, dbproxy1011 |
[production] |
15:38 |
<XioNoX> |
re-activate bgp to telia on cr1-codfw - T222967 |
[production] |
15:33 |
<XioNoX> |
deactivate bgp to telia on cr1-codfw - T222967 |
[production] |
15:19 |
<papaul> |
shutting down elastic2038 for memory replacement |
[production] |
15:14 |
<hashar> |
mw1263: scap pull |
[production] |
14:53 |
<hashar@deploy1001> |
rebuilt and synchronized wikiversions files: group0 to 1.34.0-wmf.5 |
[production] |
14:50 |
<moritzm> |
rebooting mw1263 for kernel update |
[production] |
14:47 |
<hashar@deploy1001> |
Finished scap: testwiki to 1.34.0-wmf.5 and rebuild l10n cache (duration: 62m 47s) |
[production] |
14:07 |
<_joe_> |
apt-get lean on mwmaint1002 |
[production] |
13:44 |
<hashar@deploy1001> |
Started scap: testwiki to 1.34.0-wmf.5 and rebuild l10n cache |
[production] |
13:44 |
<godog> |
rearm keyholder on deploy and cumin hosts |
[production] |