2018-01-18 |
14:44 |
<herron> |
disabling puppet agents during deploy of 404587, 404689 |
[production] |
14:39 |
<ema> |
cache_upload: upgrade cp3038 to varnish 5 |
[production] |
14:39 |
<godog> |
restart hhvm on mw1233 |
[production] |
14:31 |
<_joe_> |
restarting hhvm on a few API appservers |
[production] |
14:30 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1087 - T174569 (duration: 01m 12s) |
[production] |
14:28 |
<ema> |
cache_upload: repool cp3035 (varnish 5) |
[production] |
14:25 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Promote db2043 to s3 master after db2036 crash (duration: 01m 12s) |
[production] |
14:25 |
<godog> |
restart hhvm on mw1227 |
[production] |
14:23 |
<ema> |
cache_upload: upgrade cp3035 to varnish 5 |
[production] |
14:19 |
<jynus> |
starting mysql on db2043 |
[production] |
14:17 |
<jynus> |
stopping mysql on db2043 |
[production] |
14:10 |
<zeljkof> |
EU SWAT finished |
[production] |
14:10 |
<ema> |
cache_upload: repool cp3037 (varnish 5) |
[production] |
14:09 |
<zfilipin@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:404911|Change autoconfirmed settings and Enable flood group at zhwikibooks (T185182)]] (duration: 01m 13s) |
[production] |
13:54 |
<ema> |
cache_upload: upgrade cp3037 to varnish 5 |
[production] |
13:49 |
<moritzm> |
upgrade mw* servers in eqiad running 3.18.5+dfsg-1+wmf3 (recent installations) to 3.18.5+dfsg-1+wmf4 |
[production] |
13:19 |
<jynus> |
changing topology of codfw s3 databases |
[production] |
13:05 |
<akosiaris> |
reboot poolcounter2001 for PCID/INVPCID CPU feature enabling |
[production] |
13:03 |
<akosiaris> |
reboot webperf1001 for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
12:57 |
<akosiaris> |
enable puppet across the fleet after nitrogen (puppetdb) reboot |
[production] |
12:56 |
<akosiaris> |
reboot nitrogen for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
12:52 |
<jgleeson> |
turned on donations queue consumer process-control job (actual time of change 17/01/18 ~16:20) |
[production] |
12:45 |
<akosiaris> |
reboot seaborgium for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
12:43 |
<elukey> |
bohrium rebooted for kernel upgrades |
[production] |
12:43 |
<akosiaris> |
disable puppet across the fleet for nitrogen (puppetdb) reboot |
[production] |
12:40 |
<elukey> |
set piwik in readonly mode and stopped mysql on bohrium (prep step for reboot) |
[production] |
12:36 |
<akosiaris> |
reboot chlorine.eqiad.wmnet etcd1003.eqiad.wmnet etcd1005.eqiad.wmnet fermium.wikimedia.org install1002.wikimedia.org krypton.eqiad.wmnet kubestagetcd1003.eqiad.wmnet logstash1009.eqiad.wmnet mwdebug1001.eqiad.wmnet sca1004.eqiad.wmnet for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
11:34 |
<akosiaris> |
reboot logstash1008 etcd1002 kubestagetcd1002.eqiad.wmnet for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
11:12 |
<ema> |
cp3046: restart varnish-be due to mbox lag |
[production] |
11:06 |
<volans> |
disabled puppet on tegmen to test impact on puppetdb - T170740 |
[production] |
10:57 |
<akosiaris> |
reboot actinium.wikimedia.org aluminium.wikimedia.org argon.eqiad.wmnet boron.eqiad.wmnet bromine.eqiad.wmnet darmstadtium.eqiad.wmnet dbmonitor1001.wikimedia.org dubnium.wikimedia.org dysprosium.wikimedia.org etcd1001.eqiad.wmnet etcd1004.eqiad.wmnet fermium.wikimedia.org hassium.eqiad.wmnet kubestagetcd1001.eqiad.wmnet logstash1007.eqiad.wmnet meitnerium.wikimedia.org mendelevium.eqiad.wmnet mwdebug1002.eqiad.wmnet m |
[production] |
10:45 |
<ema> |
cp3034: restart varnishxcps and varnishmedia, they were both using 100% of a cpu core |
[production] |
10:35 |
<Amir1> |
ladsgroup@terbium:/srv/mediawiki/php-1.31.0-wmf.17$ mwscript extensions/WikibaseQualityConstraints/maintenance/ImportConstraintStatements.php --wiki wikidatawiki (T184720) |
[production] |
10:30 |
<akosiaris> |
reboot etherpad1001 for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
10:29 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Remove db2034 from s1 as it will be in x1 - T184888 (duration: 01m 12s) |
[production] |
10:25 |
<mobrovac@tin> |
Finished deploy [restbase/deploy@5c353f7]: Use stable package names, normalise cache-control headers, update top definition, take #2 - T184199 T184833 T184541 (duration: 12m 18s) |
[production] |
10:12 |
<mobrovac@tin> |
Started deploy [restbase/deploy@5c353f7]: Use stable package names, normalise cache-control headers, update top definition, take #2 - T184199 T184833 T184541 |
[production] |
10:10 |
<mobrovac@tin> |
Finished deploy [restbase/deploy@04e7cdb]: Use stable package names, normalise cache-control headers, update top definition - T184199 T184833 T184541 (duration: 02m 29s) |
[production] |
10:07 |
<mobrovac@tin> |
Started deploy [restbase/deploy@04e7cdb]: Use stable package names, normalise cache-control headers, update top definition - T184199 T184833 T184541 |
[production] |
10:07 |
<moritzm> |
rebooting rdb1002/rdb1004/rdb1006/rdb1008 for kernel security update |
[production] |
09:58 |
<akosiaris> |
reboot etcd1006 for PCID, INVPCID feature enabling (INVPCID not supported on current hardware, but still enabling it cluster wide) |
[production] |
09:49 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1067 - T162807 (duration: 01m 12s) |
[production] |
09:43 |
<ema> |
cache_upload: repooled cp3034 running varnish 5 |
[production] |
09:38 |
<elukey> |
reboot thorium (analytics webserver) for security upgrade - This maintenance will cause temporary unavailability of the Analytics websites |
[production] |
09:27 |
<marostegui> |
Stop replication in sync db1089 and db2048 (codfw master) - T162807 |
[production] |
09:26 |
<jynus> |
reimage es2003 to stretch |
[production] |
09:21 |
<elukey> |
reboot druid1001 for kernel upgrades |
[production] |
09:20 |
<akosiaris> |
reboot oresrdb2001 for PCID/INVPCID CPU feature enabling |
[production] |
09:10 |
<akosiaris> |
reboot alcyone pollux sca2004 poolcounter2002 serpens for PCID/INVPCID CPU feature enabling |
[production] |
09:07 |
<marostegui> |
Stop replication in sync db1089 db1067 - T162807 |
[production] |