2017-02-09
16:17 <marostegui> Shutdown db2060 for maintenance - T156161 [production]
16:15 <marostegui> Compressing commonswiki on labsdb1009 - T153743 [production]
16:08 <ema> lvs1012: upgrade to jessie 8.7, pybal 1.13.4, reboot into kernel 4.4.2-3+wmf8 T155401 [production]
16:06 <jynus> rolling restart of replication threads for dbstore1002/2001/2002 T111654 [production]
15:49 <elukey@puppetmaster1001> conftool action : set/pooled=yes; selector: name=aqs1009.eqiad.wmnet [production]
15:42 <godog> roll-restart diamond to pick up graphite2001 changes [production]
15:30 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1086 - T136428 (duration: 00m 44s) [production]
15:23 <ema> shutdown cp3020 T130883 [production]
15:21 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1086 - T136428 (duration: 00m 40s) [production]
15:19 <elukey> restarting all Analytics Kafka brokers for Java security upgrades [production]
15:10 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1079 - T136428 (duration: 00m 40s) [production]
15:01 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1079 - T136428 (duration: 00m 43s) [production]
14:53 <moritzm> upgrading hhvm on mw1189-mw1199 and mw1293/mw1294 [production]
14:48 <godog> move diamond traffic to graphite2001 - T157022 [production]
14:46 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1062 - T136428 (duration: 00m 41s) [production]
14:20 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1062 - T136428 (duration: 00m 45s) [production]
14:01 <gehel@puppetmaster1001> conftool action : set/pooled=no; selector: name=elastic20(21|22|23|24).codfw.wmnet [production]
13:45 <gehel@puppetmaster1001> conftool action : set/pooled=yes; selector: name=elastic2020.codfw.wmnet [production]
13:34 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1028 - T153300 (duration: 00m 41s) [production]
13:11 <moritzm> upgrading firejail on sca cluster [production]
12:52 <gehel> killing salt runs stuck on failing reimage of elastic2018 [production]
12:37 <mforns@tin> Finished deploy [analytics/refinery@9e689f3]: (no justification provided) (duration: 03m 05s) [production]
12:34 <mforns@tin> Started deploy [analytics/refinery@9e689f3]: (no justification provided) [production]
11:56 <moritzm> upgrading hhvm on mw1170-mw1188 (also effecting updates of openssl, libgd, lcms, gnutls, sqlite, libxpm and glibc) [production]
11:39 <gehel> failed reimage on elastic201[89], restarting [production]
10:54 <moritzm> deploy exim and openssh bugfix updates from jessie point release [production]
10:51 <moritzm> upgrading java on kafka clusters and druid [production]
10:49 <elukey> restarting Java daemons on druid100[123] for security upgrades [production]
10:42 <jynus> preparing to reimage db2040 T111654 [production]
10:37 <jynus@tin> Synchronized wmf-config/db-codfw.php: Depool db2040 (duration: 00m 40s) [production]
10:26 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1034 - T111654 (duration: 00m 41s) [production]
10:09 <hashar> Restarted Jenkins on contint1001 [production]
10:04 <hashar> Running package upgrades on contint2001 [production]
10:03 <elukey> restore Hadoop master to an1001 [production]
09:57 <elukey> failover Hadoop masters from an1001 to an1002 to allow Java upgrades [production]
09:52 <gehel> cleaning up logs on elastic20(01|16) - T139043 [production]
09:50 <elukey> restarting oozie and hive on analytics1003 for java security upgrades [production]
09:39 <marostegui> Deploy alter table on eqiad hosts for s7 metawiki and wiki on the echo_notification tables - T136428 [production]
09:38 <jynus> upgrading and restarting db1034 T111654 [production]
09:34 <jynus@tin> Synchronized wmf-config/db-eqiad.php: Depool db1034 (duration: 00m 44s) [production]
09:32 <gehel@puppetmaster1001> conftool action : set/pooled=no; selector: name=elastic20(17|18|19|20).codfw.wmnet [production]
09:20 <jynus@tin> Synchronized wmf-config/db-codfw.php: Repool db2057 (duration: 00m 41s) [production]
09:17 <gehel> restarting blazegraph on wdqs1003 to ensure proper war is loaded [production]
09:10 <marostegui> Deploy alter table on codfw hosts for s7 metawiki and wiki on the echo_notification tables - T136428 [production]
09:08 <moritzm> restarting archiva on meitnerium for java security update [production]
09:07 <elukey> Executing Cassandra nodetool cleanup on aqs1006-{a,b} (one at a time) and aqs1009-a [production]
09:01 <elukey> restarting java daemons on all the Hadoop nodes for security upgrades [production]
08:59 <gehel> cleaning empty logs on elastic10(22|24|40) - thanks elukey ! [production]
08:51 <moritzm> installing Java security updates on Hadoop cluster [production]
08:45 <moritzm> installing Java security updates on stat* and contint1001 [production]