5401-5450 of 10000 results (43ms)
2016-11-25 §
10:20 <_joe_> upgrading HHVM across codfw [production]
09:23 <_joe_> upgraded hhvm on the debug hosts [production]
08:58 <_joe_> uploading hhvm_3.12.7+dfsg-1+wmf4 to apt [production]
08:53 <volans> restarting zotero on sca1003, almost out of RAM, puppet failing [production]
08:52 <elukey> restarting Yarn and HDFS masters on analytics100[12] (Hadoop cluster) to complete the openjdk update [production]
07:51 <marostegui> Stopping replication db1052 for maintenance - T151607 [production]
02:22 <l10nupdate@tin> ResourceLoader cache refresh completed at Fri Nov 25 02:22:40 UTC 2016 (duration 4m 20s) [production]
02:18 <l10nupdate@tin> scap sync-l10n completed (1.29.0-wmf.3) (duration: 06m 48s) [production]
2016-11-24 §
17:25 <_joe_> turned off additional workers for htmlcacheupdate on commonswiki as the queue has reduced to acceptable sizes (T151196) [production]
15:03 <ema> uploaded varnish 4.1.3-1wm4 to carbon main component, replacing version 3.0.6plus-wm9 (T150660) [production]
14:47 <ema> uploaded varnishkafka 1.0.12-1 to carbon main component, replacing version 1.0.7-1 (T150660) [production]
13:31 <akosiaris> balance the load between thumbor1001 and thumbor1002 evenly [production]
13:31 <akosiaris@puppetmaster1001> conftool action : set/weight=10; selector: thumbor1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=thumbor', 'service=thumbor']) [production]
13:20 <akosiaris@puppetmaster1001> conftool action : set/weight=5; selector: thumbor1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=thumbor', 'service=thumbor']) [production]
13:04 <akosiaris@puppetmaster1001> conftool action : set/weight=20; selector: thumbor1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=thumbor', 'service=thumbor']) [production]
12:54 <gilles> restarting thumbor on thumbor1001 [production]
12:49 <akosiaris> lower thumbor1001 load by 50% to easy debugging [production]
12:48 <gilles> restarting thumbor on thumbor1001 [production]
12:48 <akosiaris@puppetmaster1001> conftool action : set/weight=5; selector: thumbor1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=thumbor', 'service=thumbor']) [production]
12:36 <elukey> launched preferred-replica-election to re-add kafka1022 among the Topic partition leader brokers of the Analytics Kafka cluster (all metrics looks good) [production]
11:41 <hoo> Killed the Wikidata JSON dump creation on snapshot1007: Wont succeed before Monday, due to T151356 [production]
10:13 <_joe_> running commonswiki htmlCacheUpdate jobs on terbium to catch up with the backlog, monitoring caches for vhtcpd queue overflows T151196 [production]
09:38 <marostegui> Stopping replication db1052 (depooled) for maintenance - T150960 [production]
08:59 <marostegui> Deploy alter table S5 - dewiki.revision on db1092 (depooled) - T148967 [production]
08:15 <_joe_> uploaded calico-cni 1.5.1 to jessie-wikimedia [production]
07:32 <marostegui> Stopping MySQL db2070 for maintenance - https://phabricator.wikimedia.org/T149553 [production]
02:35 <l10nupdate@tin> ResourceLoader cache refresh completed at Thu Nov 24 02:35:10 UTC 2016 (duration 5m 15s) [production]
02:29 <l10nupdate@tin> scap sync-l10n completed (1.29.0-wmf.3) (duration: 10m 39s) [production]
00:28 <reedy@tin> Synchronized php-1.29.0-wmf.3/extensions/CentralAuth/maintenance/populateLocalAndGlobalIds.php: Some perf related improvements (duration: 00m 45s) [production]
00:12 <demon@tin> Synchronized docroot/foundation/: rm more junk (duration: 00m 45s) [production]
2016-11-23 §
23:11 <godog> cleanup older labs instances metrics from 'instances' hierarchy on graphite1001 [production]
22:57 <mutante> phab2001 - installing vim upgrade [production]
22:52 <godog> cleanup older labs instances metrics from 'instances' hierarchy on graphite2001 [production]
21:59 <mutante> gerrit restarting for config change 323179 [production]
21:07 <demon@tin> Finished scap: pruning old deployment branches (duration: 19m 14s) [production]
20:48 <demon@tin> Started scap: pruning old deployment branches [production]
20:42 <XenoRyet> Updated payments-wiki from f8ca94201a3f69ee8176f4984f1caa410ac90c49 to d7ed14407aa7be9a790778cae644c2b320bb7aa4 [production]
19:24 <godog> swift eqiad-prod: ms-be1027 to weight 2000 T136631 [production]
18:56 <marostegui> Shutting down db2034 for maintenance - T149553 [production]
18:04 <volans@puppetmaster1001> conftool action : set/pooled=inactive; selector: name=mw2092.codfw.wmnet [production]
17:58 <demon@tin> Synchronized php-1.29.0-wmf.3/extensions/CentralAuth/maintenance/populateLocalAndGlobalIds.php: (no message) (duration: 00m 53s) [production]
17:36 <marostegui> Stopping MySQL on db2070 for maintenance - https://phabricator.wikimedia.org/T149553 [production]
16:24 <marostegui> Setting offline disk [32:4] on db1053 - looks like it is causing repl issues [production]
16:01 <marostegui> Stopping replication db2070 for maintenance - T149553 [production]
15:50 <dcausse> elastic@eqiad: ruwiki reindex done (T148344) [production]
14:37 <dcausse> elastic@eqiad: reindexing ruwiki from terbium, logs in ~dcausse/bm25_reindex/cirrus_log (T148344) [production]
14:33 <jynus> rebooting, upgrading db1092 while it is depooled for maintenance [production]
14:31 <marostegui> Stopping replication db1095 (not pooled) - maintenance - T150960 [production]
11:48 <_joe_> uploaded calico/kube-policy-controller:0.5.0 to the docker registry [production]
10:24 <marostegui> Stopping replication on the following m3 hosts for maintenance - db1048, dbstore1002 (m3 instance), db2012 - T151384 [production]