2017-10-27 §
07:40 <elukey> re-run wikidata-articleplaceholder_metrics-wf-2017-10-26 [analytics]
07:36 <elukey> stop & mask hadoop-httpfs.service on analytics1001 after https://gerrit.wikimedia.org/r/#/c/386684/ [analytics]
2017-10-26 §
16:58 <ottomata> now mirroring main Kafka cluster topics to jumbo Kafka cluster, with MirrorMaker instances running on analytics-eqiad broker nodes. https://phabricator.wikimedia.org/T177216 [analytics]
2017-10-25 §
13:32 <elukey> restart yarn nodemanager and hdfs datanode on analytics1030 to apply new JVM settings [analytics]
2017-10-24 §
20:29 <nuria_> started unique_devices-per_project_family-druid-daily-coord 0102816-170829140538136-oozie-oozi-C [analytics]
20:24 <nuria_> restarted job unique_devices-per_project_family-druid-monthly-coord 0102799-170829140538136-oozie-oozi-C [analytics]
20:23 <nuria_> restarted job uniques-monthly-per-domain-druid 0102785-170829140538136-oozie-oozi-C [analytics]
19:44 <nuria_> killing druid coordinators uniques-monthly and per-project-family: 0066771-170829140538136-oozie-oozi-C,0066767-170829140538136-oozie-oozi-C,0010139-170621131133576-oozie-oozi-C [analytics]
2017-10-23 §
18:50 <joal> Deploying AQS after fix [analytics]
13:30 <joal> deploy AQS from tin [analytics]
2017-10-19 §
20:04 <mforns> Deployed refinery using scap, then deployed onto hdfs [analytics]
11:44 <joal> deploying AQS in beta [analytics]
2017-10-16 §
17:32 <mforns> restarted EventLogging for changes in blacklist to take effect [analytics]
16:27 <joal> Re-Deploy AQS after monitoring fix [analytics]
16:14 <joal> Deploy AQS with new code [analytics]
2017-10-13 §
16:49 <ottomata> deployed refinery to use rand() for webrequest sampling [analytics]
2017-10-12 §
15:40 <elukey> run kafka preferred-replica-election to allow kafka1013 to re-join the topic leaders [analytics]
14:48 <elukey> disable httpfs access on analytics1001 [analytics]
2017-10-09 §
18:28 <ottomata> resuming oozie druid indexing jobs, 1004-1006 are offline [analytics]
16:34 <ottomata> stopping druid services on druid1006 [analytics]
16:05 <ottomata> pausing all druid oozie coordinators in preparation for druid public separation [analytics]
12:47 <joal> Kill restart oozie job loading mediawiki-history into druid [analytics]
12:14 <joal> Kill-Restart oozie jobs loading banner data into druid [analytics]
12:04 <joal> Deploy refinery onto HDFS [analytics]
11:47 <joal> Deploying refinery from scap [analytics]
08:53 <joal> Rerunning wikidata-articleplaceholder_metrics-wf-2017-10-7 after failure [analytics]
2017-10-06 §
11:10 <elukey> restart all druid daemons to pick up new logging changes [analytics]
11:08 <joal> Rerun pageview-druid-hourly-wf-2017-10-6-9 [analytics]
09:31 <elukey> restart all the druid daemons on druid1005 to apply the new logging rules [analytics]
08:49 <elukey> restarted all the druid broker daemons to pick up the new logging changes [analytics]
2017-10-05 §
13:48 <milimetric> restarted banner_activity-druid-monthly for September again [analytics]
2017-10-04 §
18:39 <ottomata> druid-analytics.svc.eqiad.wmnet:8082 should only be accessible to analytics networks [analytics]
17:32 <ottomata> deploying new LVS service for druid-analytics-broker [analytics]
2017-10-03 §
14:50 <milimetric> restarted failed workflow 0057215-170829140538136-oozie-oozi-W (druid monthly banner activity) [analytics]
2017-09-28 §
10:02 <elukey> re-enabled camus after maintenance [analytics]
09:51 <elukey> restart mapreduce history server on an1001 to apply new heap settings (Xmx/s to 4g) [analytics]
2017-09-27 §
15:18 <joal> Kill/restart stuck jobs [analytics]
14:45 <elukey> rolling restart of all the Yarn nodemanager daemons on analytics1028-1068 (ease heap consumption pressure, seamless restart) [analytics]
13:40 <elukey> manual failover of HDFS namenode from an1002 to an1001 [analytics]
13:17 <elukey> manual failover of HDFS namenode from an1001 to an1002 to test 6G max heap size [analytics]
13:14 <elukey> restart mapreduce history server on analytics1001 after crash (java.lang.OutOfMemoryError: GC overhead limit exceeded) [analytics]
2017-09-26 §
14:49 <joal> restart mobile_apps session_metrics bundle [analytics]
14:49 <joal> restart [analytics]
11:01 <joal> Restart mediawiki-history-denormalize and mediawiki-history-druid jobs after deploy [analytics]
10:58 <joal> Restart webrequest load job after deploy [analytics]
10:35 <joal> Deploying refinery onto HDFS [analytics]
10:25 <joal> Deploy Refinery with scap [analytics]
09:33 <joal> Releasing refinery-source v0.0.53 with Jenkins [analytics]
2017-09-25 §
08:41 <joal> Rerun mobile_apps-session_metrics-wf-2017-9-17 after failure [analytics]