analytics SAL

201-250 of 1135 results (13ms)

2017-10-27 §
07:40	<elukey>	re-run wikidata-articleplaceholder_metrics-wf-2017-10-26	[analytics]
07:36	<elukey>	stop & mask hadoop-httpfs.service on analytics1001 after https://gerrit.wikimedia.org/r/#/c/386684/	[analytics]
2017-10-26 §
16:58	<ottomata>	now mirroring main Kafka cluster topics to jumbo Kafka cluster, with MirrorMaker instances running on analytics-eqiad broker nodes. https://phabricator.wikimedia.org/T177216	[analytics]
2017-10-25 §
13:32	<elukey>	restart yarn nodemanager and hdfs datanode on analytics1030 to apply new JVM settings	[analytics]
2017-10-24 §
20:29	<nuria_>	started unique_devices-per_project_family-druid-daily-coord 0102816-170829140538136-oozie-oozi-C	[analytics]
20:24	<nuria_>	restarted job unique_devices-per_project_family-druid-monthly-coord 0102799-170829140538136-oozie-oozi-C	[analytics]
20:23	<nuria_>	restarted job uniques-monthly-per-domain-druid 0102785-170829140538136-oozie-oozi-C	[analytics]
19:44	<nuria_>	killing druid coordinators uniques-monthly and per-project-family: 0066771-170829140538136-oozie-oozi-C,0066767-170829140538136-oozie-oozi-C,0010139-170621131133576-oozie-oozi-C	[analytics]
2017-10-23 §
18:50	<joal>	Deploying AQS after fix	[analytics]
13:30	<joal>	deploy AQS from tin	[analytics]
2017-10-19 §
20:04	<mforns>	Deployed refinery using scap, then deployed onto hdfs	[analytics]
11:44	<joal>	deploying AQS in beta	[analytics]
11:44	<joal>	deploying AQS in b	[analytics]
2017-10-16 §
17:32	<mforns>	restarted EventLogging for changes in blacklist to take effect	[analytics]
16:27	<joal>	Re-Deploy AQS after monitoring fix	[analytics]
16:14	<joal>	Deploy AQS with new code	[analytics]
2017-10-13 §
16:49	<ottomata>	deployed refinery to use rand() for webrequest sampling	[analytics]
2017-10-12 §
15:40	<elukey>	run kafka preferred-replica-election to allow kafka1013 to re-join the topic leaders	[analytics]
14:48	<elukey>	disable httpfs access on analytics1001	[analytics]
2017-10-09 §
18:28	<ottomata>	resuming oozie druid indexing jobs, 1004-1006 are offline	[analytics]
16:34	<ottomata>	stopping druid services on druid1006	[analytics]
16:05	<ottomata>	pausing all druid oozie coordinators in preperation for druid public separation	[analytics]
12:47	<joal>	Kill restart oozie job lading mediawiki-history into druid	[analytics]
12:14	<joal>	Kill-Restart oozie jobs loading banner data into druid	[analytics]
12:04	<joal>	Deploy refinery onto HDFS	[analytics]
11:47	<joal>	Deploying refinery from scap	[analytics]
08:53	<joal>	Rerunning wikidata-articleplaceholder_metrics-wf-2017-10-7 after failure	[analytics]
2017-10-06 §
11:10	<elukey>	restart all druid daemons to pick up new logging changes	[analytics]
11:08	<joal>	Rerun pageview-druid-hourly-wf-2017-10-6-9	[analytics]
09:31	<elukey>	restart all the druid daemons on druid1005 to apply the new logging rules	[analytics]
08:49	<elukey>	restarted all the druid broker daemons to pick up the new logging changes	[analytics]
2017-10-05 §
13:48	<milimetric>	restarted banner_activity-druid-monthly for September again	[analytics]
2017-10-04 §
18:39	<ottomata>	druid-analytics.svc.eqiad.wmnet:8082 should only be accessible to analytics networks	[analytics]
17:32	<ottomata>	deploying new LVS service for druid-analytics-broker	[analytics]
2017-10-03 §
14:50	<milimetric>	restarted failed workflow 0057215-170829140538136-oozie-oozi-W (druid monthly banner activity)	[analytics]
2017-09-28 §
10:02	<elukey>	renabled camus after maintenance	[analytics]
09:51	<elukey>	restart mapreduce history server on an1001 to apply new heap settings (Xmx/s to 4g)	[analytics]
2017-09-27 §
15:18	<joal>	Kill/restart stuck jobs	[analytics]
14:45	<elukey>	rolling restart of all the Yarn nodemanager daemons on analytics1028-1068 (ease heap consumption pressure, seamless restart)	[analytics]
13:40	<elukey>	manual failover of HDFS namenode from an1002 to an1001	[analytics]
13:17	<elukey>	manual failover of HDFS namenode from an1001 to an1002 to test 6G max heap size	[analytics]
13:14	<elukey>	restart mapreduce history server on analytics1001 after crash (java.lang.OutOfMemoryError: GC overhead limit exceeded)	[analytics]
2017-09-26 §
14:49	<joal>	restart mobile_apps session_metrics bundle	[analytics]
14:49	<joal>	restart	[analytics]
11:01	<joal>	Restart mediawiki-history-denormalize and mediawiki-history-druid jobs after deploy	[analytics]
10:58	<joal>	Restart webrequest load job after deploy	[analytics]
10:35	<joal>	Deploying refinery onto HDFS	[analytics]
10:25	<joal>	Deploy Refinery with scap	[analytics]
09:33	<joal>	Releasing refinery-source v0.0.53 with Jenskins	[analytics]
2017-09-25 §
08:41	<joal>	Rerun mobile_apps-session_metrics-wf- 2017-9-17 after failure	[analytics]