analytics SAL

2017-12-01 §
13:07	<elukey>	re-run aqs-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots)	[analytics]
12:42	<elukey>	temporarily switch pivot's config to druid1002 (to reboot druid1001)	[analytics]
12:37	<elukey>	re-run webrequest-load-wf-upload-2017-12-1-10 and webrequest-load-wf-upload-2017-12-1-7 (failed due to Hadoop reboots)	[analytics]
12:36	<elukey>	re-run webrequest-load-wf-text-2017-12-1-10 and webrequest-load-wf-text-2017-12-1-9 (failed due to Hadoop reboots)	[analytics]
12:35	<elukey>	re-run pageview-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots)	[analytics]
12:34	<elukey>	re-run webrequest-druid-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots)	[analytics]
2017-11-30 §
18:20	<elukey>	re-run webrequest-load-wf-upload-2017-11-30-16 (failed due to hadoop reboots)	[analytics]
18:19	<elukey>	re-run webrequest-load-wf-text-2017-11-30-14 (failed due to hadoop reboots)	[analytics]
16:21	<joal>	wikidata-wdqs_extract-wf-2017-11-30-15	[analytics]
15:50	<elukey>	restart hue on thorium - timeouts and 500s	[analytics]
14:58	<joal>	Update druid overlord config to equalDistribution dynamically	[analytics]
2017-11-29 §
21:46	<joal>	rerun pageview-druid-hourly-wf-2017-11-29-18 and pageview-druid-hourly-wf-2017-11-29-19	[analytics]
21:19	<joal>	rerun webrequest-druid-hourly-wf-2017-11-29-18	[analytics]
2017-11-28 §
14:41	<ottomata>	restarting eventlogging on eventlog1001 for https://gerrit.wikimedia.org/r/#/c/393613/	[analytics]
09:08	<elukey>	log database on dbstore1002 dropped for good	[analytics]
2017-11-22 §
16:09	<ottomata>	restarting eventlogging services on eventlog1001	[analytics]
2017-11-20 §
18:28	<elukey>	deployed prometheus-druid-exporter (still not released in apt) on druid1004 for testing	[analytics]
15:45	<ottomata>	deploying fixes to EL EventCapsule discrepancies: https://phabricator.wikimedia.org/T179625#3755242	[analytics]
2017-11-16 §
15:25	<milimetric>	deployed refinery and running interlanguage links dataset now	[analytics]
2017-11-15 §
14:22	<addshore>	addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/r-library	[analytics]
14:22	<addshore>	addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/installRlib	[analytics]
2017-11-14 §
09:45	<elukey>	executed chmod g+rx /home/ezachte/wikistats_data/dumps to unblock Joseph (should be safe)	[analytics]
2017-11-13 §
21:20	<addshore>	addshore@stat1005:/srv/analytics-wmde/wdcm/src$ sudo -u analytics-wmde Rscript ./_installProduction_analytics-wmde.R	[analytics]
21:20	<addshore>	test	[analytics]
14:44	<joal>	Resuming all druid loading jobs after fixing restart issues	[analytics]
14:18	<joal>	Suspending pageview-druid-hourly-coord again, trying to fix druid loading	[analytics]
14:10	<joal>	Unsuspend pageview-druid-hourly-coord	[analytics]
13:08	<joal>	Suspend webrequest druid loading waiting for elukey	[analytics]
13:05	<joal>	Rerun webrequest-druid-hourly-wf-2017-11-13-11	[analytics]
11:15	<elukey>	suspend pageview-druid-hourly-coord to allow an easier druid daemon reload (new prometheus jvm agent)	[analytics]
2017-11-08 §
15:16	<ottomata>	deploying eventlogging analytics change for eventcapsule schema fixes, will be no-op until we deploy puppet changes too	[analytics]
11:28	<elukey>	resumed cassandra-coord-pageview-per-project-hourly after maintenance to aqs hosts	[analytics]
10:04	<elukey>	suspended cassandra-coord-pageview-per-project-hourly as prep step to reboot aqs nodes - T179943	[analytics]
2017-11-06 §
15:37	<milimetric>	found geowiki was hitting the wrong databases, updated it to always hit analytics-store	[analytics]
2017-11-03 §
10:55	<joal>	Kill mediawiki-history oozie job to prevent computing october snapshot before fixing reconstruction process	[analytics]
2017-11-02 §
08:54	<elukey>	relaunched failed pageview-druid-hourly jobs - Druid indexation check failures in the logs (01 Nov 2017 21:00:00 and 01 Nov 2017 19:00:00)	[analytics]
2017-11-01 §
20:06	<ottomata>	rerunning pageview-druid-hourly-wf-2017-11-1-18	[analytics]
19:05	<ottomata>	deploying refinery with refinery/source 0.0.54 for JsonRefine job T162610	[analytics]
18:40	<ottomata>	rerunning unique_devices-per_project_family-druid-monthly-wf-2017-10	[analytics]
2017-10-30 §
10:12	<elukey>	added Francisco to the analytics-alerts@ mailing list	[analytics]
2017-10-27 §
07:40	<elukey>	re-run wikidata-articleplaceholder_metrics-wf-2017-10-26	[analytics]
07:36	<elukey>	stop & mask hadoop-httpfs.service on analytics1001 after https://gerrit.wikimedia.org/r/#/c/386684/	[analytics]
2017-10-26 §
16:58	<ottomata>	now mirroring main Kafka cluster topics to jumbo Kafka cluster, with MirrorMaker instances running on analytics-eqiad broker nodes. https://phabricator.wikimedia.org/T177216	[analytics]
2017-10-25 §
13:32	<elukey>	restart yarn nodemanager and hdfs datanode on analytics1030 to apply new JVM settings	[analytics]
2017-10-24 §
20:29	<nuria_>	started unique_devices-per_project_family-druid-daily-coord 0102816-170829140538136-oozie-oozi-C	[analytics]
20:24	<nuria_>	restarted job unique_devices-per_project_family-druid-monthly-coord 0102799-170829140538136-oozie-oozi-C	[analytics]
20:23	<nuria_>	restarted job uniques-monthly-per-domain-druid 0102785-170829140538136-oozie-oozi-C	[analytics]
19:44	<nuria_>	killing druid coordinators uniques-monthly and per-project-family: 0066771-170829140538136-oozie-oozi-C,0066767-170829140538136-oozie-oozi-C,0010139-170621131133576-oozie-oozi-C	[analytics]
2017-10-23 §
18:50	<joal>	Deploying AQS after fix	[analytics]
13:30	<joal>	deploy AQS from tin	[analytics]