analytics SAL

2017-12-05 §
10:31	<elukey>	disabled druid middlemanager on druid1003 with curl -X POST http://druid1003.eqiad.wmnet:8091/druid/worker/v1/disable	[analytics]
10:03	<elukey>	stop camus as precautionary measure before Hadoop masters reboot	[analytics]
09:57	<elukey>	suspend webrequest load bundle as extra precaution before Hadoop masters reboot	[analytics]
2017-12-04 §
16:29	<elukey>	restart webrequest-load-wf-upload-2017-12-4-12 (failed due to Hadoop reboots)	[analytics]
16:12	<elukey>	restart webrequest-load-wf-upload-2017-12-4-13 (failed due to Hadoop reboots)	[analytics]
15:09	<joal>	Rerun webrequest-load-wf-upload-2017-12-4-12 and webrequest-load-wf-upload-2017-12-4-13	[analytics]
15:08	<joal>	Rerunning 15:47:35 < fdans> whatuuuup mforns	[analytics]
14:17	<elukey>	re-run pageview-druid-hourly-wf-2017-12-4-11 in Hue (failed due to reboots)	[analytics]
12:04	<elukey>	re-run webrequest-load-wf-upload-2017-12-4-8 (failed due to reboots)	[analytics]
12:04	<elukey>	re-run webrequest-load-check_sequence_statistics-wf-upload-2017-12-4-7 (failed due to reboots)	[analytics]
2017-12-02 §
11:47	<joal>	Rerun unique_devices-per_project_family-monthly-wf-2017-11	[analytics]
2017-12-01 §
15:20	<elukey>	rerun webrequest-druid-hourly-wf-2017-12-1-8 after an unexpected Druid Overlord inconsistency	[analytics]
15:09	<elukey>	rerun pageview-druid-hourly-wf-2017-12-1-8 after an unexpected Druid Overlord inconsistency	[analytics]
13:07	<elukey>	re-run aqs-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots)	[analytics]
12:42	<elukey>	temporarily switch pivot's config to druid1002 (to reboot druid1001)	[analytics]
12:37	<elukey>	re-run webrequest-load-wf-upload-2017-12-1-10 and webrequest-load-wf-upload-2017-12-1-7 (failed due to Hadoop reboots)	[analytics]
12:36	<elukey>	re-run webrequest-load-wf-text-2017-12-1-10 and webrequest-load-wf-text-2017-12-1-9 (failed due to Hadoop reboots)	[analytics]
12:35	<elukey>	re-run pageview-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots)	[analytics]
12:34	<elukey>	re-run webrequest-druid-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots)	[analytics]
2017-11-30 §
18:20	<elukey>	re-run webrequest-load-wf-upload-2017-11-30-16 (failed due to Hadoop reboots)	[analytics]
18:19	<elukey>	re-run webrequest-load-wf-text-2017-11-30-14 (failed due to Hadoop reboots)	[analytics]
16:21	<joal>	wikidata-wdqs_extract-wf-2017-11-30-15	[analytics]
15:50	<elukey>	restart hue on thorium - timeouts and 500s	[analytics]
14:58	<joal>	Update druid overlord config to equalDistribution dynamically	[analytics]
2017-11-29 §
21:46	<joal>	rerun pageview-druid-hourly-wf-2017-11-29-18 and pageview-druid-hourly-wf-2017-11-29-19	[analytics]
21:19	<joal>	rerun webrequest-druid-hourly-wf-2017-11-29-18	[analytics]
2017-11-28 §
14:41	<ottomata>	restarting eventlogging on eventlog1001 for https://gerrit.wikimedia.org/r/#/c/393613/	[analytics]
09:08	<elukey>	log database on dbstore1002 dropped for good	[analytics]
2017-11-22 §
16:09	<ottomata>	restarting eventlogging services on eventlog1001	[analytics]
2017-11-20 §
18:28	<elukey>	deployed prometheus-druid-exporter (still not released in apt) on druid1004 for testing	[analytics]
15:45	<ottomata>	deploying fixes to EL EventCapsule discrepancies: https://phabricator.wikimedia.org/T179625#3755242	[analytics]
2017-11-16 §
15:25	<milimetric>	deployed refinery and running interlanguage links dataset now	[analytics]
2017-11-15 §
14:22	<addshore>	addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/r-library	[analytics]
14:22	<addshore>	addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/installRlib	[analytics]
2017-11-14 §
09:45	<elukey>	executed chmod g+rx /home/ezachte/wikistats_data/dumps to unblock Joseph (should be safe)	[analytics]
2017-11-13 §
21:20	<addshore>	addshore@stat1005:/srv/analytics-wmde/wdcm/src$ sudo -u analytics-wmde Rscript ./_installProduction_analytics-wmde.R	[analytics]
21:20	<addshore>	test	[analytics]
14:44	<joal>	Resuming all druid loading jobs after fixing restart issues	[analytics]
14:18	<joal>	Suspending pageview-druid-hourly-coord again while trying to fix druid loading	[analytics]
14:10	<joal>	Unsuspend pageview-druid-hourly-coord	[analytics]
13:08	<joal>	Suspend webrequest druid loading waiting for elukey	[analytics]
13:05	<joal>	Rerun webrequest-druid-hourly-wf-2017-11-13-11	[analytics]
11:15	<elukey>	suspend pageview-druid-hourly-coord to allow an easier druid daemon reload (new prometheus jvm agent)	[analytics]
2017-11-08 §
15:16	<ottomata>	deploying eventlogging analytics change for eventcapsule schema fixes, will be no-op until we deploy puppet changes too	[analytics]
11:28	<elukey>	resumed cassandra-coord-pageview-per-project-hourly after maintenance to aqs hosts	[analytics]
10:04	<elukey>	suspended cassandra-coord-pageview-per-project-hourly as prep step to reboot aqs nodes - T179943	[analytics]
2017-11-06 §
15:37	<milimetric>	found geowiki was hitting the wrong databases, updated it to always hit analytics-store	[analytics]
2017-11-03 §
10:55	<joal>	Kill mediawiki-history oozie job to prevent computing october snapshot before fixing reconstruction process	[analytics]
2017-11-02 §
08:54	<elukey>	relaunched failed pageview-druid-hourly jobs - Druid indexation check failures in the logs (01 Nov 2017 21:00:00 and 01 Nov 2017 19:00:00)	[analytics]
2017-11-01 §
20:06	<ottomata>	rerunning pageview-druid-hourly-wf-2017-11-1-18	[analytics]