2017-12-01 §
15:09 <elukey> rerun pageview-druid-hourly-wf-2017-12-1-8 after an unexpected Druid Overlord inconsistency [analytics]
13:07 <elukey> re-run aqs-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots) [analytics]
12:42 <elukey> temporarily switch pivot's config to druid1002 (to reboot druid1001) [analytics]
12:37 <elukey> re-run webrequest-load-wf-upload-2017-12-1-10 and webrequest-load-wf-upload-2017-12-1-7 (failed due to Hadoop reboots) [analytics]
12:36 <elukey> re-run webrequest-load-wf-text-2017-12-1-10 and webrequest-load-wf-text-2017-12-1-9 (failed due to Hadoop reboots) [analytics]
12:35 <elukey> re-run pageview-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots) [analytics]
12:34 <elukey> re-run webrequest-druid-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots) [analytics]
2017-11-30 §
18:20 <elukey> re-run webrequest-load-wf-upload-2017-11-30-16 (failed due to Hadoop reboots) [analytics]
18:19 <elukey> re-run webrequest-load-wf-text-2017-11-30-14 (failed due to Hadoop reboots) [analytics]
16:21 <joal> wikidata-wdqs_extract-wf-2017-11-30-15 [analytics]
15:50 <elukey> restart hue on thorium - timeouts and 500s [analytics]
14:58 <joal> Update druid overlord config to equalDistribution dynamically [analytics]
2017-11-29 §
21:46 <joal> rerun pageview-druid-hourly-wf-2017-11-29-18 and pageview-druid-hourly-wf-2017-11-29-19 [analytics]
21:19 <joal> rerun webrequest-druid-hourly-wf-2017-11-29-18 [analytics]
2017-11-28 §
14:41 <ottomata> restarting eventlogging on eventlog1001 for https://gerrit.wikimedia.org/r/#/c/393613/ [analytics]
09:08 <elukey> log database on dbstore1002 dropped for good [analytics]
2017-11-22 §
16:09 <ottomata> restarting eventlogging services on eventlog1001 [analytics]
2017-11-20 §
18:28 <elukey> deployed prometheus-druid-exporter (still not released in apt) on druid1004 for testing [analytics]
15:45 <ottomata> deploying fixes to EL EventCapsule discrepancies: https://phabricator.wikimedia.org/T179625#3755242 [analytics]
2017-11-16 §
15:25 <milimetric> deployed refinery and running interlanguage links dataset now [analytics]
2017-11-15 §
14:22 <addshore> addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/r-library [analytics]
14:22 <addshore> addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/installRlib [analytics]
2017-11-14 §
09:45 <elukey> executed chmod g+rx /home/ezachte/wikistats_data/dumps to unblock Joseph (should be safe) [analytics]
2017-11-13 §
21:20 <addshore> addshore@stat1005:/srv/analytics-wmde/wdcm/src$ sudo -u analytics-wmde Rscript ./_installProduction_analytics-wmde.R [analytics]
21:20 <addshore> test [analytics]
14:44 <joal> Resuming all druid loading jobs after fixing restart issues [analytics]
14:18 <joal> Suspending pageview-druid-hourly-coord again, trying to fix druid loading [analytics]
14:10 <joal> Unsuspend pageview-druid-hourly-coord [analytics]
13:08 <joal> Suspend webrequest druid loading waiting for elukey [analytics]
13:05 <joal> Rerun webrequest-druid-hourly-wf-2017-11-13-11 [analytics]
11:15 <elukey> suspend pageview-druid-hourly-coord to allow an easier druid daemon reload (new prometheus jvm agent) [analytics]
2017-11-08 §
15:16 <ottomata> deploying eventlogging analytics change for eventcapsule schema fixes, will be no-op until we deploy puppet changes too [analytics]
11:28 <elukey> resumed cassandra-coord-pageview-per-project-hourly after maintenance to aqs hosts [analytics]
10:04 <elukey> suspended cassandra-coord-pageview-per-project-hourly as prep step to reboot aqs nodes - T179943 [analytics]
2017-11-06 §
15:37 <milimetric> found geowiki was hitting the wrong databases, updated it to always hit analytics-store [analytics]
2017-11-03 §
10:55 <joal> Kill mediawiki-history oozie job to prevent computing october snapshot before fixing reconstruction process [analytics]
2017-11-02 §
08:54 <elukey> relaunched failed pageview-druid-hourly jobs - Druid indexation check failures in the logs (01 Nov 2017 21:00:00 and 01 Nov 2017 19:00:00) [analytics]
2017-11-01 §
20:06 <ottomata> rerunning pageview-druid-hourly-wf-2017-11-1-18 [analytics]
19:05 <ottomata> deploying refinery with refinery/source 0.0.54 for JsonRefine job T162610 [analytics]
18:40 <ottomata> rerunning unique_devices-per_project_family-druid-monthly-wf-2017-10 [analytics]
2017-10-30 §
10:12 <elukey> added Francisco to the analytics-alerts@ mailing list [analytics]
2017-10-27 §
07:40 <elukey> re-run wikidata-articleplaceholder_metrics-wf-2017-10-26 [analytics]
07:36 <elukey> stop & mask hadoop-httpfs.service on analytics1001 after https://gerrit.wikimedia.org/r/#/c/386684/ [analytics]
2017-10-26 §
16:58 <ottomata> now mirroring main Kafka cluster topics to jumbo Kafka cluster, with MirrorMaker instances running on analytics-eqiad broker nodes. https://phabricator.wikimedia.org/T177216 [analytics]
2017-10-25 §
13:32 <elukey> restart yarn nodemanager and hdfs datanode on analytics1030 to apply new JVM settings [analytics]
2017-10-24 §
20:29 <nuria_> started unique_devices-per_project_family-druid-daily-coord 0102816-170829140538136-oozie-oozi-C [analytics]
20:24 <nuria_> restarted job unique_devices-per_project_family-druid-monthly-coord 0102799-170829140538136-oozie-oozi-C [analytics]
20:23 <nuria_> restarted job uniques-monthly-per-domain-druid 0102785-170829140538136-oozie-oozi-C [analytics]
19:44 <nuria_> killing druid coordinators uniques-monthly and per-project-family: 0066771-170829140538136-oozie-oozi-C,0066767-170829140538136-oozie-oozi-C,0010139-170621131133576-oozie-oozi-C [analytics]
2017-10-23 §
18:50 <joal> Deploying AQS after fix [analytics]