2017-12-05 §
10:31 <elukey> disabled druid middlemanager on druid1003 with curl -X POST http://druid1003.eqiad.wmnet:8091/druid/worker/v1/disable [analytics]
10:03 <elukey> stop camus as precautionary measure before Hadoop masters reboot [analytics]
09:57 <elukey> suspend webrequest load bundle as extra precaution before Hadoop masters reboot [analytics]
2017-12-04 §
16:29 <elukey> restart webrequest-load-wf-upload-2017-12-4-12 (failed due to hadoop reboots) [analytics]
16:12 <elukey> restart webrequest-load-wf-upload-2017-12-4-13 (failed due to hadoop reboots) [analytics]
15:09 <joal> Rerun webrequest-load-wf-upload-2017-12-4-12 and webrequest-load-wf-upload-2017-12-4-13 [analytics]
15:08 <joal> Rerunning 15:47:35 < fdans> whatuuuup mforns [analytics]
14:17 <elukey> re-run pageview-druid-hourly-wf-2017-12-4-11 in Hue (failed due to reboots) [analytics]
12:04 <elukey> re-run webrequest-load-wf-upload-2017-12-4-8 (failed due to reboots) [analytics]
12:04 <elukey> re-run webrequest-load-check_sequence_statistics-wf-upload-2017-12-4-7 (failed due to reboots) [analytics]
2017-12-02 §
11:47 <joal> Rerun unique_devices-per_project_family-monthly-wf-2017-11 [analytics]
2017-12-01 §
15:20 <elukey> rerun webrequest-druid-hourly-wf-2017-12-1-8 after an unexpected Druid Overlord inconsistency [analytics]
15:09 <elukey> rerun pageview-druid-hourly-wf-2017-12-1-8 after an unexpected Druid Overlord inconsistency [analytics]
13:07 <elukey> re-run aqs-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots) [analytics]
12:42 <elukey> temporarily switch pivot's config to druid1002 (to reboot druid1001) [analytics]
12:37 <elukey> re-run webrequest-load-wf-upload-2017-12-1-10 and webrequest-load-wf-upload-2017-12-1-7 (failed due to Hadoop reboots) [analytics]
12:36 <elukey> re-run webrequest-load-wf-text-2017-12-1-10 and webrequest-load-wf-text-2017-12-1-9 (failed due to Hadoop reboots) [analytics]
12:35 <elukey> re-run pageview-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots) [analytics]
12:34 <elukey> re-run webrequest-druid-hourly-wf-2017-12-1-8 (failed due to Hadoop reboots) [analytics]
2017-11-30 §
18:20 <elukey> re-run webrequest-load-wf-upload-2017-11-30-16 (failed due to hadoop reboots) [analytics]
18:19 <elukey> re-run webrequest-load-wf-text-2017-11-30-14 (failed due to hadoop reboots) [analytics]
16:21 <joal> wikidata-wdqs_extract-wf-2017-11-30-15 [analytics]
15:50 <elukey> restart hue on thorium - timeouts and 500s [analytics]
14:58 <joal> Update druid overlord config to equalDistribution dynamically [analytics]
2017-11-29 §
21:46 <joal> rerun pageview-druid-hourly-wf-2017-11-29-18 and pageview-druid-hourly-wf-2017-11-29-19 [analytics]
21:19 <joal> rerun webrequest-druid-hourly-wf-2017-11-29-18 [analytics]
2017-11-28 §
14:41 <ottomata> restarting eventlogging on eventlog1001 for https://gerrit.wikimedia.org/r/#/c/393613/ [analytics]
09:08 <elukey> log database on dbstore1002 dropped for good [analytics]
2017-11-22 §
16:09 <ottomata> restarting eventlogging services on eventlog1001 [analytics]
2017-11-20 §
18:28 <elukey> deployed prometheus-druid-exporter (still not released in apt) on druid1004 for testing [analytics]
15:45 <ottomata> deploying fixes to EL EventCapsule discrepancies: https://phabricator.wikimedia.org/T179625#3755242 [analytics]
2017-11-16 §
15:25 <milimetric> deployed refinery and running interlanguage links dataset now [analytics]
2017-11-15 §
14:22 <addshore> addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/r-library [analytics]
14:22 <addshore> addshore@stat1005:/srv/analytics-wmde$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/installRlib [analytics]
2017-11-14 §
09:45 <elukey> executed chmod g+rx /home/ezachte/wikistats_data/dumps to unblock Joseph (should be safe) [analytics]
2017-11-13 §
21:20 <addshore> addshore@stat1005:/srv/analytics-wmde/wdcm/src$ sudo -u analytics-wmde Rscript ./_installProduction_analytics-wmde.R [analytics]
21:20 <addshore> test [analytics]
14:44 <joal> Resuming all druid loading jobs after fixing restart issues [analytics]
14:18 <joal> Suspending pageview-druid-hourly-coord again, trying to fix druid loading [analytics]
14:10 <joal> Unsuspend pageview-druid-hourly-coord [analytics]
13:08 <joal> Suspend webrequest druid loading waiting for elukey [analytics]
13:05 <joal> Rerun webrequest-druid-hourly-wf-2017-11-13-11 [analytics]
11:15 <elukey> suspend pageview-druid-hourly-coord to allow an easier druid daemon reload (new prometheus jvm agent) [analytics]
2017-11-08 §
15:16 <ottomata> deploying eventlogging analytics change for eventcapsule schema fixes, will be no-op until we deploy puppet changes too [analytics]
11:28 <elukey> resumed cassandra-coord-pageview-per-project-hourly after maintenance to aqs hosts [analytics]
10:04 <elukey> suspended cassandra-coord-pageview-per-project-hourly as prep step to reboot aqs nodes - T179943 [analytics]
2017-11-06 §
15:37 <milimetric> found geowiki was hitting the wrong databases, updated it to always hit analytics-store [analytics]
2017-11-03 §
10:55 <joal> Kill mediawiki-history oozie job to prevent computing october snapshot before fixing reconstruction process [analytics]
2017-11-02 §
08:54 <elukey> relaunched failed pageview-druid-hourly jobs - Druid indexation check failures in the logs (01 Nov 2017 21:00:00 and 01 Nov 2017 19:00:00) [analytics]
2017-11-01 §
20:06 <ottomata> rerunning pageview-druid-hourly-wf-2017-11-1-18 [analytics]