2019-06-16 §
08:09 <elukey> manually restart refinery-druid-drop-public-snapshots.service with new unit settings (-t druid1004.eqiad.wmnet vs -t druid1004.eqiad.wmnet:8081) [analytics]
2019-06-14 §
13:22 <joal> Restarting AQS using `scap deploy --service-restart` [analytics]
2019-06-13 §
18:18 <fdans> deployment complete [analytics]
17:42 <fdans> deploying refinery [analytics]
17:40 <fdans> updating refinery jar symlinks [analytics]
17:20 <fdans> Releasing new version of refinery source (v0.0.92) [analytics]
2019-06-11 §
07:38 <fdans> reset fail alert for refinery-import-page-history-dumps [analytics]
2019-06-10 §
18:12 <joal> Restart pageview, pageview-druid-hourly/daily/monthly oozie jobs for them to run in production queue [analytics]
18:05 <joal> Kill/Restart webrequest bundle and move it to production queue [analytics]
17:54 <ottomata> rolling restart of AQS service using scap deploy for new mediawiki_history_snapshot [analytics]
2019-06-08 §
08:17 <joal> Manually re-run patched refine_eventlogging_analytics on an-coord1001 with flags "--ignore_failure_flag=true --since 48" [analytics]
08:12 <elukey> remove org.wikimedia.analytics.refinery.job.refine.filter_out_non_wiki_hostname from refine's transform functions temporarily to unblock T225342 [analytics]
07:37 <elukey> manual run of monitor_refine_eventlogging_analytics [analytics]
07:28 <joal> Manually run refine_eventlogging_analytics on an-coord1001 with flag --ignore_failure_flag=true [analytics]
2019-06-07 §
17:42 <joal> Drop currently unused /wmf/data/wmf/webrequest_subset folder [analytics]
17:29 <elukey> chown -R analytics:analytics-privatedata-users + chmod o-rw /wmf/data/wmf/netflow on HDFS [analytics]
17:18 <mforns> restarted turnilo to clear deleted datasource [analytics]
17:17 <elukey> restart turnilo to remove the old netflow datasource's settings [analytics]
17:01 <mforns> restarted turnilo to clear deleted datasource [analytics]
16:18 <joal> rerun webrequest-load-wf-text-2019-6-7-14 after failure [analytics]
09:59 <joal> Kill wikitext-history job to prevent more resource consumption because of failures [analytics]
2019-06-06 §
09:52 <elukey> chown report updater output dirs on stat1007 to analytics:wikidev (was hdfs:wikidev) to unblock creation of new data [analytics]
09:45 <elukey> re-run refine_sanitize_eventlogging_analytics_immediate with since = 900 in the .properties file [analytics]
06:38 <elukey> re-run refine_sanitize_eventlogging_analytics_immediate with since = 48 in the .properties file (manually added) [analytics]
05:36 <elukey> chown analytics:analytics /wmf/data/event_sanitized/{CentralNoticeTiming,LayoutJank,EventTiming,ElementTiming} (new directories created with yarn:analytics) [analytics]
2019-06-05 §
20:59 <mforns> finished deployment of analytics/refinery up to 0660e70153dec892ae20bee7119a72cc17e8ec87 [analytics]
20:20 <mforns> starting deployment of analytics/refinery up to 0660e70153dec892ae20bee7119a72cc17e8ec87 [analytics]
18:20 <mforns> finished deployment of analytics/refinery/source v0.0.91 [analytics]
18:00 <mforns> starting deployment of analytics/refinery/source v0.0.91 [analytics]
10:07 <elukey> attempt to re-run webrequest-load-wf-text-2019-6-4-20 via Hue (temporary errors in the logs) [analytics]
2019-06-04 §
08:03 <elukey> restart hive-server2 on an-coord1001 to pick up new GC/Heap settings [analytics]
06:57 <elukey> restart hive metastore on an-coord1001 to apply new GC/heap settings [analytics]
2019-06-03 §
06:51 <elukey> add the server field to the webrequest event format in varnishkafka + roll restart of all the varnishkafkas (via puppet) - T224236 [analytics]
2019-06-02 §
07:04 <elukey> manually restart refinery-import-page-history-dumps.service with some debug info to check what file breaks [analytics]
04:50 <joal> Restart mediawiki-history-wikitext (dumps conversion) oozie job [analytics]
04:12 <joal> Restart load-cassandra oozie bundle to use analytics user [analytics]
2019-06-01 §
08:03 <elukey> manually restart refinery-sqoop-whole-mediawiki.service after failure [analytics]
2019-05-27 §
19:42 <elukey> chown analytics:analytics /wmf/data/event/mediawiki_job_userOptionsUpdate on HDFS [analytics]
2019-05-22 §
21:29 <joal> Manually refine webrequest_upload_2019_05_22_12 removing 19 rows having user-agents causing UAParser issue [analytics]
20:44 <joal> Manually refine webrequest_text_2019_05_22_12 removing 19 rows having user-agents causing UAParser issue [analytics]
17:27 <joal> Manually rerun webrequest-load-wf-upload-2019-5-22-12 with higher error threshold, as the dataloss error is a confirmed false positive [analytics]
2019-05-21 §
06:28 <elukey> chown analytics:analytics /user/hdfs/salts/eventlogging_sanitization on HDFS [analytics]
2019-05-20 §
17:17 <elukey> chown -R analytics:analytics /tmp/DataFrameToDruid on HDFS [analytics]
16:39 <joal> Manually run webrequest-load-wf-upload-2019-5-20-11 with higher error threshold, as the errors were false positives [analytics]
15:28 <joal> Rerunning timed-out webrequest-load-coord-text and webrequest-load-coord-upload (2019-05-20T09:00) [analytics]
14:41 <elukey> chown analytics:analytics /wmf/data/event_sanitized on HDFS [analytics]
12:02 <elukey> chown analytics:analytics /wmf/data/event on HDFS [analytics]
12:00 <elukey> chown analytics:analytics /wmf/data/wmf/event on HDFS [analytics]
10:21 <elukey> chown -R analytics:analytics /wmf/data/raw/ dirs (except the webrequest one that has different perms) [analytics]
10:07 <elukey> chown analytics:analytics /wmf/camus dirs (except the webrequest dir) [analytics]