301-350 of 1737 results (17ms)
2018-07-05 §
10:36 <elukey> restart oozie on analytics1003 - connection timeouts from thorium after mariadb maintenance [analytics]
10:34 <elukey> restart hive metastore on an1003, errors after mariadb maintenance this morning [analytics]
07:44 <elukey> all jobs re-enabled [analytics]
06:26 <elukey> stop camus to allow mariadb restart on analytics1003 [analytics]
2018-07-02 §
14:56 <elukey> resume cassandra bundle via hue [analytics]
13:27 <elukey> suspend cassandra bundle via Hue to ease the reimage of aqs1004 [analytics]
09:12 <joal> Rerun mediawiki-geoeditors-load-wf-2018-06 after having fixed the wmf_raw.mediawiki_private_cu_changes table issueb [analytics]
07:12 <joal> Restart cassandra bundle [analytics]
2018-06-28 §
14:46 <elukey> upgrade piwik 3.2.1 to matomo (new name/package) 3.5.1 [analytics]
11:27 <joal> Change mediawiki-reduced table format to be parquet and restart mediawiki-reduced oozie job [analytics]
11:19 <joal> Restart druid uniques daily-monthly-aggregated indexation jobs [analytics]
11:19 <joal> Start backfilling job cassandra pageviews-top-countries ceiled-values [analytics]
10:20 <joal> Deploying refinery to HDFS [analytics]
10:09 <joal> Deploying refinery using scap [analytics]
09:03 <joal> deploying AQS pageviews-bycountry ceiled value glue code [analytics]
07:41 <fdans> testing load of 2 months of per country pageviews with the new ceiled value [analytics]
06:10 <elukey> move /srv/kafka to a dedicated 60G partition on deployment-jumbo hosts in deployment-prep [analytics]
2018-06-27 §
21:51 <elukey> piwik maintenance completed [analytics]
13:08 <elukey> piwik upgraded to 3.2.1 on bohrium + started the db migration procedure (will last 2/3h probably) [analytics]
12:57 <elukey> set Piwik in maintenance mode as prep step for backup + upgrade [analytics]
2018-06-20 §
19:54 <ottomata> removed Kafka MirrorMaker from kafka10(12|13|14) [analytics]
2018-06-18 §
11:57 <joal> Restart oozie webrequest refine jobs [analytics]
11:19 <joal> Launch oozie webrequest refine jobs for the failing hour 2018-06-14-11 [analytics]
10:18 <joal> Deployed refiney on hdfs [analytics]
10:18 <joal> Deployed refinery with scap [analytics]
2018-06-15 §
09:00 <joal> Deleting corrupted file hdfs://analytics-hadoop/user/joal/wmf/data/raw/webrequest/webrequest_upload/hourly/2018/06/14/11/webrequest_upload.1004.10.1214791.15490650727.1528974000000._COPYING_ to prevent webrequest refine jobs from failing. No data will be lost as the correct file exist. [analytics]
2018-06-14 §
19:29 <joal> try rerunning webrequest-load-wf-upload-2018-6-14-11 [analytics]
13:14 <elukey> re-run failed webrequest-upload/text jobs (namenodes restarted) [analytics]
2018-06-11 §
13:56 <ottomata> bouncing eventlogging processes to apply kafka event time producing [analytics]
2018-06-08 §
11:45 <joal> Launching manual sqooping of revision and archive table to recover from failure [analytics]
2018-06-01 §
08:37 <joal> Restart every druid loading oozie job (except mediawiki reduced) to pick new configuration [analytics]
08:33 <joal> Restart mediawiki-history-denormalize oozie job after deploy [analytics]
08:24 <joal> Deploy refinery on HDFS [analytics]
08:08 <joal> Deploying refinery using scap [analytics]
07:53 <joal> Releasing refinery-source v0.0.65 to archiva [analytics]
07:05 <joal> Rerun virtualpageview-druid-monthly-wf-2018-5 [analytics]
2018-05-31 §
17:01 <ottomata> dropping and deleting MobileWikiAppiOS* tables and data per request from chelsyx [analytics]
10:51 <elukey> stopped Pivot on thorium [analytics]
07:27 <joal> Restart webrequest-load-bundle with default oozie_launcher_memory value (should be 2048 set by workflows) [analytics]
05:33 <elukey> re-run faied webrequest-load upload|misc jobs via Hue [analytics]
01:02 <ottomata> bouncing main-eqiad -> jumbo-eqiad mirror maker [analytics]
2018-05-30 §
17:49 <joal> Rerun webrequest-load-wf-misc-2018-5-30-16 [analytics]
13:15 <elukey> re-run webrequest-load-wf-upload-2018-5-30-11 - died after worker node reboots [analytics]
06:14 <elukey> re-run failed webrequest-load jobs [analytics]
06:11 <elukey> temporary point Turnilo to druid1002 to allow druid1001's reimage [analytics]
05:50 <elukey> restart mirror maker on kafka10[12-23] - failures to consume after rebalance [analytics]
2018-05-29 §
17:02 <elukey> re-run webrequest-load-text 29th May 2018 12:00:00 [analytics]
15:03 <joal> rerun webrequest-load-wf-upload-2018-5-29-13 [analytics]
10:30 <elukey> roll restart of druid-middlemanagers on druid* to pick up the new runtime settings (no more references to hadoop-client-cdh) [analytics]
10:04 <elukey> re-run pageview-druid-hourly-wf-2018-5-29-7 [analytics]