451-500 of 916 results (14ms)
2016-10-20 §
17:06 <elukey> created 0000294-161020124223818-oozie-oozi-C to re-run webrequest-load-check_sequence_statistics-wf-upload-2016-10-20-13 (oozie errors) [analytics]
16:17 <ottomata> restarting eventlogging after rebooting kafka brokers [analytics]
15:43 <mforns> restarted EventLogging after throughput drop [analytics]
14:20 <ottomata> starting rolling restart of analytics-eqiad kafka brokers to apply kernel update [analytics]
13:13 <elukey> re-enabling oozie and camus after cluster reboot [analytics]
10:04 <elukey> stopped camus on an1027 and all the oozie bundles as prep step for the reboots of analytics* [analytics]
07:49 <elukey> created 0038058-160922102909979-oozie-oozi-C to re-run webrequest-load-check_sequence_statistics-wf-upload-2016-10-20-5 (oozie errors) [analytics]
07:48 <elukey> created 0038054-160922102909979-oozie-oozi-C to re-run webrequest-load-check_sequence_statistics-wf-upload-2016-10-20-3 (oozie errors) [analytics]
02:14 <andrewbogott> restarted druid101 because it was in some kind of kernel lockup [analytics]
2016-10-18 §
11:55 <elukey> removed /etc/mysql/conf.d/research-client.cnf from stat1002 (root:root perms, not supposed to be there but only on stat1003) [analytics]
09:24 <elukey> merged puppet change to force varnishkafka to use -q 'ReqMethod ne "PURGE" and not Timestamp:Pipe and not ReqHeader:Upgrade ~ "[wW]ebsocket" and not HttpGarbage' and -L 10000 [analytics]
06:16 <elukey> created the oozie coordinator 0035451-160922102909979-oozie-oozi-C for webrequest-load-check_sequence_statistics-wf-upload-2016-10-18-5 [analytics]
05:51 <elukey> created the oozie coordinator 0035415-160922102909979-oozie-oozi-C for webrequest-load-check_sequence_statistics-wf-upload-2016-10-18-[234] [analytics]
2016-10-17 §
16:57 <elukey> started the oozie coordinator 0034720-160922102909979-oozie-oozi-C to re-execute webrequest-load-wf-upload-2016-10-17-14 [analytics]
16:42 <ottomata> restarting hadoop nodemanagers 1 at a time [analytics]
15:32 <ottomata> rebootting analytics1030 [analytics]
08:24 <elukey> upgraded nodejs on aqs100[56] (already done on aqs1004) [analytics]
07:35 <elukey> created oozie coordinator 0034240-160922102909979-oozie-oozi-C to restart webrequest-load-check_sequence_statistics-wf-upload-2016-10-17-6 [analytics]
06:18 <elukey> created oozie coordinator 0034161-160922102909979-oozie-oozi-C to restart webrequest-load-check_sequence_statistics-wf-upload-2016-10-17-4 [analytics]
06:17 <elukey> created oozie coordinator 0034153-160922102909979-oozie-oozi-C to restart webrequest-load-check_sequence_statistics-wf-upload-2016-10-17-3 [analytics]
06:16 <elukey> created oozie coordinator 0034149-160922102909979-oozie-oozi-C to restart webrequest-load-check_sequence_statistics-wf-upload-2016-10-17-2 [analytics]
06:15 <elukey> created oozie coordinator 0034143-160922102909979-oozie-oozi-C to restart webrequest-load-check_sequence_statistics-wf-upload-2016-10-17-1 [analytics]
2016-10-13 §
06:57 <elukey> launched 0029282-160922102909979-oozie-oozi-C to re-run webrequest-load-check_sequence_statistics-wf-upload-2016-10-13-5 with higher error threshold [analytics]
06:56 <elukey> launched 0029278-160922102909979-oozie-oozi-C to re-run webrequest-load-check_sequence_statistics-wf-upload-2016-10-13-2 with higher error threshold [analytics]
2016-10-11 §
18:57 <elukey> kafka1018 back in service after maintenace [analytics]
13:44 <elukey> merged https://gerrit.wikimedia.org/r/315101 on stat1002 (removal of ::statistics:wikistats) [analytics]
13:00 <joal> Start prod version of wdqs_extract job [analytics]
12:16 <joal> Deploying refinery [analytics]
11:17 <elukey> started 0026094-160922102909979-oozie-oozi-C to fix webrequest-load-check_sequence_statistics-wf-upload-2016-10-11-9 (oozie data consistency errors) [analytics]
08:22 <joal> Killing cassandra loading jobs for old aqs [analytics]
2016-10-10 §
13:22 <elukey> moved pivot apache vhost to localhost:9090 [analytics]
09:16 <joal_> Deploy refinery [analytics]
2016-10-04 §
14:06 <elukey> applied role::spare to analytics1026 opened a task to decom it together with 1015 [analytics]
07:10 <elukey> rebooting eventlog1001 for kernel upgrades (EventLogging stopped) [analytics]
2016-09-23 §
09:06 <elukey> reboot eventlog2001.codfw.wmnet for kernel upgrades [analytics]
08:45 <elukey> upgrading varnishkafka to 1.0.12-1 in cache:misc [analytics]
08:32 <elukey> upgrading varnishkafka to 1.0.12-1 in cache:maps [analytics]
2016-09-22 §
15:30 <elukey> analytics1001 is back Yarn/HDFS master [analytics]
13:16 <elukey> previous comment was meant to be read as "set a permanent read only = false" [analytics]
13:16 <elukey> set read_only = false (on startup) for the analytics1003's mariadb instance [analytics]
13:12 <elukey> restarted oozie jobs for 2016-9-22-6 [analytics]
12:50 <elukey> varnishkafka 1.0.12 installed in cache:upload ulsfo and eqiad [analytics]
11:04 <elukey> re-enabling oozie and camus after cluster reboots [analytics]
10:57 <elukey> rebooted analytics1001 [analytics]
10:55 <elukey> Failover from analytics1001 to analytics1002 as prep step for 1001's reboot [analytics]
10:28 <elukey> setting global read_only = 0 to analytics1003 mariadb instance [analytics]
10:04 <elukey> rebooted analytics1003 (oozie, hive-metastore and hive-server2 daemons affected) [analytics]
09:51 <elukey> executed aptitude remove apache2 on analytic1027 (we use nginx in front of hue, apache steals port 8888 to hue and it does not start) [analytics]
09:49 <elukey> suspended all oozie bundles as prep step to reboot analytics1003 [analytics]
09:39 <elukey> rebooted analytics1027 [analytics]