2018-01-17
14:21 | <elukey> | forced kill of banner impression data streaming job to get it restarted | [analytics]
11:44 | <elukey> | re-run pageview-druid-hourly-wf-2018-1-17-9 and pageview-druid-hourly-wf-2018-1-17-8 (failed due to druid1002's middlemanager being in a weird state after reboot) | [analytics]
11:44 | <elukey> | restart druid middlemanager on druid1002 | [analytics]
10:38 | <elukey> | stopped all crons on hadoop-coordinator-1 | [analytics]
10:37 | <elukey> | re-run webrequest-druid-hourly-wf-2018-1-17-8 (failed due to druid1002's reboot) | [analytics]
10:22 | <elukey> | reboot druid1002 for kernel upgrades | [analytics]
09:53 | <elukey> | disable druid middlemanager on druid1002 as prep step for reboot | [analytics]
09:46 | <elukey> | rebooted analytics1003 | [analytics]
09:46 | <elukey> | removed upstart config for brrd on eventlog1001 (failing and spamming syslog, old leftover?) | [analytics]
08:53 | <elukey> | disabled camus as prep step for analytics1003 reboot | [analytics]
2018-01-11
22:35 | <ottomata> | restarting kafka-jumbo brokers to apply https://gerrit.wikimedia.org/r/403774 | [analytics]
22:04 | <ottomata> | restarting kafka-jumbo brokers to apply https://gerrit.wikimedia.org/r/#/c/403762/ | [analytics]
20:57 | <ottomata> | restarting kafka-jumbo brokers to apply https://gerrit.wikimedia.org/r/#/c/403753/ | [analytics]
17:37 | <joal> | Kill manual banner-streaming job to see it restarted by cron | [analytics]
17:11 | <ottomata> | restart kafka on kafka-jumbo1003 | [analytics]
17:08 | <ottomata> | restart kafka on kafka-jumbo1001...something is not right with my certpath change yesterday | [analytics]
14:46 | <joal> | Deploy refinery onto HDFS | [analytics]
14:33 | <joal> | Deploy refinery with Scap | [analytics]
14:07 | <joal> | Manually restarting banner streaming job to prevent alerting | [analytics]
13:23 | <joal> | Killing banner-streaming job to have it auto-restarted from cron | [analytics]
11:45 | <elukey> | re-run webrequest-load-wf-text-2018-1-11-8 (failed due to reboots) | [analytics]
11:39 | <joal> | rerun mediacounts-load-wf-2018-1-11-8 | [analytics]
10:48 | <joal> | Restarting banner-streaming job after hadoop nodes reboot | [analytics]
10:01 | <elukey> | reboot analytics1059-61 for kernel updates | [analytics]
09:34 | <elukey> | reboot analytics1055->1058 for kernel updates | [analytics]
09:04 | <elukey> | reboot analytics1051->1054 for kernel updates | [analytics]
2018-01-09
16:53 | <joal> | Rerun pageview-druid-hourly-wf-2018-1-9-13 | [analytics]
15:33 | <elukey> | stop mysql on dbstore1002 as prep step for shutdown (stop all slaves, mysql stop) | [analytics]
15:10 | <elukey> | reboot analytics1028 (hadoop worker and hdfs journal node) for kernel updates | [analytics]
15:00 | <elukey> | reboot kafka-jumbo1006 for kernel updates | [analytics]
14:41 | <elukey> | reboot kafka-jumbo1005 for kernel updates | [analytics]
14:33 | <elukey> | reboot kafka1023 for kernel updates | [analytics]
14:04 | <elukey> | reboot kafka1022 for kernel updates | [analytics]
13:51 | <elukey> | reboot kafka-jumbo1003 for kernel updates | [analytics]
10:08 | <elukey> | reboot kafka-jumbo1002 for kernel updates | [analytics]
09:35 | <elukey> | reboot kafka1014 for kernel updates | [analytics]