2019-07-31
§
|
14:04 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) |
[production] |
13:49 |
<elukey@cumin1001> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper |
[production] |
13:37 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) |
[production] |
13:31 |
<elukey@cumin1001> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper |
[production] |
13:27 |
<elukey> |
roll restart of zookeeper on conf100[4-6] and conf200[1-3] for openjdk upgrades |
[production] |
13:12 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) |
[production] |
13:05 |
<elukey@cumin1001> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper |
[production] |
12:59 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) |
[production] |
12:53 |
<elukey@cumin1001> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper |
[production] |
10:08 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) |
[production] |
09:56 |
<elukey@cumin1001> |
START - Cookbook sre.kafka.roll-restart-mirror-maker |
[production] |
08:37 |
<elukey> |
restart Yarn Resource Managers on an-master100[12] to pick up the new openjdk version |
[production] |
08:05 |
<elukey> |
restart hadoop Namenodes on an-master100[12] to pick up new heap settings and new openjdk |
[production] |
07:29 |
<elukey> |
restart-hhvm on mw1290 |
[production] |
2019-07-29
§
|
16:19 |
<elukey> |
manually stopped the sre.kafka.roll-restart-brokers cookbook after 4 brokers restarts since the sleep interval (10mins) is too tight. |
[production] |
16:17 |
<elukey@cumin1001> |
END (ERROR) - Cookbook sre.kafka.roll-restart-brokers (exit_code=97) |
[production] |
15:34 |
<elukey@cumin1001> |
START - Cookbook sre.kafka.roll-restart-brokers |
[production] |
13:30 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) |
[production] |
13:01 |
<elukey@cumin1001> |
START - Cookbook sre.druid.roll-restart-workers |
[production] |
09:24 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.druid.roll-restart-workers (exit_code=99) |
[production] |
09:22 |
<elukey@cumin1001> |
START - Cookbook sre.druid.roll-restart-workers |
[production] |
09:21 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) |
[production] |
08:55 |
<elukey@cumin1001> |
START - Cookbook sre.druid.roll-restart-workers |
[production] |
08:47 |
<elukey> |
set mcrouter async behavior for codfw replication to all mw app/api servers (changes will be picked up when puppet runs on the hosts) - T225642 |
[production] |
08:32 |
<elukey@cumin1001> |
END (ERROR) - Cookbook sre.hadoop.roll-restart-workers (exit_code=97) |
[production] |
08:32 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |
07:18 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.roll-restart-workers (exit_code=0) |
[production] |
06:30 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |