2020-10-05
18:17 | <elukey@cumin1001> | END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) | [production]
18:17 | <elukey@cumin1001> | START - Cookbook sre.hadoop.init-hadoop-workers | [production]
18:15 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) | [production]
18:13 | <elukey@cumin1001> | START - Cookbook sre.hadoop.init-hadoop-workers | [production]
18:11 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) | [production]
18:10 | <elukey@cumin1001> | START - Cookbook sre.hadoop.init-hadoop-workers | [production]
17:53 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
17:51 | <elukey@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
17:29 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production]
17:27 | <elukey@cumin1001> | START - Cookbook sre.hosts.downtime | [production]
14:41 | <elukey> | shutdown stat1005 and stat1008 for ram expansion (1005 again) | [production]
14:25 | <elukey> | shutdown an-master1001 for ram expansion | [production]
13:54 | <elukey> | shutdown stat1005 for ram upgrade | [production]
13:31 | <elukey> | shutdown an-master1002 for ram expansion (64 -> 128G) | [production]
10:37 | <elukey@cumin1001> | END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) | [production]
10:34 | <elukey@cumin1001> | START - Cookbook sre.aqs.roll-restart | [production]
06:33 | <elukey> | reboot stat1005 to resolve weird GPU state (scheduled last week) | [production]