7151-7200 of 10000 results (24ms)
2020-11-19 §
15:25 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
15:23 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
15:22 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
15:18 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
15:17 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
14:54 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
14:53 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
14:50 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
14:49 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
14:45 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
14:44 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
14:36 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
11:40 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
11:33 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
09:07 <elukey> restart kafka daemons on kafka-jumbo1001 for openjdk upgrades (canary) [production]
08:55 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) [production]
08:49 <elukey> restart hadoop daemons on analytics1058 for openjdk upgrades (canary) [production]
08:25 <elukey@cumin1001> START - Cookbook sre.hadoop.roll-restart-masters [production]
08:19 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) [production]
07:22 <elukey@cumin1001> START - Cookbook sre.hadoop.roll-restart-masters [production]
07:21 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.roll-restart-workers (exit_code=0) [production]
07:05 <elukey> roll restart java daemons on Hadoop test for openjdk upgrades [production]
07:05 <elukey@cumin1001> START - Cookbook sre.hadoop.roll-restart-workers [production]
2020-11-18 §
17:18 <elukey> shutdown an-presto1004 for hw maintenance [production]
16:50 <elukey> update /etc/krb5.keytab on krb1001/krb2001 to match the most up to date key version for host/krb2001.codfw.wmnet [production]
14:09 <elukey> copied /etc/krb5.keytab from krb1001 to krb2001 (the last one contained only one principal for 2001, the first one both for 1001 and 2001) [production]
14:02 <elukey> restart krb5-kpropd.service on krb2001 to force the pick up of new client configs [production]
09:22 <elukey> set dns_canonicalize_hostname = false to all kerberos clients [production]
06:53 <elukey> restart also mirror maker on kafka-main1001/1003 (seems not related but just to clear old errors and a possible weird state) [production]
06:37 <elukey> restart kafka-mirror-main-codfw_to_main-eqiad@0.service on kafka-main1002 - consumer msg rate low since kafka-main2003 went down for codfw c7 failure [production]
2020-11-17 §
19:24 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
19:21 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
19:18 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
19:12 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
14:57 <elukey> stutdown stat1008 for ram expansion [production]
13:37 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
13:29 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:27 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
13:23 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:22 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) [production]
13:22 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:21 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) [production]
13:21 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
09:08 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
09:02 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
09:01 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
08:56 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
08:56 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
08:52 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
08:37 <elukey@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]