3501-3550 of 10000 results (32ms)
2023-03-23 §
16:04 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2002.codfw.wmnet with reason: stop kafka and reimage [production]
16:04 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2002.codfw.wmnet with reason: stop kafka and reimage [production]
16:03 <elukey@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: sync [production]
16:03 <elukey@deploy2002> helmfile [staging] START helmfile.d/services/changeprop: sync [production]
14:53 <elukey@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop: sync [production]
14:53 <elukey@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
14:51 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
14:51 <elukey@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
14:26 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
14:26 <elukey@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
14:22 <elukey@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop: sync [production]
14:22 <elukey@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
11:52 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main2004.codfw.wmnet with OS bullseye [production]
11:32 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main2004.codfw.wmnet with reason: host reimage [production]
11:27 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main2004.codfw.wmnet with reason: host reimage [production]
11:08 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-main2004.codfw.wmnet with OS bullseye [production]
11:07 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2004.codfw.wmnet with reason: stop kafka and reimage [production]
11:06 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2004.codfw.wmnet with reason: stop kafka and reimage [production]
10:38 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main2005.codfw.wmnet with OS bullseye [production]
10:18 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main2005.codfw.wmnet with reason: host reimage [production]
10:15 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main2005.codfw.wmnet with reason: host reimage [production]
10:01 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-main2005.codfw.wmnet with OS bullseye [production]
09:57 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2005.codfw.wmnet with reason: stop kafka and reimage [production]
09:57 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2005.codfw.wmnet with reason: stop kafka and reimage [production]
08:20 <elukey> clean up docker and reboot kubernetes2024 to enable overlay2 - T332803 [production]
07:54 <elukey> clean up docker and reboot kubernetes2023 to enable overlay2 - T332803 [production]
07:50 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubernetes2023.codfw.wmnet with reason: Restart docker with overlay [production]
07:49 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kubernetes2023.codfw.wmnet with reason: Restart docker with overlay [production]
07:49 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubernetes2024.codfw.wmnet with reason: Restart docker with overlay [production]
07:49 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kubernetes2024.codfw.wmnet with reason: Restart docker with overlay [production]
07:42 <elukey> clean up docker on kubernetes1024 (cordon + stop kubelet + docker + clean /var/lib/docker/*) and reboot to enable overlay2 - T332803 [production]
07:38 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubernetes1024.eqiad.wmnet with reason: Restart docker with overlay [production]
07:37 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kubernetes1024.eqiad.wmnet with reason: Restart docker with overlay [production]
2023-03-22 §
17:06 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
17:06 <elukey@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
17:05 <elukey@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop: sync [production]
17:04 <elukey@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
16:49 <elukey@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop: sync [production]
16:49 <elukey@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop: sync [production]
16:42 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
16:42 <elukey@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
15:56 <elukey@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts kafka-main2004.codfw.wmnet [production]
15:56 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafka-main2004.codfw.wmnet [production]
15:48 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host kafka-main2004.codfw.wmnet [production]
15:46 <elukey@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts kafka-main2004.codfw.wmnet [production]
15:46 <elukey@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts kafka-main2004.codfw.wmnet [production]
15:46 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafka-main2004.codfw.wmnet [production]
15:40 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host kafka-main2004.codfw.wmnet [production]
15:39 <elukey@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts kafka-main2004.codfw.wmnet [production]
15:39 <elukey@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts kafka-main2004.codfw.wmnet [production]