2201-2250 of 10000 results (129ms)
2024-12-23 ยง
15:39 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1033.eqiad.wmnet with OS bookworm [production]
15:37 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1032.eqiad.wmnet with OS bookworm [production]
15:14 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1032.eqiad.wmnet with reason: host reimage [production]
15:11 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1032.eqiad.wmnet with reason: host reimage [production]
14:58 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1028-1030].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
14:58 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1030.eqiad.wmnet with OS bookworm [production]
14:52 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1032.eqiad.wmnet with OS bookworm [production]
14:51 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl1004.eqiad.wmnet with OS bookworm [production]
14:50 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1031.eqiad.wmnet with OS bookworm [production]
14:39 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1030.eqiad.wmnet with reason: host reimage [production]
14:35 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1030.eqiad.wmnet with reason: host reimage [production]
14:32 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1031.eqiad.wmnet with reason: host reimage [production]
14:27 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1031.eqiad.wmnet with reason: host reimage [production]
14:19 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1030.eqiad.wmnet with OS bookworm [production]
14:17 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1029.eqiad.wmnet with OS bookworm [production]
14:11 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1031.eqiad.wmnet with OS bookworm [production]
14:10 <mvernon@cumin2002> conftool action : set/pooled=false; selector: dnsdisc=swift,name=codfw [production]
14:10 <jayme@cumin1002> START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on P{wikikube-worker[1031-1033].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
14:10 <Emperor> depool codfw swift T382705 [production]
13:58 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1029.eqiad.wmnet with reason: host reimage [production]
13:55 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1029.eqiad.wmnet with reason: host reimage [production]
13:38 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1029.eqiad.wmnet with OS bookworm [production]
13:36 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1028.eqiad.wmnet with OS bookworm [production]
13:34 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1012-1014].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
13:34 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1014.eqiad.wmnet with OS bookworm [production]
13:18 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1028.eqiad.wmnet with reason: host reimage [production]
13:15 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1014.eqiad.wmnet with reason: host reimage [production]
13:14 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1028.eqiad.wmnet with reason: host reimage [production]
13:11 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1014.eqiad.wmnet with reason: host reimage [production]
12:54 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1014.eqiad.wmnet with OS bookworm [production]
12:53 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1013.eqiad.wmnet with OS bookworm [production]
12:34 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1013.eqiad.wmnet with reason: host reimage [production]
12:31 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1028.eqiad.wmnet with OS bookworm [production]
12:31 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1013.eqiad.wmnet with reason: host reimage [production]
12:29 <jayme@cumin1002> START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on P{wikikube-worker[1028-1030].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
12:27 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1025-1027].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
12:27 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1027.eqiad.wmnet with OS bookworm [production]
12:08 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1027.eqiad.wmnet with reason: host reimage [production]
12:04 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1027.eqiad.wmnet with reason: host reimage [production]
12:03 <moritzm> 5558 [production]
12:02 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl1004.eqiad.wmnet with reason: host reimage [production]
11:59 <akosiaris@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-ctrl1004.eqiad.wmnet with reason: host reimage [production]
11:47 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1027.eqiad.wmnet with OS bookworm [production]
11:46 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1026.eqiad.wmnet with OS bookworm [production]
11:40 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl1004.eqiad.wmnet with OS bookworm [production]
11:39 <akosiaris@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl1004.eqiad.wmnet with OS bookworm [production]
11:37 <akosiaris> roll restart of all swift fes in codfw. This seems to have fixed some higher than usual cache_upload error rates. Monitoring. [production]
11:27 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1026.eqiad.wmnet with reason: host reimage [production]
11:25 <akosiaris@cumin1002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw [production]
11:24 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1026.eqiad.wmnet with reason: host reimage [production]