2021-04-01
16:16 <Majavah> hard reboot unresponsive deployment-cache-text06 [releng]
16:02 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2395.codfw.wmnet with reason: REIMAGE [production]
16:00 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2395.codfw.wmnet with reason: REIMAGE [production]
15:58 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2394.codfw.wmnet with reason: REIMAGE [production]
15:56 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2394.codfw.wmnet with reason: REIMAGE [production]
15:53 <dcaro> Removed etcd member tools-k8s-etcd-5.tools.eqiad.wmflabs, adding a new member (T267082) [tools]
15:43 <dcaro> Removing etcd member tools-k8s-etcd-5.tools.eqiad.wmflabs (T267082) [tools]
15:39 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2393.codfw.wmnet with reason: REIMAGE [production]
15:37 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2393.codfw.wmnet with reason: REIMAGE [production]
15:36 <dcaro> Added new etcd member tools-k8s-etcd-9.tools.eqiad1.wikimedia.cloud (T267082) [tools]
15:32 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2391.codfw.wmnet with reason: REIMAGE [production]
15:30 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2391.codfw.wmnet with reason: REIMAGE [production]
15:18 <dcaro> adding new etcd member using the cookbook wmcs.toolforge.add_etcd_node (T267082) [tools]
15:17 <dcaro> etcd cluster shrunk 3 members (using wmcs.toolforge.remove_etcd_node cookbook) [toolsbeta]
15:05 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2392.codfw.wmnet with reason: REIMAGE [production]
15:03 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2392.codfw.wmnet with reason: REIMAGE [production]
14:54 <dcaro> shrinking etcd cluster to 3 members, cleaning up automation runs [toolsbeta]
14:52 <volans> uploaded python3-wmflib_0.0.7 to bullseye-wikimedia [production]
14:41 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2390.codfw.wmnet with reason: REIMAGE [production]
14:39 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2390.codfw.wmnet with reason: REIMAGE [production]
14:22 <effie> disable puppet on mw* canaries, rolling depool and pooling of canaries [production]
14:06 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-test-worker1001.eqiad.wmnet with reason: REIMAGE [production]
14:04 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-test-worker1001.eqiad.wmnet with reason: REIMAGE [production]
14:01 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2389.codfw.wmnet with reason: REIMAGE [production]
13:59 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2389.codfw.wmnet with reason: REIMAGE [production]
13:53 <pt1979@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2388.codfw.wmnet with reason: REIMAGE [production]
13:51 <pt1979@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2388.codfw.wmnet with reason: REIMAGE [production]
13:24 <ema> cp3054: reboot with Linux 4.19.181+1 -- the kernel was not upgraded earlier during T273278 reboots due to broken dpkg status [production]
13:16 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1022.eqiad.wmnet [production]
13:07 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ganeti1022.eqiad.wmnet [production]
12:59 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) [production]
12:53 <dcaro@cumin1001> START - Cookbook sre.hosts.upgrade-and-reboot [production]
12:52 <Majavah> update floating ip 185.15.56.9 from deployment-parsoid11 to deployment-parsoid12 [releng]
12:51 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) [production]
12:47 <moritzm> drain ganeti1022 [production]
12:46 <dcaro@cumin1001> START - Cookbook sre.hosts.upgrade-and-reboot [production]
12:45 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.upgrade-and-reboot (exit_code=0) [production]
12:45 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1021.eqiad.wmnet [production]
12:40 <dcaro@cumin1001> START - Cookbook sre.hosts.upgrade-and-reboot [production]
12:38 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd2003-dev.codfw.wmnet [production]
12:38 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ganeti1021.eqiad.wmnet [production]
12:34 <dcaro@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudcephosd2003-dev.codfw.wmnet [production]
12:23 <moritzm> drain ganeti1021 [production]
12:21 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd2003-dev.codfw.wmnet [production]
12:19 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1020.eqiad.wmnet [production]
12:15 <dcaro@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudcephosd2003-dev.codfw.wmnet [production]
12:15 <dcaro> Restoring the 4.9 kernel on cloudcephosd2003-dev and upgrading (T274565) [admin]
12:12 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ganeti1020.eqiad.wmnet [production]
11:59 <Urbanecm> Start server upload of two video files (~4 GB in total) # T278856 [production]
11:55 <moritzm> drain ganeti1020 [production]