1251-1300 of 10000 results (26ms)
2026-03-17 ยง
15:51 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2033.codfw.wmnet [production]
15:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2033.codfw.wmnet [production]
15:49 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.add_k8s_etcd_node [toolsbeta]
15:46 <dzahn@cumin2002> START - Cookbook sre.hosts.reimage for host zuul2003.codfw.wmnet with OS trixie [production]
15:45 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1009.eqiad.wmnet [production]
15:45 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1008.eqiad.wmnet [production]
15:44 <samtar@deploy2002> mwscript-k8s job started: foreachwikiindblist group0 cleanupWatchlistLabelMember.php # T420328 [production]
15:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2033.codfw.wmnet [production]
15:38 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2033.codfw.wmnet [production]
15:38 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2048.codfw.wmnet [production]
15:37 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2048.codfw.wmnet [production]
15:37 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) [toolsbeta]
15:36 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1008.eqiad.wmnet [production]
15:36 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1007.eqiad.wmnet [production]
15:34 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2048.codfw.wmnet [production]
15:34 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1012.eqiad.wmnet with reason: host reimage [production]
15:33 <samtar@deploy2002> mwscript-k8s job started: foreachwikiindblist testwikis cleanupWatchlistLabelMember.php # T420328 [production]
15:33 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.remove_k8s_etcd_node [toolsbeta]
15:32 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.add_k8s_etcd_node (exit_code=99) [toolsbeta]
15:32 <btullis@cumin1003> START - Cookbook sre.druid.reboot-workers for Druid analytics cluster: Reboot Druid nodes [production]
15:28 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1007.eqiad.wmnet [production]
15:28 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1006.eqiad.wmnet [production]
15:27 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1012.eqiad.wmnet with reason: host reimage [production]
15:27 <samtar@deploy2002> mwscript-k8s job started: cleanupWatchlistLabelMember.php --wiki=testwiki # T420328 [production]
15:27 <filippo@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudnet2008-dev.codfw.wmnet [production]
15:25 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1015.eqiad.wmnet with OS bookworm [production]
15:23 <jmm@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
15:22 <btullis@cumin1003> START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-jumbo-eqiad [production]
15:21 <jmm@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
15:21 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.add_k8s_etcd_node [toolsbeta]
15:20 <filippo@cumin1003> START - Cookbook sre.hosts.reboot-single for host cloudnet2008-dev.codfw.wmnet [production]
15:20 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1006.eqiad.wmnet [production]
15:20 <mvernon@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be1005.eqiad.wmnet [production]
15:18 <jmm@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
15:18 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] (duration: 06m 32s) [production]
15:16 <jmm@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
15:16 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 16509 [production]
15:15 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) [toolsbeta]
15:14 <filippo@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudservices2005-dev.codfw.wmnet [production]
15:14 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
15:13 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:13 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1012.eqiad.wmnet with OS bookworm [production]
15:12 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2048.codfw.wmnet [production]
15:12 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.remove_k8s_etcd_node [toolsbeta]
15:11 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1244723|cleanup: Growth: Remove temporary GrowthMentorList overrides (T418518)]] [production]
15:11 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) [toolsbeta]
15:10 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.remove_k8s_etcd_node [toolsbeta]
15:10 <brennen@deploy2002> Finished deploy [phabricator/deployment@e845707]: deploy phab1004 for T420366 (duration: 01m 02s) [production]
15:10 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.remove_k8s_etcd_node (exit_code=99) [toolsbeta]
15:09 <mvernon@cumin1003> START - Cookbook sre.hosts.reboot-single for host thanos-be1005.eqiad.wmnet [production]