251-300 of 10000 results (107ms)
2026-03-13 ยง
12:42 <bwojtowicz@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
12:40 <jynus@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host backup1018.eqiad.wmnet [production]
12:39 <jynus@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host backup2016.codfw.wmnet [production]
12:31 <jelto@cumin1003> START - Cookbook sre.gitlab.reboot-runner rolling reboot on A:gitlab-runner [production]
12:29 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup1018.eqiad.wmnet [production]
12:29 <jynus@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host backup1017.eqiad.wmnet [production]
12:28 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup2016.codfw.wmnet [production]
12:27 <jynus@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host backup2015.codfw.wmnet [production]
12:24 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host bast1004.wikimedia.org [production]
12:18 <aokoth@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host doc1004.eqiad.wmnet [production]
12:18 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup1017.eqiad.wmnet [production]
12:17 <daniel@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
12:17 <daniel@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
12:15 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup2015.codfw.wmnet [production]
12:15 <daniel@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
12:15 <daniel@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
12:14 <aokoth@cumin1003> START - Cookbook sre.hosts.reboot-single for host doc1004.eqiad.wmnet [production]
12:13 <aokoth@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aphlict2001.codfw.wmnet [production]
12:10 <aokoth@cumin1003> START - Cookbook sre.hosts.reboot-single for host aphlict2001.codfw.wmnet [production]
12:10 <jmm@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: reboot [production]
12:10 <aokoth@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host vrts2002.codfw.wmnet [production]
12:07 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
12:07 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
12:03 <aokoth@cumin1003> START - Cookbook sre.hosts.reboot-single for host vrts2002.codfw.wmnet [production]
12:02 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
12:02 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
12:01 <jynus@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host backup1016.eqiad.wmnet [production]
12:01 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1019.eqiad.wmnet [production]
11:59 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1018.eqiad.wmnet [production]
11:59 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
11:59 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
11:54 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1019.eqiad.wmnet [production]
11:54 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1018.eqiad.wmnet [production]
11:51 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
11:51 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
11:50 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup1016.eqiad.wmnet [production]
11:49 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-backup2004.codfw.wmnet [production]
11:43 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host ms-backup2004.codfw.wmnet [production]
11:43 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-backup1004.eqiad.wmnet [production]
11:37 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host ms-backup1004.eqiad.wmnet [production]
11:36 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-backup2003.codfw.wmnet [production]
11:34 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-backup1003.eqiad.wmnet [production]
11:32 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker1018.eqiad.wmnet with OS bookworm [production]
11:32 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
11:30 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host ms-backup2003.codfw.wmnet [production]
11:28 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host ms-backup1003.eqiad.wmnet [production]
11:27 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
11:26 <jelto@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host tcp-proxy1001.eqiad.wmnet [production]
11:21 <arnaudb@cumin1003> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host contint1003.wikimedia.org [production]
11:21 <jelto@cumin1003> START - Cookbook sre.hosts.reboot-single for host tcp-proxy1001.eqiad.wmnet [production]