2025-06-14
05:13 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=99) [admin]
05:12 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
05:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1020.eqiad.wmnet with OS bullseye [production]
04:55 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
04:52 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
04:36 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1020.eqiad.wmnet with OS bullseye [production]
04:35 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1020.eqiad.wmnet'] [production]
04:29 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1020.eqiad.wmnet'] [production]
04:29 <andrew@cumin1002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cloudcephosd1020.eqiad.wmnet [production]
04:25 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (T309789) [admin]
04:23 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.drain_node (exit_code=99) (T309789) [admin]
04:23 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (T309789) [admin]
04:21 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1020.eqiad.wmnet [production]
04:21 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=0) [admin]
04:20 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.depool_and_destroy [admin]
04:09 <ryankemper> [WDQS] Restarted blazegraph on `wdqs2009`. Probedown already resolved before the restart, so this might be unnecessary, but restarting just in case [production]
04:02 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=99) [admin]
04:02 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.depool_and_destroy [admin]
03:59 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=99) [admin]
03:58 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.depool_and_destroy [admin]
03:57 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.drain_node (exit_code=0) (T309789) [admin]
03:56 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (T309789) [admin]
03:56 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.drain_node (exit_code=99) (T309789) [admin]
03:55 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (T309789) [admin]
03:54 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=0) [admin]
03:54 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.undrain_node [admin]
03:51 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.drain_node (exit_code=97) (T309789) [admin]
02:09 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/248 [admin]
02:08 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/248 [admin]
02:08 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/245 [admin]
02:07 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/245 [admin]
01:38 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.drain_node (T309789) [admin]
01:37 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=0) [admin]
01:37 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.undrain_node [admin]
01:36 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) [admin]
01:36 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
01:36 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=97) [admin]
00:17 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.drain_node (exit_code=99) (T309789) [admin]
00:09 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1186.eqiad.wmnet with OS bullseye [production]
00:08 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1185.eqiad.wmnet with OS bullseye [production]
2025-06-13
23:58 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1185.eqiad.wmnet with OS bullseye [production]
23:42 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1185.eqiad.wmnet with OS bullseye [production]
23:31 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1185.eqiad.wmnet with OS bullseye [production]
23:28 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
23:27 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
23:22 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1186.eqiad.wmnet with OS bullseye [production]
23:16 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1185.eqiad.wmnet with OS bullseye [production]
23:15 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1186.eqiad.wmnet with OS bullseye [production]
22:26 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1186.eqiad.wmnet with OS bullseye [production]
22:25 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]