1-50 of 10000 results (94ms)
2025-06-15 §
18:09 <aokoth@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10 days, 0:00:00 on doc1003.eqiad.wmnet with reason: Bookworm Migration [production]
2025-06-14 §
22:38 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:35 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
22:24 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:24 <andrew@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: T396940 - andrew@cumin1002" [production]
22:23 <andrew@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: T396940 - andrew@cumin1002" [production]
22:18 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
21:51 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1024.eqiad.wmnet with OS bullseye [production]
21:35 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
21:31 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
21:16 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1024.eqiad.wmnet with OS bullseye [production]
21:15 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1024.eqiad.wmnet'] [production]
21:08 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1024.eqiad.wmnet'] [production]
21:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts cloudcephosd1024.eqiad.wmnet [production]
21:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1024.eqiad.wmnet [production]
20:58 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1024.eqiad.wmnet [production]
20:46 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1024.eqiad.wmnet [production]
19:59 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1023.eqiad.wmnet with OS bullseye [production]
19:45 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage [production]
19:41 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage [production]
19:26 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1023.eqiad.wmnet with OS bullseye [production]
19:17 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1023.eqiad.wmnet'] [production]
19:11 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1023.eqiad.wmnet'] [production]
19:11 <andrew@cumin1002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cloudcephosd1023.eqiad.wmnet [production]
19:11 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host cloudcephosd1023.eqiad.wmnet [production]
19:03 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1023.eqiad.wmnet [production]
18:51 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1023.eqiad.wmnet [production]
13:17 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1022.eqiad.wmnet with OS bullseye [production]
13:01 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1022.eqiad.wmnet with reason: host reimage [production]
12:56 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1022.eqiad.wmnet with reason: host reimage [production]
12:41 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1022.eqiad.wmnet with OS bullseye [production]
12:39 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1022.eqiad.wmnet'] [production]
12:29 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1022.eqiad.wmnet'] [production]
12:26 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts cloudcephosd1022.eqiad.wmnet [production]
12:26 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1022.eqiad.wmnet [production]
12:16 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1022.eqiad.wmnet [production]
12:01 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1022.eqiad.wmnet [production]
07:39 <stevemunene@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host an-worker1157.eqiad.wmnet [production]
05:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1020.eqiad.wmnet with OS bullseye [production]
04:55 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
04:52 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
04:36 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1020.eqiad.wmnet with OS bullseye [production]
04:35 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1020.eqiad.wmnet'] [production]
04:29 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1020.eqiad.wmnet'] [production]
04:29 <andrew@cumin1002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cloudcephosd1020.eqiad.wmnet [production]
04:21 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1020.eqiad.wmnet [production]
04:09 <ryankemper> [WDQS] Restarted blazegraph on `wdqs2009`. Probedown already resolved before the restart so this might be necessary but restarting just in case [production]
00:09 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1186.eqiad.wmnet with OS bullseye [production]
00:08 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1185.eqiad.wmnet with OS bullseye [production]
2025-06-13 §
23:58 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1185.eqiad.wmnet with OS bullseye [production]