4201-4250 of 10000 results (140ms)
2025-06-16 §
05:35 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1162.eqiad.wmnet [production]
05:35 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1160.eqiad.wmnet [production]
05:33 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1160.eqiad.wmnet [production]
05:31 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P77962 and previous config saved to /var/cache/conftool/dbconfig/20250616-053150-root.json [production]
05:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167 (T396130)', diff saved to https://phabricator.wikimedia.org/P77961 and previous config saved to /var/cache/conftool/dbconfig/20250616-052530-marostegui.json [production]
05:16 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P77960 and previous config saved to /var/cache/conftool/dbconfig/20250616-051644-root.json [production]
05:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1167 (T396130)', diff saved to https://phabricator.wikimedia.org/P77959 and previous config saved to /var/cache/conftool/dbconfig/20250616-050637-marostegui.json [production]
05:06 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
05:06 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
05:01 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P77958 and previous config saved to /var/cache/conftool/dbconfig/20250616-050139-root.json [production]
04:58 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2204.codfw.wmnet with reason: Maintenance [production]
04:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2204 T396549', diff saved to https://phabricator.wikimedia.org/P77957 and previous config saved to /var/cache/conftool/dbconfig/20250616-045738-marostegui.json [production]
04:52 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2161.codfw.wmnet with reason: Maintenance [production]
2025-06-15 §
18:09 <aokoth@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10 days, 0:00:00 on doc1003.eqiad.wmnet with reason: Bookworm Migration [production]
2025-06-14 §
22:38 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:35 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
22:24 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:24 <andrew@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: T396940 - andrew@cumin1002" [production]
22:23 <andrew@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: T396940 - andrew@cumin1002" [production]
22:18 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
21:51 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1024.eqiad.wmnet with OS bullseye [production]
21:35 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
21:31 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
21:16 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1024.eqiad.wmnet with OS bullseye [production]
21:15 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1024.eqiad.wmnet'] [production]
21:08 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1024.eqiad.wmnet'] [production]
21:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts cloudcephosd1024.eqiad.wmnet [production]
21:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1024.eqiad.wmnet [production]
20:58 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1024.eqiad.wmnet [production]
20:46 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1024.eqiad.wmnet [production]
19:59 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1023.eqiad.wmnet with OS bullseye [production]
19:45 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage [production]
19:41 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage [production]
19:26 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1023.eqiad.wmnet with OS bullseye [production]
19:17 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1023.eqiad.wmnet'] [production]
19:11 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1023.eqiad.wmnet'] [production]
19:11 <andrew@cumin1002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cloudcephosd1023.eqiad.wmnet [production]
19:11 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host cloudcephosd1023.eqiad.wmnet [production]
19:03 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1023.eqiad.wmnet [production]
18:51 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1023.eqiad.wmnet [production]
13:17 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1022.eqiad.wmnet with OS bullseye [production]
13:01 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1022.eqiad.wmnet with reason: host reimage [production]
12:56 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1022.eqiad.wmnet with reason: host reimage [production]
12:41 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1022.eqiad.wmnet with OS bullseye [production]
12:39 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1022.eqiad.wmnet'] [production]
12:29 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1022.eqiad.wmnet'] [production]
12:26 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts cloudcephosd1022.eqiad.wmnet [production]
12:26 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1022.eqiad.wmnet [production]
12:16 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1022.eqiad.wmnet [production]
12:01 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1022.eqiad.wmnet [production]