551-600 of 8383 results (17ms)
2025-09-30 §
17:58 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
17:58 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
17:57 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
17:57 <bking@cumin2002> START - Cookbook sre.wdqs.data-transfer (T405978, transfer scholarly graph to newly-reimaged host) xfer scholarly_articles from wdqs2023.codfw.wmnet -> wdqs2016.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
17:56 <bking@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wdqs2016.codfw.wmnet with OS bullseye [production]
16:33 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2016.codfw.wmnet with reason: host reimage [production]
16:30 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2016.codfw.wmnet with reason: host reimage [production]
16:12 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2016.codfw.wmnet with OS bullseye [production]
16:11 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts wdqs2016.codfw.wmnet [production]
16:11 <bking@cumin2002> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host wdqs2016.codfw.wmnet [production]
14:58 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:57 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:42 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:41 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:41 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:40 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:40 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:40 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:39 <bking@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:30 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:27 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host wdqs2016.codfw.wmnet [production]
14:26 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts wdqs2016.codfw.wmnet [production]
12:51 <bking@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2016.codfw.wmnet with OS bullseye [production]
2025-09-29 §
22:56 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
22:56 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
22:55 <bking@cumin1002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
22:55 <bking@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
22:54 <bking@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2017.codfw.wmnet with OS bullseye [production]
21:48 <bking@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2016.codfw.wmnet with reason: host reimage [production]
21:45 <bking@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2016.codfw.wmnet with reason: host reimage [production]
21:33 <bking@cumin1002> START - Cookbook sre.hosts.reimage for host wdqs2017.codfw.wmnet with OS bullseye [production]
21:28 <bking@cumin1002> START - Cookbook sre.hosts.reimage for host wdqs2016.codfw.wmnet with OS bullseye [production]
2025-09-23 §
18:08 <bking@cumin1002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in search_codfw [production]
18:08 <bking@cumin1002> START - Cookbook sre.elasticsearch.ban Unbanning all hosts in search_codfw [production]
16:25 <bking@cumin1002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: cirrussearch2093.codfw.wmnet for thread pool rejections - bking@cumin1002 - T399891 [production]
16:25 <bking@cumin1002> START - Cookbook sre.elasticsearch.ban Banning hosts: cirrussearch2093.codfw.wmnet for thread pool rejections - bking@cumin1002 - T399891 [production]
2025-09-16 §
13:53 <bking@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
13:51 <bking@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
13:51 <bking@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:51 <bking@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
2025-09-12 §
17:49 <bking@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
17:49 <bking@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
17:48 <bking@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
17:48 <bking@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
2025-09-10 §
13:50 <bking@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
13:48 <bking@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]