6551-6600 of 7789 results (33ms)
2023-03-08 §
16:25 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "update location of elastic1061 - bking@cumin2002 - T322082" [production]
16:25 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['elastic1063.eqiad.wmnet'] [production]
16:23 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "update location of elastic1061 - bking@cumin2002 - T322082" [production]
16:22 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1060.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
16:19 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1061.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
16:14 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1062.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
16:10 <bking@cumin2002> START - Cookbook sre.hosts.provision for host elastic1062.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
16:08 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on elastic1062.eqiad.wmnet with reason: re-rack [production]
16:08 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on elastic1062.eqiad.wmnet with reason: re-rack [production]
16:08 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host elastic1062.eqiad.wmnet [production]
16:06 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on elastic1061.eqiad.wmnet with reason: re-rack [production]
16:06 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on elastic1061.eqiad.wmnet with reason: re-rack [production]
16:03 <bking@cumin2002> START - Cookbook sre.hosts.provision for host elastic1060.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
16:02 <bking@cumin2002> START - Cookbook sre.hosts.provision for host elastic1061.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
16:00 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host elastic1062.eqiad.wmnet [production]
16:00 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host elastic1061.eqiad.wmnet [production]
15:59 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['elastic1062.eqiad.wmnet'] [production]
15:55 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1063.eqiad.wmnet'] [production]
15:54 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['elastic1063.eqiad.wmnet'] [production]
15:54 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host elastic1061.eqiad.wmnet [production]
15:52 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1062.eqiad.wmnet'] [production]
15:52 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['elastic1062.eqiad.wmnet'] [production]
15:46 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1063.eqiad.wmnet'] [production]
15:42 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1062.eqiad.wmnet'] [production]
15:33 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['elastic1061.eqiad.wmnet'] [production]
15:31 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['elastic1060.eqiad.wmnet'] [production]
15:26 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1061.eqiad.wmnet'] [production]
15:23 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1060.eqiad.wmnet'] [production]
14:25 <inflatador> bking@cumin2002 powering down elastic1060-66 for re-rack T322082 [production]
2023-03-07 §
21:58 <inflatador> bking@cumin2002 depool elastic row D hosts to prepare for T322082 [production]
21:57 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 7 hosts with reason: re-rack [production]
21:56 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 7 hosts with reason: re-rack [production]
21:41 <inflatador> bking@cumin2002 ban elastic row D hosts to prepare for T322082 [production]
21:07 <bking@deploy2002> Finished deploy [airflow-dags/search@d533716]: initial deployment to search platform airflow 2 instance-bk (duration: 00m 41s) [production]
21:07 <bking@deploy2002> Started deploy [airflow-dags/search@d533716]: initial deployment to search platform airflow 2 instance-bk [production]
20:17 <bking@deploy2002> Finished deploy [airflow-dags/search@9924c93]: initial deployment to search platform airflow 2 instance-bk (duration: 01m 18s) [production]
20:16 <bking@deploy2002> Started deploy [airflow-dags/search@9924c93]: initial deployment to search platform airflow 2 instance-bk [production]
17:51 <inflatador> bking@cumin2002 repool wdqs hosts post-maintenance T329073 [production]
16:58 <bking@cumin2002> conftool action : set/pooled=true; selector: dnsdisc=wdqs,name=eqiad [production]
14:56 <inflatador> bking@cumin2002 unban production row A elastic nodes from all clusters T329073 [production]
14:52 <inflatador> bking@cumin2002 unban row A cloudelastic nodes T329073 [production]
2023-03-06 §
23:20 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wcqs1001.eqiad.wmnet,wdqs[1003-1004,1006,1011].eqiad.wmnet with reason: switch maintenance [production]
23:20 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wcqs1001.eqiad.wmnet,wdqs[1003-1004,1006,1011].eqiad.wmnet with reason: switch maintenance [production]
23:19 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 12 hosts with reason: switch maintenance [production]
23:19 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 12 hosts with reason: switch maintenance [production]
23:16 <inflatador> bking@cumin2002 ban row A cloudelastic hosts T329073 [production]
23:04 <inflatador> bking@cumin2002 'depool wcqs and wdqs row A hosts T329073' [production]
2023-03-03 §
20:58 <inflatador> bking@cumin2002 persistently unban all elastic nodes in eqiad T322082 [production]
20:55 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Update location of elastic1059 - bking@cumin2002 - T322082" [production]
20:52 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Update location of elastic1059 - bking@cumin2002 - T322082" [production]