901-950 of 10000 results (93ms)
2023-08-23 ยง
20:09 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2021.codfw.wmnet [production]
20:05 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kubernetes2049.mgmt.codfw.wmnet with reboot policy FORCED [production]
20:03 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kubernetes2050.mgmt.codfw.wmnet with reboot policy FORCED [production]
20:00 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2021.codfw.wmnet [production]
20:00 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2019.codfw.wmnet [production]
19:59 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kubernetes2051.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:55 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2049.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:53 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2050.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:52 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2019.codfw.wmnet [production]
19:52 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2014.codfw.wmnet [production]
19:48 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2051.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:47 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes2052.codfw.wmnet with OS bullseye [production]
19:46 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kubernetes2051.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:45 <pt1979@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['kubernetes2052'] [production]
19:43 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2014.codfw.wmnet [production]
19:43 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2013.codfw.wmnet [production]
19:35 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2013.codfw.wmnet [production]
19:34 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2051.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:32 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:32 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add mgmt DNS for kubernetes2051 - pt1979@cumin2002" [production]
19:32 <pt1979@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['kubernetes2052'] [production]
19:31 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add mgmt DNS for kubernetes2051 - pt1979@cumin2002" [production]
19:31 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2053.codfw.wmnet with OS bullseye [production]
19:31 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
19:31 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kubernetes2052.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:29 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
19:28 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
19:21 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cassandra-dev2003.codfw.wmnet [production]
19:20 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2052.mgmt.codfw.wmnet with reboot policy FORCED [production]
19:14 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host cassandra-dev2003.codfw.wmnet [production]
19:13 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cassandra-dev2002.codfw.wmnet [production]
19:12 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2053.codfw.wmnet with reason: host reimage [production]
19:09 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2053.codfw.wmnet with reason: host reimage [production]
19:06 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host cassandra-dev2002.codfw.wmnet [production]
18:57 <eevans@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host cassandra-dev2001.codfw.wmnet [production]
18:56 <htriedman@deploy1002> Finished deploy [airflow-dags/platform_eng@33de526]: (no justification provided) (duration: 00m 20s) [production]
18:55 <htriedman@deploy1002> Started deploy [airflow-dags/platform_eng@33de526]: (no justification provided) [production]
18:45 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host cassandra-dev2001.codfw.wmnet [production]
18:45 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes2053.codfw.wmnet with OS bullseye [production]
18:38 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2053.codfw.wmnet with OS bullseye [production]
18:19 <dduvall@deploy1002> Synchronized php: group1 wikis to 1.41.0-wmf.23 refs T343725 (duration: 06m 01s) [production]
18:19 <herron> re-enabled icinga meta-monitoring on wikitech-static [production]
18:17 <denisse> alert hosts maintenance finished [production]
18:13 <denisse> making alert1001 the primary alert host [production]
18:09 <denisse> updating DNS to point to alert1001 [production]
18:03 <denisse> failing over from alert2001 to alert1001 [production]
17:51 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert1001.wikimedia.org [production]
17:51 <denisse@cumin1001> START - Cookbook sre.hosts.reboot-single for host alert1001.wikimedia.org [production]
17:47 <denisse> make alert2001 the active host [production]
17:31 <denisse> failing over alert1001 to alert2001 [production]