1451-1500 of 10000 results (82ms)
2023-09-18 ยง
20:12 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on dbstore1008.eqiad.wmnet with reason: host reimage [production]
20:09 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbstore1009.eqiad.wmnet with reason: host reimage [production]
20:06 <cjming@deploy1002> cjming and dreamyjazz: Continuing with sync [production]
20:06 <cjming@deploy1002> cjming and dreamyjazz: Backport for [[gerrit:958024|clienthints: Pin wgCheckUserDisplayClientHints to false (T337942)]] synced to the testservers mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:06 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on dbstore1009.eqiad.wmnet with reason: host reimage [production]
20:05 <cjming@deploy1002> Started scap: Backport for [[gerrit:958024|clienthints: Pin wgCheckUserDisplayClientHints to false (T337942)]] [production]
19:46 <isaranto@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
19:43 <isaranto@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
19:22 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host dbstore1009.eqiad.wmnet with OS bullseye [production]
19:22 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host dbstore1008.eqiad.wmnet with OS bullseye [production]
18:02 <ejegg> re-enabled donor thank you mail send jobs [production]
17:50 <ejegg> civicrm upgraded from 0c2853aa to 0a36997d [production]
17:48 <ejegg> disabled donor thank you mail send jobs for Civi update [production]
16:41 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1145.eqiad.wmnet with OS bullseye [production]
16:30 <jhancock@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['dbstore1009'] [production]
16:29 <jhancock@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['dbstore1008'] [production]
16:25 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1144.eqiad.wmnet with OS bullseye [production]
16:24 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['dbstore1009'] [production]
16:23 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['dbstore1009'] [production]
16:23 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['dbstore1009'] [production]
16:23 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['dbstore1008'] [production]
16:17 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1145.eqiad.wmnet with reason: host reimage [production]
16:15 <stevemunene@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1145.eqiad.wmnet with reason: host reimage [production]
16:14 <jnuche@deploy1002> Installation of scap version "4.61.1" completed for 601 hosts [production]
16:13 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1036.eqiad.wmnet with OS bullseye [production]
16:13 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:12 <jnuche@deploy1002> Installing scap version "4.61.1" for 601 hosts [production]
16:11 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:03 <stevemunene@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1145.eqiad.wmnet with OS bullseye [production]
16:01 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1144.eqiad.wmnet with reason: host reimage [production]
15:57 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1047.eqiad.wmnet with OS bullseye [production]
15:57 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
15:57 <stevemunene@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1144.eqiad.wmnet with reason: host reimage [production]
15:56 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1036.eqiad.wmnet with reason: host reimage [production]
15:53 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
15:53 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1036.eqiad.wmnet with reason: host reimage [production]
15:53 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1038.eqiad.wmnet with OS bullseye [production]
15:53 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
15:53 <jdrewniak@deploy1002> Synchronized portals: Wikimedia Portals Update: [[gerrit:958512| Bumping portals to master (T128546)]] (duration: 08m 31s) [production]
15:51 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes1036.eqiad.wmnet with OS bullseye [production]
15:51 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
15:44 <jdrewniak@deploy1002> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:958512| Bumping portals to master (T128546)]] (duration: 08m 45s) [production]
15:43 <stevemunene@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1144.eqiad.wmnet with OS bullseye [production]
15:36 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1047.eqiad.wmnet with reason: host reimage [production]
15:34 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1038.eqiad.wmnet with reason: host reimage [production]
15:30 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1047.eqiad.wmnet with reason: host reimage [production]
15:30 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1038.eqiad.wmnet with reason: host reimage [production]
15:29 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host kubernetes1036 [production]
15:29 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes1047.eqiad.wmnet with OS bullseye [production]
15:29 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes1038.eqiad.wmnet with OS bullseye [production]