2023-03-02
15:59 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Fix DNS typo in record for cr2-eqiad gr-3/3/0.2 - cmooney@cumin1001" [production]
15:58 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Fix DNS typo in record for cr2-eqiad gr-3/3/0.2 - cmooney@cumin1001" [production]
15:55 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
15:41 <jynus> restart db2099 T330218 [production]
14:32 <Lucas_WMDE> UTC afternoon backport+config window done [production]
14:29 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:891571|Remove unused Wikibase config variables (T330410)]] (duration: 08m 41s) [production]
14:23 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Backport for [[gerrit:891571|Remove unused Wikibase config variables (T330410)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
14:21 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:891571|Remove unused Wikibase config variables (T330410)]] [production]
13:58 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
13:58 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
13:51 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
13:49 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
13:48 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1010.eqiad.wmnet with OS bullseye [production]
13:48 <dcaro@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - dcaro@cumin1001" [production]
13:47 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
13:47 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . [production]
13:46 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . [production]
13:46 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
13:45 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
13:42 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
13:40 <elukey@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
11:48 <mvolz@deploy2002> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
11:48 <mvolz@deploy2002> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
11:47 <mvolz@deploy2002> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
11:47 <mvolz@deploy2002> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
11:46 <mvolz@deploy2002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:46 <mvolz@deploy2002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:42 <mvolz@deploy2002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:42 <mvolz@deploy2002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:13 <mvolz@deploy2002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:11 <mvolz@deploy2002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:00 <mvolz@deploy2002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
10:42 <claime> Running authdns-update for 893675 [production]
10:27 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-serve1006.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
10:21 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve1008.eqiad.wmnet with OS bullseye [production]
10:16 <aqu@deploy2002> Finished deploy [airflow-dags/analytics_test@9568478]: Re-Deploy Airflow upgrade branch for analytics_test (duration: 00m 12s) [production]
10:16 <aqu@deploy2002> Started deploy [airflow-dags/analytics_test@9568478]: Re-Deploy Airflow upgrade branch for analytics_test [production]
10:08 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve1007.eqiad.wmnet with OS bullseye [production]
10:05 <dcaro@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - dcaro@cumin1001" [production]
10:03 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host ml-serve1006.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
09:50 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1008.eqiad.wmnet with reason: host reimage [production]
09:48 <dcaro@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1010.eqiad.wmnet with reason: host reimage [production]
09:47 <elukey@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1008.eqiad.wmnet with reason: host reimage [production]
09:44 <dcaro@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1010.eqiad.wmnet with reason: host reimage [production]
09:38 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1007.eqiad.wmnet with reason: host reimage [production]
09:35 <elukey@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1007.eqiad.wmnet with reason: host reimage [production]
09:28 <dcaro@cumin1001> START - Cookbook sre.hosts.reimage for host cloudcephosd1010.eqiad.wmnet with OS bullseye [production]
09:20 <elukey@cumin2002> START - Cookbook sre.hosts.reimage for host ml-serve1008.eqiad.wmnet with OS bullseye [production]
09:14 <jnuche@deploy2002> rebuilt and synchronized wikiversions files: all wikis to 1.40.0-wmf.25 refs T325588 [production]
09:13 <dcaro@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cloudcephosd1010'] [production]