4101-4150 of 10000 results (156ms)
2025-06-05 ยง
19:14 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1186 [production]
19:13 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host an-worker1186 [production]
19:13 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:12 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:12 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1186 [production]
19:12 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host an-worker1186 [production]
18:52 <phuedx> Disabled the SDS 2.4.11 Synthetic A/A Test in xLab [production]
18:49 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1185.eqiad.wmnet with OS bullseye [production]
18:48 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:43 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:32 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1185.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:30 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:21 <dduvall@deploy1003> rebuilt and synchronized wikiversions files: group2 to 1.45.0-wmf.4 refs T392174 [production]
18:21 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1186.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:20 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1186 [production]
18:20 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host an-worker1186 [production]
18:19 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1185.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:18 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply [production]
18:17 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
18:17 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-reload reloading scholarly_articles on wdqs1023.eqiad.wmnet from DumpsSource.HDFS (hdfs:///wmf/data/discovery/wikidata/munged_n3_dump/wikidata/scholarly/20250526/ using stat1011.eqiad.wmnet) [production]
18:17 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1185.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:17 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1185.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:05 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
18:04 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
17:28 <bd808@deploy1003> helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply [production]
17:27 <bd808@deploy1003> helmfile [eqiad] START helmfile.d/services/developer-portal: apply [production]
17:25 <bd808@deploy1003> helmfile [codfw] DONE helmfile.d/services/developer-portal: apply [production]
17:24 <bd808@deploy1003> helmfile [codfw] START helmfile.d/services/developer-portal: apply [production]
17:24 <bd808@deploy1003> helmfile [staging] DONE helmfile.d/services/developer-portal: apply [production]
17:24 <bd808@deploy1003> helmfile [staging] START helmfile.d/services/developer-portal: apply [production]
17:21 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) config_reloading P{lvs1013.eqiad.wmnet} and A:liberica [production]
17:21 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin config_reloading P{lvs1013.eqiad.wmnet} and A:liberica [production]
17:15 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs1013.eqiad.wmnet [production]
17:15 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs1013.eqiad.wmnet [production]
16:54 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for acmechief2002.codfw.wmnet,acmechief1002.eqiad.wmnet,acmechief-test2001.codfw.wmnet,acmechief-test1001.eqiad.wmnet [production]
16:54 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for acmechief2002.codfw.wmnet,acmechief1002.eqiad.wmnet,acmechief-test2001.codfw.wmnet,acmechief-test1001.eqiad.wmnet [production]
16:51 <mfossati@deploy1003> Finished deploy [airflow-dags/platform_eng@930d28b]: adapt check_bad_parsing to dumps 2.0 (duration: 01m 16s) [production]
16:50 <mfossati@deploy1003> Started deploy [airflow-dags/platform_eng@930d28b]: adapt check_bad_parsing to dumps 2.0 [production]
16:50 <brett@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on acmechief2002.codfw.wmnet,acmechief1002.eqiad.wmnet,acmechief-test2001.codfw.wmnet,acmechief-test1001.eqiad.wmnet with reason: Reboots [production]
16:27 <jdlrobson@deploy1003> Finished scap sync-world: Backport for [[gerrit:1153750|Revert "Deploy survey to en at twenty percent"]] (duration: 11m 23s) [production]
16:20 <jdlrobson@deploy1003> jdlrobson, jdrewniak: Continuing with sync [production]
16:18 <jdlrobson@deploy1003> jdlrobson, jdrewniak: Backport for [[gerrit:1153750|Revert "Deploy survey to en at twenty percent"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
16:17 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2237 (T395241)', diff saved to https://phabricator.wikimedia.org/P77191 and previous config saved to /var/cache/conftool/dbconfig/20250605-161701-fceratto.json [production]
16:16 <jdlrobson@deploy1003> Started scap sync-world: Backport for [[gerrit:1153750|Revert "Deploy survey to en at twenty percent"]] [production]
16:12 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2244.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:03 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host db2244.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:02 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host db2244 [production]
16:02 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host db2244 [production]
16:01 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2237', diff saved to https://phabricator.wikimedia.org/P77190 and previous config saved to /var/cache/conftool/dbconfig/20250605-160154-fceratto.json [production]
16:01 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]