2024-08-19
16:28 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage [production]
16:26 <ladsgroup@deploy1003> Finished scap sync-world: Backport for [[gerrit:1063837|Reduce rate-limit for trusted editors of commons to 1500 every 3m (T370304)]] (duration: 06m 33s) [production]
16:25 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage [production]
16:23 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2025.codfw.wmnet with OS bullseye [production]
16:23 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2024.codfw.wmnet with OS bullseye [production]
16:23 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2023.codfw.wmnet with OS bullseye [production]
16:23 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs2022.codfw.wmnet with OS bullseye [production]
16:21 <ladsgroup@deploy1003> ladsgroup: Continuing with sync [production]
16:21 <ladsgroup@deploy1003> ladsgroup: Backport for [[gerrit:1063837|Reduce rate-limit for trusted editors of commons to 1500 every 3m (T370304)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
16:20 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-sd2001.codfw.wmnet with OS bookworm [production]
16:20 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:20 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-sd2003.codfw.wmnet with OS bookworm [production]
16:20 <jhancock@cumin2002> END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:19 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1063837|Reduce rate-limit for trusted editors of commons to 1500 every 3m (T370304)]] [production]
16:13 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:08 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:07 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1036.eqiad.wmnet with OS bullseye [production]
15:53 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-sd2001.codfw.wmnet with reason: host reimage [production]
15:50 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-sd2003.codfw.wmnet with reason: host reimage [production]
15:49 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
15:49 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
15:46 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logging-sd2001.codfw.wmnet with reason: host reimage [production]
15:46 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logging-sd2003.codfw.wmnet with reason: host reimage [production]
15:39 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp2035.codfw.wmnet [reason: T372160] [production]
15:36 <sukhe@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade/restart of Apache Traffic Server on P{cp6016*} and A:cp for 9.2.5-1wm2 [production]
15:32 <sukhe@cumin1002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on P{cp6016*} and A:cp for 9.2.5-1wm2 [production]
15:30 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" [production]
15:30 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host logging-sd2003.codfw.wmnet with OS bookworm [production]
15:30 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host logging-sd2002.codfw.wmnet with OS bookworm [production]
15:30 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host logging-sd2001.codfw.wmnet with OS bookworm [production]
15:29 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['logging-sd2003'] [production]
15:29 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logging-sd2003'] [production]
15:27 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
15:26 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['logging-sd2002'] [production]
15:25 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
15:25 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logging-sd2002'] [production]
15:25 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host logging-sd2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
15:24 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" [production]
15:19 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" [production]
15:16 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
15:13 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" [production]
15:08 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" [production]
15:05 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
15:04 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1301.eqiad.wmnet with reason: host reimage [production]
15:02 <jhancock@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['logging-sd2001'] [production]
15:02 <jhancock@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['logging-sd2003'] [production]
15:00 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1302.eqiad.wmnet with reason: host reimage [production]
14:57 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1304.eqiad.wmnet with reason: host reimage [production]
14:55 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logging-sd2003'] [production]
14:55 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logging-sd2001'] [production]