2101-2150 of 10000 results (104ms)
2024-06-28 ยง
13:53 <akosiaris@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on deploy1003.eqiad.wmnet with reason: host reimage [production]
13:49 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on krb1002.eqiad.wmnet with reason: host reimage [production]
13:49 <hnowlan@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2298 to wikikube-worker2025 - hnowlan@cumin1002" [production]
13:46 <hnowlan@cumin1002> END (FAIL) - Cookbook sre.hosts.rename (exit_code=93) from mw2300 to wikikube-worker2026 [production]
13:46 <hnowlan@cumin1002> START - Cookbook sre.hosts.rename from mw2300 to wikikube-worker2026 [production]
13:46 <hnowlan@cumin1002> START - Cookbook sre.dns.netbox [production]
13:46 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1027.eqiad.wmnet with reason: host reimage [production]
13:45 <hnowlan@cumin1002> START - Cookbook sre.hosts.rename from mw2298 to wikikube-worker2025 [production]
13:45 <jclark@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on krb1002.eqiad.wmnet with reason: host reimage [production]
13:43 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1027.eqiad.wmnet with reason: host reimage [production]
13:42 <dani@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
13:42 <dani@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
13:42 <dani@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
13:42 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host deploy1003.eqiad.wmnet with OS bullseye [production]
13:42 <dani@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
13:42 <dani@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
13:41 <dani@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
13:41 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host deploy1003.eqiad.wmnet with OS bookworm [production]
13:41 <akosiaris@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - akosiaris@cumin1002" [production]
13:38 <btullis@deploy1002> helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: apply [production]
13:38 <btullis@deploy1002> helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: apply [production]
13:38 <btullis@deploy1002> helmfile [codfw] DONE helmfile.d/services/eventgate-analytics-external: apply [production]
13:37 <btullis@deploy1002> helmfile [codfw] START helmfile.d/services/eventgate-analytics-external: apply [production]
13:32 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host krb1002.eqiad.wmnet with OS bookworm [production]
13:29 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1027.eqiad.wmnet with OS bullseye [production]
13:29 <akosiaris@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - akosiaris@cumin1002" [production]
13:28 <hnowlan> running `decommission` on 5 codfw api appservers [production]
13:27 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1028.mgmt.eqiad.wmnet on all recursors [production]
13:27 <cmooney@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker1028.mgmt.eqiad.wmnet on all recursors [production]
13:27 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1027.mgmt.eqiad.wmnet on all recursors [production]
13:27 <cmooney@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker1027.mgmt.eqiad.wmnet on all recursors [production]
13:26 <cgoubert@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1027.eqiad.wmnet with OS bullseye [production]
13:25 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:25 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix entries for wikikube-worker102[7-8] - cmooney@cumin1002" [production]
13:24 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix entries for wikikube-worker102[7-8] - cmooney@cumin1002" [production]
13:21 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
13:18 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1028.mgmt.eqiad.wmnet on all recursors [production]
13:18 <cmooney@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker1028.mgmt.eqiad.wmnet on all recursors [production]
13:18 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1027.mgmt.eqiad.wmnet on all recursors [production]
13:18 <cmooney@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker1027.mgmt.eqiad.wmnet on all recursors [production]
13:15 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on deploy1003.eqiad.wmnet with reason: host reimage [production]
13:12 <akosiaris@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on deploy1003.eqiad.wmnet with reason: host reimage [production]
13:11 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1029.eqiad.wmnet with OS bullseye [production]
13:06 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1028.eqiad.wmnet with reason: mgmt ip issue [production]
13:05 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1028.eqiad.wmnet with reason: mgmt ip issue [production]
13:01 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host deploy1003.eqiad.wmnet with OS bookworm [production]
12:59 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2204 (T367856)', diff saved to https://phabricator.wikimedia.org/P65547 and previous config saved to /var/cache/conftool/dbconfig/20240628-125926-marostegui.json [production]
12:55 <hashar@deploy1002> Finished deploy [gerrit/gerrit@0db053e]: Upgrade Gerrit 3.10.0-32-gf77960412e to 3.10.0-71-gf6e9431fff - T367029 T341291 (duration: 00m 09s) [production]
12:55 <hashar@deploy1002> Started deploy [gerrit/gerrit@0db053e]: Upgrade Gerrit 3.10.0-32-gf77960412e to 3.10.0-71-gf6e9431fff - T367029 T341291 [production]
12:53 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1029.eqiad.wmnet with reason: host reimage [production]