2024-06-27
ยง
|
15:09 |
<hashar@deploy1002> |
Started deploy [gerrit/gerrit@8c94fee]: Revert "Add image-diff JavaScript plugin" |
[production] |
15:09 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts mw1373.eqiad.wmnet |
[production] |
15:09 |
<cgoubert@cumin1002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts mw1373.eqiad.wmnet |
[production] |
15:08 |
<cgoubert@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
15:08 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.rename from mw1404 to wikikube-worker1026 |
[production] |
15:04 |
<hashar@deploy1002> |
Finished deploy [gerrit/gerrit@9652bc3]: Add image-diff JavaScript plugin - T341291 (duration: 00m 07s) |
[production] |
15:04 |
<hashar@deploy1002> |
Started deploy [gerrit/gerrit@9652bc3]: Add image-diff JavaScript plugin - T341291 |
[production] |
15:03 |
<dcaro@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1007.eqiad.wmnet with OS bullseye |
[production] |
15:02 |
<jgiannelos@deploy1002> |
Finished deploy [kartotherian/deploy@483e8c3] (codfw): Bump kartotherian src to latest master (duration: 02m 49s) |
[production] |
15:00 |
<topranks> |
rebooting lsw1-e7-eqiad to upgrade JunOS on switch T365988 |
[production] |
15:00 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts mw1366.eqiad.wmnet |
[production] |
15:00 |
<cgoubert@cumin1002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts mw1366.eqiad.wmnet |
[production] |
15:00 |
<jgiannelos@deploy1002> |
Started deploy [kartotherian/deploy@483e8c3] (codfw): Bump kartotherian src to latest master |
[production] |
14:59 |
<jgiannelos@deploy1002> |
Finished deploy [kartotherian/deploy@483e8c3] (eqiad): Bump kartotherian src to latest master (duration: 03m 10s) |
[production] |
14:58 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:40:00 on an-worker[1163-1165].eqiad.wmnet,es1037.eqiad.wmnet,ms-be1078.eqiad.wmnet with reason: JunOS upgrade lsw1-e7-eqiad |
[production] |
14:58 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:40:00 on an-worker[1163-1165].eqiad.wmnet,es1037.eqiad.wmnet,ms-be1078.eqiad.wmnet with reason: JunOS upgrade lsw1-e7-eqiad |
[production] |
14:57 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from mw1365 to wikikube-worker1023 |
[production] |
14:57 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1023 |
[production] |
14:57 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:40:00 on lsw1-e7-eqiad,lsw1-e7-eqiad IPv6,ssw1-e1-eqiad.mgmt,ssw1-f1-eqiad.mgmt with reason: JunOS upgrade lsw1-e7-eqiad |
[production] |
14:57 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS bullseye |
[production] |
14:57 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:40:00 on lsw1-e7-eqiad,lsw1-e7-eqiad IPv6,ssw1-e1-eqiad.mgmt,ssw1-f1-eqiad.mgmt with reason: JunOS upgrade lsw1-e7-eqiad |
[production] |
14:56 |
<jgiannelos@deploy1002> |
Started deploy [kartotherian/deploy@483e8c3] (eqiad): Bump kartotherian src to latest master |
[production] |
14:56 |
<cgoubert@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1023 |
[production] |
14:56 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
14:55 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw1365 to wikikube-worker1023 - cgoubert@cumin1002" |
[production] |
14:54 |
<cgoubert@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw1365 to wikikube-worker1023 - cgoubert@cumin1002" |
[production] |
14:53 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host deploy1003.eqiad.wmnet with OS bullseye |
[production] |
14:52 |
<brennen@deploy1002> |
Finished deploy [phabricator/deployment@0df351e]: deploy phab1004 for minor update (duration: 00m 32s) |
[production] |
14:52 |
<cgoubert@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
14:52 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.rename from mw1365 to wikikube-worker1023 |
[production] |
14:52 |
<brennen@deploy1002> |
Started deploy [phabricator/deployment@0df351e]: deploy phab1004 for minor update |
[production] |
14:51 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from mw1359 to wikikube-worker1022 |
[production] |
14:51 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1022 |
[production] |
14:51 |
<brennen@deploy1002> |
Finished deploy [phabricator/deployment@0df351e]: test deploy phab2002 (duration: 00m 34s) |
[production] |
14:50 |
<brennen@deploy1002> |
Started deploy [phabricator/deployment@0df351e]: test deploy phab2002 |
[production] |
14:50 |
<cgoubert@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1022 |
[production] |
14:50 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
14:50 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw1359 to wikikube-worker1022 - cgoubert@cumin1002" |
[production] |
14:48 |
<cgoubert@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw1359 to wikikube-worker1022 - cgoubert@cumin1002" |
[production] |
14:46 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:50:00 on lsw1-e7-eqiad.mgmt with reason: prep JunOS upgrade lsw1-e7-eqiad |
[production] |
14:46 |
<dcaro@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1007.eqiad.wmnet with reason: host reimage |
[production] |
14:46 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:50:00 on lsw1-e7-eqiad.mgmt with reason: prep JunOS upgrade lsw1-e7-eqiad |
[production] |
14:46 |
<cgoubert@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
14:46 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.rename from mw1359 to wikikube-worker1022 |
[production] |
14:43 |
<dcaro@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1007.eqiad.wmnet with reason: host reimage |
[production] |
14:38 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1037.eqiad.wmnet with reason: T365988 |
[production] |
14:38 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on es1037.eqiad.wmnet with reason: T365988 |
[production] |
14:37 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'T365988 - depool es1037', diff saved to https://phabricator.wikimedia.org/P65531 and previous config saved to /var/cache/conftool/dbconfig/20240627-143741-arnaudb.json |
[production] |
14:15 |
<fnegri@cumin1002> |
conftool action : set/pooled=no; selector: name=clouddb1015.eqiad.wmnet,service=s4 |
[production] |
14:12 |
<dcaro@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1007.eqiad.wmnet with OS bullseye |
[production] |