2023-02-24
§
|
23:15 |
<mutante> |
people2002 - for each user who has a public_html dir that is not empty (for pubdir in $(find . -name public_html -type d -not -empty); ..); rsync it from people1003 with --delete (rsync -avp rsync://people1003.eqiad.wmnet/people-home/${pubdiruser}/public_html/ /home/${pubdiruser}/public_html/); T330091 |
[production] |
22:49 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2013.codfw.wmnet with OS bullseye |
[production] |
22:49 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" |
[production] |
22:40 |
<pt1979@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" |
[production] |
22:21 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-fe2013.codfw.wmnet with reason: host reimage |
[production] |
22:18 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-fe2013.codfw.wmnet with reason: host reimage |
[production] |
21:54 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-fe2013.codfw.wmnet with OS bullseye |
[production] |
21:26 |
<mutante> |
people2002 - performing the usual dance when device names changed after editing virtual hardware (s/ens13/ens14 in /etc/network/interfaces ... reboot) |
[production] |
21:19 |
<mutante> |
rebooting people2002 |
[production] |
21:17 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dns3002.wikimedia.org with OS bullseye |
[production] |
21:06 |
<mutante> |
ganeti2021 - adding a virtual 20G disk to people2002 - to temp get some space for backups and syncing T330091 |
[production] |
20:59 |
<fab@deploy1002> |
Finished deploy [airflow-dags/research@5edcd7b]: (no justification provided) (duration: 00m 10s) |
[production] |
20:58 |
<fab@deploy1002> |
Started deploy [airflow-dags/research@5edcd7b]: (no justification provided) |
[production] |
20:55 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dns3002.wikimedia.org with reason: host reimage |
[production] |
20:52 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dns3002.wikimedia.org with reason: host reimage |
[production] |
20:46 |
<fab@deploy1002> |
Finished deploy [airflow-dags/research@5edcd7b]: (no justification provided) (duration: 00m 19s) |
[production] |
20:45 |
<fab@deploy1002> |
Started deploy [airflow-dags/research@5edcd7b]: (no justification provided) |
[production] |
20:33 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-fe2014'] |
[production] |
20:33 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['thanos-fe2004'] |
[production] |
20:33 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-fe2013'] |
[production] |
20:32 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host dns3002.wikimedia.org with OS bullseye |
[production] |
20:11 |
<brett@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dns3002.wikimedia.org with OS bullseye |
[production] |
20:06 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
19:36 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
19:36 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['thanos-fe2004'] |
[production] |
19:35 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-fe2014'] |
[production] |
19:33 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-fe2013'] |
[production] |
19:32 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['thanos-fe2004'] |
[production] |
19:29 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-fe2014'] |
[production] |
19:28 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-fe2013'] |
[production] |
19:21 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['thanos-fe2004'] |
[production] |
19:19 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host thanos-fe2004.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
19:18 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-fe2014'] |
[production] |
19:18 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host dns3002.wikimedia.org with OS bullseye |
[production] |
19:15 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-fe2013'] |
[production] |
19:14 |
<pt1979@cumin2002> |
END (ERROR) - Cookbook sre.hardware.upgrade-firmware (exit_code=97) upgrade firmware for hosts ['ms-fe2013'] |
[production] |
19:14 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-fe2013'] |
[production] |
19:11 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-fe2014.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |