2023-10-27
10:14 |
<fabfur@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp1101.eqiad.wmnet with reason: host reimage |
[production] |
10:14 |
<jiji@deploy2002> |
helmfile [codfw] START helmfile.d/services/tegola-vector-tiles: apply |
[production] |
10:14 |
<jiji@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/tegola-vector-tiles: apply |
[production] |
10:13 |
<jiji@deploy2002> |
helmfile [eqiad] START helmfile.d/services/tegola-vector-tiles: apply |
[production] |
09:59 |
<fabfur@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp1101.eqiad.wmnet with OS bullseye |
[production] |
09:59 |
<fabfur@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1101.eqiad.wmnet with OS bullseye |
[production] |
09:34 |
<fabfur@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp1101.eqiad.wmnet with OS bullseye |
[production] |
09:19 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) |
[production] |
09:19 |
<btullis@cumin1001> |
Added views for new wiki: tlywiki T345169 |
[production] |
09:02 |
<moritzm> |
deployment-prep app servers are now using ICU67/Unicode 13 |
[production] |
08:49 |
<moritzm> |
uploaded libxml2 2.9.4+dfsg1-7+deb10u6+icu67+wmf1 to component/icu67 for buster-wikimedia (rebase of the ICU compat patches on top of the latest buster security update for libxml2) T345561 |
[production] |
08:48 |
<btullis@cumin1001> |
START - Cookbook sre.wikireplicas.add-wiki |
[production] |
08:41 |
<moritzm> |
downgrading dh-python on build2001 to the version which is in Bullseye. Before, 5.20230130~bpo11+1 was installed from bullseye-backports, but that version has dropped the python2 sequence we still need for some Buster builds |
[production] |
08:25 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudmetrics1004.eqiad.wmnet with OS bookworm |
[production] |
08:10 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudmetrics1004.eqiad.wmnet with reason: host reimage |
[production] |
08:07 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudmetrics1004.eqiad.wmnet with reason: host reimage |
[production] |
07:55 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudmetrics1004.eqiad.wmnet with OS bookworm |
[production] |
07:54 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudmetrics1003.eqiad.wmnet with OS bookworm |
[production] |
07:54 |
<ayounsi@cumin1001> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest1004.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
07:48 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudmetrics1004.eqiad.wmnet with reason: cloudmetrics1003 reimage |
[production] |
07:48 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudmetrics1004.eqiad.wmnet with reason: cloudmetrics1003 reimage |
[production] |
07:39 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudmetrics1003.eqiad.wmnet with reason: host reimage |
[production] |
07:36 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudmetrics1003.eqiad.wmnet with reason: host reimage |
[production] |
07:32 |
<ayounsi@cumin1001> |
START - Cookbook sre.hosts.provision for host sretest1004.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
07:24 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudmetrics1003.eqiad.wmnet with OS bookworm |
[production] |
06:12 |
<cmooney@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2003.codfw.wmnet with OS bullseye |
[production] |
01:49 |
<cstone> |
civicrm upgraded from 70e0b88d to 74781efd |
[production] |
2023-10-26
22:49 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dns2006.wikimedia.org with OS bookworm |
[production] |
22:10 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dns2006.wikimedia.org with reason: host reimage |
[production] |
22:07 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dns2006.wikimedia.org with reason: host reimage |
[production] |
21:47 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host dns2006.wikimedia.org with OS bookworm |
[production] |
21:45 |
<ebernhardson@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
21:45 |
<ebernhardson@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
21:32 |
<cstone> |
payments-wiki upgraded from f7407053 to 04428d6e |
[production] |
21:16 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on cloudvirt-wdqs1001.eqiad.wmnet with reason: still trying to get nova to schedule hosts there |
[production] |
21:16 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on cloudvirt-wdqs1001.eqiad.wmnet with reason: still trying to get nova to schedule hosts there |
[production] |
21:12 |
<taavi@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host cloudvirt-wdqs1001.eqiad.wmnet |
[production] |
21:00 |
<taavi@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host cloudvirt-wdqs1001.eqiad.wmnet |
[production] |
20:45 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt-wdqs1001.eqiad.wmnet with OS bookworm |
[production] |
20:45 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - taavi@cumin1001" |
[production] |
20:44 |
<cstone> |
payments-wiki upgraded from f7407053 to 99b330be |
[production] |
20:44 |
<taavi@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - taavi@cumin1001" |
[production] |
20:42 |
<brennen> |
end of utc late backport & config window |
[production] |
20:42 |
<brennen@deploy2002> |
Finished scap: Backport for [[gerrit:969151|OIDC: Return '' instead of null for email in profile (T283456)]] (duration: 07m 25s) |
[production] |
20:41 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dns2005.wikimedia.org with OS bookworm |
[production] |
20:37 |
<brennen@deploy2002> |
brennen and tgr: Continuing with sync |
[production] |
20:36 |
<brennen@deploy2002> |
brennen and tgr: Backport for [[gerrit:969151|OIDC: Return '' instead of null for email in profile (T283456)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:35 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "cloudvirt-wdqs1001 - taavi@cumin1001" |
[production] |
20:34 |
<brennen@deploy2002> |
Started scap: Backport for [[gerrit:969151|OIDC: Return '' instead of null for email in profile (T283456)]] |
[production] |
20:34 |
<taavi@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "cloudvirt-wdqs1001 - taavi@cumin1001" |
[production] |