|
2025-12-02
|
| 16:23 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:20 |
<jhathaway@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:19 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:18 |
<swfrench-wmf> |
restarted navtiming on webperf1003 - T352245 |
[production] |
| 16:14 |
<swfrench-wmf> |
begin rolling restarts of eqiad-associated confds - T352245 |
[production] |
| 16:12 |
<moritzm> |
installing nodejs security updates |
[production] |
| 16:12 |
<swfrench@deploy2002> |
Unlocked for deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 (duration: 03m 45s) |
[production] |
| 16:12 |
<jhathaway@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:10 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest1005.eqiad.wmnet with OS bookworm |
[production] |
| 16:08 |
<swfrench@deploy2002> |
Locking from deployment [MediaWiki]: Hold deployments during etcd certificate change - T352245 |
[production] |
| 16:08 |
<swfrench-wmf> |
migrating etcd to PKI certs on conf1008 - T352245 |
[production] |
| 16:08 |
<jhathaway@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 16:02 |
<moritzm> |
installing libsndfile security updates |
[production] |
| 16:01 |
<jhathaway@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest1005.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 16:00 |
<gehel> |
restarting wdqs@codfw - system overloaded |
[production] |
| 15:58 |
<jhathaway@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on sretest1005.eqiad.wmnet with reason: ipxe |
[production] |
| 15:50 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool db1251 gradually with 4 steps - Pool db1251.eqiad.wmnet in after cloning |
[production] |
| 15:48 |
<moritzm> |
upgrade Envoy on Yarn T405808 |
[production] |
| 15:45 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1088.eqiad.wmnet with OS bullseye |
[production] |
| 15:29 |
<mvernon@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1088.eqiad.wmnet with reason: host reimage |
[production] |
| 15:25 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1088.eqiad.wmnet with reason: host reimage |
[production] |
| 15:13 |
<moritzm> |
upgrade Envoy on Turnilo T405808 |
[production] |
| 15:12 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.reimage for host ms-be1088.eqiad.wmnet with OS bullseye |
[production] |
| 14:51 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
| 14:47 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1213988|[Growth] Enable Add Link for 3 wikis (T407818)]] (duration: 07m 46s) |
[production] |
| 14:43 |
<urbanecm@deploy2002> |
urbanecm: Continuing with sync |
[production] |
| 14:41 |
<urbanecm@deploy2002> |
urbanecm: Backport for [[gerrit:1213988|[Growth] Enable Add Link for 3 wikis (T407818)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:41 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1198 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86314 and previous config saved to /var/cache/conftool/dbconfig/20251202-144148-marostegui.json |
[production] |
| 14:41 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1198.eqiad.wmnet with reason: Maintenance |
[production] |
| 14:41 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1189 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86313 and previous config saved to /var/cache/conftool/dbconfig/20251202-144123-marostegui.json |
[production] |
| 14:39 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1213988|[Growth] Enable Add Link for 3 wikis (T407818)]] |
[production] |
| 14:35 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti-test2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 14:30 |
<derick@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1214041|user: Mark users created with User::addToDatabase() as primary (T410652)]] (duration: 08m 34s) |
[production] |
| 14:28 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.provision for host ganeti-test2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 14:26 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P86312 and previous config saved to /var/cache/conftool/dbconfig/20251202-142616-marostegui.json |
[production] |
| 14:26 |
<derick@deploy2002> |
d3r1ck01, derick: Continuing with sync |
[production] |
| 14:25 |
<derick@deploy2002> |
d3r1ck01, derick: Backport for [[gerrit:1214041|user: Mark users created with User::addToDatabase() as primary (T410652)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:21 |
<derick@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1214041|user: Mark users created with User::addToDatabase() as primary (T410652)]] |
[production] |
| 14:21 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti-test2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 14:18 |
<lucaswerkmeister-wmde@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1208357|Growth: Enable Revise Tone feature on pilot wikis (T409606)]] (duration: 13m 03s) |
[production] |
| 14:14 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.provision for host ganeti-test2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 14:13 |
<lucaswerkmeister-wmde@deploy2002> |
lucaswerkmeister-wmde, migr: Continuing with sync |
[production] |
| 14:12 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti-test2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 14:11 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.provision for host ganeti-test2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 14:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P86311 and previous config saved to /var/cache/conftool/dbconfig/20251202-141108-marostegui.json |
[production] |
| 14:11 |
<ayounsi@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on ganeti-test2001.codfw.wmnet with reason: test CR1207804 |
[production] |
| 14:10 |
<jgleeson> |
payments-wiki upgraded from b405d6db to 6d39e545 |
[production] |
| 14:07 |
<lucaswerkmeister-wmde@deploy2002> |
lucaswerkmeister-wmde, migr: Backport for [[gerrit:1208357|Growth: Enable Revise Tone feature on pilot wikis (T409606)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:05 |
<lucaswerkmeister-wmde@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1208357|Growth: Enable Revise Tone feature on pilot wikis (T409606)]] |
[production] |
| 13:58 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1251 - Depool db1251.eqiad.wmnet to then clone it to db1169.eqiad.wmnet - marostegui@cumin1003 |
[production] |