2025-06-19
ยง
|
08:31 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti2048.codfw.wmnet to cluster codfw and group B |
[production] |
08:29 |
<mvernon@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: thanos-be[1001-1004].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - mvernon@cumin1003" |
[production] |
08:28 |
<akosiaris@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host aux-k8s-worker2006.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART |
[production] |
08:25 |
<mvernon@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
08:23 |
<akosiaris@cumin1003> |
START - Cookbook sre.hosts.provision for host aux-k8s-worker2006.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART |
[production] |
08:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1179 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P78415 and previous config saved to /var/cache/conftool/dbconfig/20250619-082326-root.json |
[production] |
08:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2168', diff saved to https://phabricator.wikimedia.org/P78414 and previous config saved to /var/cache/conftool/dbconfig/20250619-082320-marostegui.json |
[production] |
08:17 |
<Ammar> |
Ran fixStuckGlobalRename.php for T397384 T397219 T397218 |
[production] |
08:11 |
<hashar@deploy1003> |
rebuilt and synchronized wikiversions files: group2 to 1.45.0-wmf.6 refs T392176 |
[production] |
08:10 |
<mvernon@cumin1003> |
START - Cookbook sre.hosts.decommission for hosts thanos-be[1001-1004].eqiad.wmnet |
[production] |
08:08 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1179 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78413 and previous config saved to /var/cache/conftool/dbconfig/20250619-080820-root.json |
[production] |
08:08 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2168 (T396130)', diff saved to https://phabricator.wikimedia.org/P78412 and previous config saved to /var/cache/conftool/dbconfig/20250619-080812-marostegui.json |
[production] |
08:07 |
<moritzm> |
installing python-tornado security updates |
[production] |
08:02 |
<JJMC89> |
copypatrol-backend-prod-01 deploy 8995949..3d93e79 - security update |
[copypatrol] |
07:56 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1179.eqiad.wmnet with reason: Maintenance |
[production] |
07:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1179', diff saved to https://phabricator.wikimedia.org/P78411 and previous config saved to /var/cache/conftool/dbconfig/20250619-075548-root.json |
[production] |
07:50 |
<moritzm> |
installing glib2.0 security updates |
[production] |
07:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2168 (T396130)', diff saved to https://phabricator.wikimedia.org/P78410 and previous config saved to /var/cache/conftool/dbconfig/20250619-074731-marostegui.json |
[production] |
07:47 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2168.codfw.wmnet with reason: Maintenance |
[production] |
07:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2159 (T396130)', diff saved to https://phabricator.wikimedia.org/P78409 and previous config saved to /var/cache/conftool/dbconfig/20250619-074708-marostegui.json |
[production] |
07:41 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db[1164,1217].eqiad.wmnet with reason: Maintenance |
[production] |
07:41 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.addnode for new host ganeti2048.codfw.wmnet to cluster codfw and group B |
[production] |
07:41 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti2047.codfw.wmnet to cluster codfw and group B |
[production] |
07:39 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.addnode for new host ganeti2047.codfw.wmnet to cluster codfw and group B |
[production] |
07:37 |
<jynus> |
just started es read only backup regeneration T387892 |
[production] |
07:36 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2048.codfw.wmnet |
[production] |
07:33 |
<marostegui> |
Failover m2 from db1164 to db1250 - T397182 |
[production] |
07:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P78407 and previous config saved to /var/cache/conftool/dbconfig/20250619-073201-marostegui.json |
[production] |
07:31 |
<kartik@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1161182|Enable the Contribute menu in Egyptian Arabic, Igbo, and Uzbek]] (duration: 10m 59s) |
[production] |
07:29 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti2048.codfw.wmnet |
[production] |
07:26 |
<slyngshede@cumin1002> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) netbox to netbox-dev2003.codfw.wmnet with reason: Release v4.0.11 to netbox-next - slyngshede@cumin1002 - T397300 |
[production] |
07:25 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2047.codfw.wmnet |
[production] |
07:25 |
<slyngshede@cumin1002> |
START - Cookbook sre.deploy.python-code netbox to netbox-dev2003.codfw.wmnet with reason: Release v4.0.11 to netbox-next - slyngshede@cumin1002 - T397300 |
[production] |
07:25 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2233].codfw.wmnet,db[1164,1217,1250].eqiad.wmnet with reason: Primary switchover m2 T397182 |
[production] |
07:24 |
<kartik@deploy1003> |
kartik: Continuing with sync |
[production] |
07:22 |
<kartik@deploy1003> |
kartik: Backport for [[gerrit:1161182|Enable the Contribute menu in Egyptian Arabic, Igbo, and Uzbek]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
07:20 |
<kartik@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1161182|Enable the Contribute menu in Egyptian Arabic, Igbo, and Uzbek]] |
[production] |
07:18 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti2047.codfw.wmnet |
[production] |
07:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P78405 and previous config saved to /var/cache/conftool/dbconfig/20250619-071654-marostegui.json |
[production] |
07:15 |
<gkyziridis@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1160797|ores-extension: enable extension with revertrisk filter for azwiki (T395824)]] (duration: 11m 50s) |
[production] |
07:08 |
<gkyziridis@deploy1003> |
gkyziridis: Continuing with sync |
[production] |
07:06 |
<gkyziridis@deploy1003> |
gkyziridis: Backport for [[gerrit:1160797|ores-extension: enable extension with revertrisk filter for azwiki (T395824)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
07:04 |
<moritzm> |
installing edk2 security updates |
[production] |
07:04 |
<gkyziridis@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1160797|ores-extension: enable extension with revertrisk filter for azwiki (T395824)]] |
[production] |
07:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2159 (T396130)', diff saved to https://phabricator.wikimedia.org/P78404 and previous config saved to /var/cache/conftool/dbconfig/20250619-070146-marostegui.json |
[production] |
06:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2159 (T396130)', diff saved to https://phabricator.wikimedia.org/P78403 and previous config saved to /var/cache/conftool/dbconfig/20250619-064108-marostegui.json |
[production] |
06:41 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2159.codfw.wmnet with reason: Maintenance |
[production] |
06:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2150 (T396130)', diff saved to https://phabricator.wikimedia.org/P78402 and previous config saved to /var/cache/conftool/dbconfig/20250619-064045-marostegui.json |
[production] |
06:39 |
<stevemunene@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on an-worker1176.eqiad.wmnet with reason: Upgrade an-worker hard drives from 4TB to 8TB group 9 |
[production] |
06:39 |
<stevemunene@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on an-worker1154.eqiad.wmnet with reason: Upgrade an-worker hard drives from 4TB to 8TB group 9 |
[production] |