|
2026-06-02
ยง
|
| 11:19 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P93527 and previous config saved to /var/cache/conftool/dbconfig/20260602-111954-fceratto.json |
[production] |
| 11:15 |
<cwilliams@cumin1003> |
dbctl commit (dc=all): 'Depool db2161 T427892', diff saved to https://phabricator.wikimedia.org/P93525 and previous config saved to /var/cache/conftool/dbconfig/20260602-111511-cwilliams.json |
[production] |
| 11:12 |
<cwilliams@cumin1003> |
dbctl commit (dc=all): 'Promote db2165 to s8 primary T427892', diff saved to https://phabricator.wikimedia.org/P93524 and previous config saved to /var/cache/conftool/dbconfig/20260602-111200-cwilliams.json |
[production] |
| 11:10 |
<cezmunsta> |
Starting s8 codfw failover from db2161 to db2165 - T427892 |
[production] |
| 11:09 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P93523 and previous config saved to /var/cache/conftool/dbconfig/20260602-110947-fceratto.json |
[production] |
| 11:09 |
<blake@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1057.eqiad.wmnet with OS trixie |
[production] |
| 11:09 |
<blake@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1056.eqiad.wmnet with OS trixie |
[production] |
| 11:04 |
<cwilliams@cumin1003> |
dbctl commit (dc=all): 'Set db2165 with weight 0 T427892', diff saved to https://phabricator.wikimedia.org/P93522 and previous config saved to /var/cache/conftool/dbconfig/20260602-110420-cwilliams.json |
[production] |
| 11:03 |
<cwilliams@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 26 hosts with reason: Primary switchover s8 T427892 |
[production] |
| 11:01 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool pool es2056: repool after upgrade |
[production] |
| 11:01 |
<marostegui@cumin1003> |
END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) |
[production] |
| 10:59 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1161 (T426633)', diff saved to https://phabricator.wikimedia.org/P93520 and previous config saved to /var/cache/conftool/dbconfig/20260602-105939-fceratto.json |
[production] |
| 10:52 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db1161 (T426633)', diff saved to https://phabricator.wikimedia.org/P93519 and previous config saved to /var/cache/conftool/dbconfig/20260602-105239-fceratto.json |
[production] |
| 10:52 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
| 10:52 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1161.eqiad.wmnet with reason: Maintenance |
[production] |
| 10:52 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T426633)', diff saved to https://phabricator.wikimedia.org/P93518 and previous config saved to /var/cache/conftool/dbconfig/20260602-105202-fceratto.json |
[production] |
| 10:45 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2056.codfw.wmnet with OS trixie |
[production] |
| 10:42 |
<moritzm> |
installing busybox security updates |
[production] |
| 10:42 |
<claime> |
Enabling puppet on A:cp-text for ATS rest-gateway cleanup - T422937 |
[production] |
| 10:41 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P93517 and previous config saved to /var/cache/conftool/dbconfig/20260602-104154-fceratto.json |
[production] |
| 10:31 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P93516 and previous config saved to /var/cache/conftool/dbconfig/20260602-103146-fceratto.json |
[production] |
| 10:28 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2056.codfw.wmnet with reason: host reimage |
[production] |
| 10:27 |
<claime> |
Disabling puppet on A:cp-text for ATS rest-gateway cleanup - T422937 |
[production] |
| 10:25 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on es2056.codfw.wmnet with reason: host reimage |
[production] |
| 10:21 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T426633)', diff saved to https://phabricator.wikimedia.org/P93515 and previous config saved to /var/cache/conftool/dbconfig/20260602-102139-fceratto.json |
[production] |
| 10:09 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host es2056.codfw.wmnet with OS trixie |
[production] |
| 10:08 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool es2056: Upgrading es2056.codfw.wmnet |
[production] |
| 10:08 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool es2056: Upgrading es2056.codfw.wmnet |
[production] |
| 10:08 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.major-upgrade |
[production] |
| 10:06 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/eventstreams-internal: apply |
[production] |
| 10:06 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/eventstreams-internal: apply |
[production] |
| 09:56 |
<claime> |
Enabling puppet on A:cp-text for ATS rest-gateway cleanup - T422937 |
[production] |
| 09:46 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on cumin2003.codfw.wmnet with reason: in setup |
[production] |
| 09:45 |
<fceratto@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1187: Pooling |
[production] |
| 09:37 |
<claime> |
Running puppet on cp6010 and cp6011 - T422937 |
[production] |
| 09:37 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of netflow2004.codfw.wmnet to plain |
[production] |
| 09:37 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db1159 (T426633)', diff saved to https://phabricator.wikimedia.org/P93511 and previous config saved to /var/cache/conftool/dbconfig/20260602-093716-fceratto.json |
[production] |
| 09:37 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1159.eqiad.wmnet with reason: Maintenance |
[production] |
| 09:35 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of netflow2004.codfw.wmnet to plain |
[production] |
| 09:34 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of rpki2003.codfw.wmnet to plain |
[production] |
| 09:34 |
<claime> |
Disabling puppet on A:cp-text for ATS rest-gateway cleanup - T422937 |
[production] |
| 09:34 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of rpki2003.codfw.wmnet to plain |
[production] |
| 09:32 |
<moritzm> |
temporarily remove ganeti2045 from the codfw cluster T427357 |
[production] |
| 09:30 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1055.eqiad.wmnet with OS trixie |
[production] |
| 09:15 |
<fceratto@cumin1003> |
START - Cookbook sre.mysql.pool pool db1187: Pooling |
[production] |
| 09:14 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1055.eqiad.wmnet with reason: host reimage |
[production] |
| 09:11 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1187 (T426633)', diff saved to https://phabricator.wikimedia.org/P93508 and previous config saved to /var/cache/conftool/dbconfig/20260602-091126-fceratto.json |
[production] |
| 09:09 |
<blake@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1055.eqiad.wmnet with reason: host reimage |
[production] |
| 09:04 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db1187 (T426633)', diff saved to https://phabricator.wikimedia.org/P93506 and previous config saved to /var/cache/conftool/dbconfig/20260602-090432-fceratto.json |
[production] |
| 09:04 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance |
[production] |