2025-07-16
|
08:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2241 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P79196 and previous config saved to /var/cache/conftool/dbconfig/20250716-083509-root.json |
[production] |
08:30 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
08:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2156 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P79193 and previous config saved to /var/cache/conftool/dbconfig/20250716-082530-marostegui.json |
[production] |
08:25 |
<root@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |
08:23 |
<jynus@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on backup1007.eqiad.wmnet with reason: Stop minio |
[production] |
08:21 |
<jelto@cumin1003> |
START - Cookbook sre.hosts.reimage for host gitlab1004.wikimedia.org with OS bookworm |
[production] |
08:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2241 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P79192 and previous config saved to /var/cache/conftool/dbconfig/20250716-082004-root.json |
[production] |
08:17 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
08:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1185 (T399249)', diff saved to https://phabricator.wikimedia.org/P79190 and previous config saved to /var/cache/conftool/dbconfig/20250716-081615-marostegui.json |
[production] |
08:16 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db2214.codfw.wmnet with reason: maintenance |
[production] |
08:16 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
08:15 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161 (T399249)', diff saved to https://phabricator.wikimedia.org/P79189 and previous config saved to /var/cache/conftool/dbconfig/20250716-081553-marostegui.json |
[production] |
08:15 |
<jelto@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host gitlab1004.wikimedia.org with OS bookworm |
[production] |
08:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2214 T399533', diff saved to https://phabricator.wikimedia.org/P79187 and previous config saved to /var/cache/conftool/dbconfig/20250716-081350-marostegui.json |
[production] |
08:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db2229 to s6 primary T399533', diff saved to https://phabricator.wikimedia.org/P79186 and previous config saved to /var/cache/conftool/dbconfig/20250716-081302-marostegui.json |
[production] |
08:12 |
<marostegui> |
Starting s6 codfw failover from db2214 to db2229 - T399533 |
[production] |
08:11 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
08:06 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Primary switchover s6 T399533 |
[production] |
08:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db2229 with weight 0 T399533', diff saved to https://phabricator.wikimedia.org/P79185 and previous config saved to /var/cache/conftool/dbconfig/20250716-080639-root.json |
[production] |
08:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2241 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P79184 and previous config saved to /var/cache/conftool/dbconfig/20250716-080458-root.json |
[production] |
08:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P79183 and previous config saved to /var/cache/conftool/dbconfig/20250716-080046-marostegui.json |
[production] |
07:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2241 T399456', diff saved to https://phabricator.wikimedia.org/P79182 and previous config saved to /var/cache/conftool/dbconfig/20250716-075534-marostegui.json |
[production] |
07:55 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
07:54 |
<marostegui> |
Starting x3 codfw failover from db2241 to db2162 - T399456 |
[production] |
07:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db2162 to x3 primary T399456', diff saved to https://phabricator.wikimedia.org/P79181 and previous config saved to /var/cache/conftool/dbconfig/20250716-075448-marostegui.json |
[production] |
07:54 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
07:50 |
<jelto@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gitlab1004.wikimedia.org with reason: host reimage |
[production] |
07:49 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: Primary switchover x3 T399456 |
[production] |
07:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db2162 with weight 0 T399456', diff saved to https://phabricator.wikimedia.org/P79180 and previous config saved to /var/cache/conftool/dbconfig/20250716-074931-root.json |
[production] |
07:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1257 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P79179 and previous config saved to /var/cache/conftool/dbconfig/20250716-074855-root.json |
[production] |
07:46 |
<jelto@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on gitlab1004.wikimedia.org with reason: host reimage |
[production] |
07:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P79178 and previous config saved to /var/cache/conftool/dbconfig/20250716-074538-marostegui.json |
[production] |
07:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1257 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P79177 and previous config saved to /var/cache/conftool/dbconfig/20250716-073349-root.json |
[production] |
07:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161 (T399249)', diff saved to https://phabricator.wikimedia.org/P79176 and previous config saved to /var/cache/conftool/dbconfig/20250716-073031-marostegui.json |
[production] |
07:29 |
<jelto@cumin1003> |
START - Cookbook sre.hosts.reimage for host gitlab1004.wikimedia.org with OS bookworm |
[production] |
07:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P79175 and previous config saved to /var/cache/conftool/dbconfig/20250716-072205-root.json |
[production] |
07:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1257 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P79174 and previous config saved to /var/cache/conftool/dbconfig/20250716-071844-root.json |
[production] |
07:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P79173 and previous config saved to /var/cache/conftool/dbconfig/20250716-070659-root.json |
[production] |
07:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1257 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P79172 and previous config saved to /var/cache/conftool/dbconfig/20250716-070338-root.json |
[production] |
07:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1161 (T399249)', diff saved to https://phabricator.wikimedia.org/P79171 and previous config saved to /var/cache/conftool/dbconfig/20250716-070130-marostegui.json |
[production] |
07:01 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
07:01 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1161.eqiad.wmnet with reason: Maintenance |
[production] |
07:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T399249)', diff saved to https://phabricator.wikimedia.org/P79170 and previous config saved to /var/cache/conftool/dbconfig/20250716-070101-marostegui.json |
[production] |
06:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1257 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P79169 and previous config saved to /var/cache/conftool/dbconfig/20250716-065626-marostegui.json |
[production] |
06:56 |
<root@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1257.eqiad.wmnet with reason: Maintenance |
[production] |
06:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2149 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P79168 and previous config saved to /var/cache/conftool/dbconfig/20250716-065152-root.json |
[production] |
06:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1256 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P79167 and previous config saved to /var/cache/conftool/dbconfig/20250716-064705-root.json |
[production] |
06:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P79166 and previous config saved to /var/cache/conftool/dbconfig/20250716-064553-marostegui.json |
[production] |
06:44 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: Upgrade x3 codfw master |
[production] |
06:39 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Upgrade s6 codfw master |
[production] |