|
2026-02-02
§
|
| 06:23 |
<marostegui> |
Starting s2 eqiad failover from db1222 to db1162 - T415983 |
[production] |
| 06:22 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db1162 with weight 0 T415983', diff saved to https://phabricator.wikimedia.org/P88358 and previous config saved to /var/cache/conftool/dbconfig/20260202-062212-marostegui.json |
[production] |
| 06:21 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 26 hosts with reason: Primary switchover s2 T415983 |
[production] |
| 06:14 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2161.codfw.wmnet with reason: long schema change |
[production] |
| 06:13 |
<marostegui@dns1006> |
END - running authdns-update |
[production] |
| 06:13 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db2161 T415748', diff saved to https://phabricator.wikimedia.org/P88357 and previous config saved to /var/cache/conftool/dbconfig/20260202-061310-marostegui.json |
[production] |
| 06:12 |
<marostegui@dns1006> |
START - running authdns-update |
[production] |
| 06:12 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db2165 to s8 primary and set section read-write T415748', diff saved to https://phabricator.wikimedia.org/P88356 and previous config saved to /var/cache/conftool/dbconfig/20260202-061217-marostegui.json |
[production] |
| 06:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set s8 codfw as read-only for maintenance - T415748', diff saved to https://phabricator.wikimedia.org/P88355 and previous config saved to /var/cache/conftool/dbconfig/20260202-061150-marostegui.json |
[production] |
| 06:11 |
<marostegui> |
Starting s8 codfw failover from db2161 to db2165 - T415748 |
[production] |
| 06:04 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db2165 with weight 0 T415748', diff saved to https://phabricator.wikimedia.org/P88354 and previous config saved to /var/cache/conftool/dbconfig/20260202-060437-marostegui.json |
[production] |
| 06:02 |
<marostegui> |
Deploy schema change on old s8 eqiad master db1193 T411164 T411163 |
[production] |
| 05:59 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db1193.eqiad.wmnet with reason: long schema change |
[production] |
| 05:57 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1193 T416107', diff saved to https://phabricator.wikimedia.org/P88353 and previous config saved to /var/cache/conftool/dbconfig/20260202-055755-marostegui.json |
[production] |
| 05:57 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db1209 to s8 primary T416107', diff saved to https://phabricator.wikimedia.org/P88352 and previous config saved to /var/cache/conftool/dbconfig/20260202-055717-marostegui.json |
[production] |
| 05:56 |
<marostegui> |
Starting s8 eqiad failover from db1193 to db1209 - T416107 |
[production] |
| 05:53 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Primary switchover s8 T416107 |
[production] |
| 05:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db1209 with weight 0 T416107', diff saved to https://phabricator.wikimedia.org/P88351 and previous config saved to /var/cache/conftool/dbconfig/20260202-055304-marostegui.json |
[production] |
| 02:14 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 13m 22s) |
[production] |
| 02:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
|
2026-01-31
§
|
| 21:49 |
<James_F> |
Deleted Jenkins's job entry for castor-save-workspace-cache 6193776 and this seems to have unstuck things for T416078? |
[releng] |
| 21:45 |
<James_F> |
Running `sudo systemctl restart jenkins` on contint for T416078 |
[releng] |
| 21:44 |
<James_F> |
Fighting T416078, took integration-castor-5 offline, disconnected, sshed in to kill threads, then reconnected; no change in aspect. |
[releng] |
| 19:03 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/1235380 |
[releng] |
| 02:23 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 22m 42s) |
[production] |
| 02:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
| 00:31 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
| 00:31 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226 (T415786)', diff saved to https://phabricator.wikimedia.org/P88349 and previous config saved to /var/cache/conftool/dbconfig/20260131-003142-marostegui.json |
[production] |
| 00:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P88348 and previous config saved to /var/cache/conftool/dbconfig/20260131-001634-marostegui.json |
[production] |
| 00:01 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P88347 and previous config saved to /var/cache/conftool/dbconfig/20260131-000125-marostegui.json |
[production] |
|
2026-01-30
§
|
| 23:46 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226 (T415786)', diff saved to https://phabricator.wikimedia.org/P88346 and previous config saved to /var/cache/conftool/dbconfig/20260130-234616-marostegui.json |
[production] |
| 23:36 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1226 (T415786)', diff saved to https://phabricator.wikimedia.org/P88345 and previous config saved to /var/cache/conftool/dbconfig/20260130-233652-marostegui.json |
[production] |
| 23:36 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1226.eqiad.wmnet with reason: Maintenance |
[production] |
| 23:36 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T415786)', diff saved to https://phabricator.wikimedia.org/P88344 and previous config saved to /var/cache/conftool/dbconfig/20260130-233638-marostegui.json |
[production] |
| 23:21 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P88343 and previous config saved to /var/cache/conftool/dbconfig/20260130-232129-marostegui.json |
[production] |
| 23:06 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P88342 and previous config saved to /var/cache/conftool/dbconfig/20260130-230620-marostegui.json |
[production] |
| 22:51 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T415786)', diff saved to https://phabricator.wikimedia.org/P88341 and previous config saved to /var/cache/conftool/dbconfig/20260130-225111-marostegui.json |
[production] |
| 22:41 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1214 (T415786)', diff saved to https://phabricator.wikimedia.org/P88340 and previous config saved to /var/cache/conftool/dbconfig/20260130-224108-marostegui.json |
[production] |
| 22:41 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1214.eqiad.wmnet with reason: Maintenance |
[production] |
| 22:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1209 (T415786)', diff saved to https://phabricator.wikimedia.org/P88339 and previous config saved to /var/cache/conftool/dbconfig/20260130-224043-marostegui.json |
[production] |
| 22:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1209', diff saved to https://phabricator.wikimedia.org/P88338 and previous config saved to /var/cache/conftool/dbconfig/20260130-222534-marostegui.json |
[production] |
| 22:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1209', diff saved to https://phabricator.wikimedia.org/P88337 and previous config saved to /var/cache/conftool/dbconfig/20260130-221025-marostegui.json |
[production] |
| 21:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1209 (T415786)', diff saved to https://phabricator.wikimedia.org/P88336 and previous config saved to /var/cache/conftool/dbconfig/20260130-215517-marostegui.json |
[production] |
| 21:45 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1209 (T415786)', diff saved to https://phabricator.wikimedia.org/P88335 and previous config saved to /var/cache/conftool/dbconfig/20260130-214548-marostegui.json |
[production] |
| 21:45 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1209.eqiad.wmnet with reason: Maintenance |
[production] |
| 21:45 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1203 (T415786)', diff saved to https://phabricator.wikimedia.org/P88334 and previous config saved to /var/cache/conftool/dbconfig/20260130-214534-marostegui.json |
[production] |
| 21:30 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P88333 and previous config saved to /var/cache/conftool/dbconfig/20260130-213025-marostegui.json |
[production] |
| 21:15 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P88332 and previous config saved to /var/cache/conftool/dbconfig/20260130-211516-marostegui.json |
[production] |