|
2026-02-02
§
|
| 08:56 |
<kharlan@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1235458|BlockUtils: Log x-provenance and IP reputation fields (T415354)]] (duration: 10m 05s) |
[production] |
| 08:50 |
<kharlan@deploy2002> |
kharlan: Continuing with sync |
[production] |
| 08:48 |
<kharlan@deploy2002> |
kharlan: Backport for [[gerrit:1235458|BlockUtils: Log x-provenance and IP reputation fields (T415354)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T415786)', diff saved to https://phabricator.wikimedia.org/P88363 and previous config saved to /var/cache/conftool/dbconfig/20260202-084806-marostegui.json |
[production] |
| 08:46 |
<kharlan@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1235458|BlockUtils: Log x-provenance and IP reputation fields (T415354)]] |
[production] |
| 08:45 |
<kharlan@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1233651|Enable watchlist labels everywhere (prod and beta) (T413967)]] (duration: 41m 47s) |
[production] |
| 08:31 |
<kharlan@deploy2002> |
kharlan, samwilson: Continuing with sync |
[production] |
| 08:27 |
<kharlan@deploy2002> |
kharlan, samwilson: Backport for [[gerrit:1233651|Enable watchlist labels everywhere (prod and beta) (T413967)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:12 |
<moritzm> |
installing openssl security updates |
[production] |
| 08:09 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1150.eqiad.wmnet with reason: Maintenance |
[production] |
| 08:04 |
<kharlan@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1233651|Enable watchlist labels everywhere (prod and beta) (T413967)]] |
[production] |
| 08:02 |
<joal> |
Restarting druid middle-managers to recover from OOM - T415799 |
[production] |
| 06:33 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2149 (T415786)', diff saved to https://phabricator.wikimedia.org/P88361 and previous config saved to /var/cache/conftool/dbconfig/20260202-063304-marostegui.json |
[production] |
| 06:32 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2149.codfw.wmnet with reason: Maintenance |
[production] |
| 06:27 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1222.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1222 T415983', diff saved to https://phabricator.wikimedia.org/P88360 and previous config saved to /var/cache/conftool/dbconfig/20260202-062554-marostegui.json |
[production] |
| 06:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db1162 to s2 primary T415983', diff saved to https://phabricator.wikimedia.org/P88359 and previous config saved to /var/cache/conftool/dbconfig/20260202-062522-marostegui.json |
[production] |
| 06:23 |
<marostegui> |
Starting s2 eqiad failover from db1222 to db1162 - T415983 |
[production] |
| 06:22 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db1162 with weight 0 T415983', diff saved to https://phabricator.wikimedia.org/P88358 and previous config saved to /var/cache/conftool/dbconfig/20260202-062212-marostegui.json |
[production] |
| 06:21 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 26 hosts with reason: Primary switchover s2 T415983 |
[production] |
| 06:14 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2161.codfw.wmnet with reason: long schema change |
[production] |
| 06:13 |
<marostegui@dns1006> |
END - running authdns-update |
[production] |
| 06:13 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db2161 T415748', diff saved to https://phabricator.wikimedia.org/P88357 and previous config saved to /var/cache/conftool/dbconfig/20260202-061310-marostegui.json |
[production] |
| 06:12 |
<marostegui@dns1006> |
START - running authdns-update |
[production] |
| 06:12 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db2165 to s8 primary and set section read-write T415748', diff saved to https://phabricator.wikimedia.org/P88356 and previous config saved to /var/cache/conftool/dbconfig/20260202-061217-marostegui.json |
[production] |
| 06:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set s8 codfw as read-only for maintenance - T415748', diff saved to https://phabricator.wikimedia.org/P88355 and previous config saved to /var/cache/conftool/dbconfig/20260202-061150-marostegui.json |
[production] |
| 06:11 |
<marostegui> |
Starting s8 codfw failover from db2161 to db2165 - T415748 |
[production] |
| 06:04 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db2165 with weight 0 T415748', diff saved to https://phabricator.wikimedia.org/P88354 and previous config saved to /var/cache/conftool/dbconfig/20260202-060437-marostegui.json |
[production] |
| 06:02 |
<marostegui> |
Deploy schema change on old s8 eqiad master db1193 T411164 T411163 |
[production] |
| 05:59 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db1193.eqiad.wmnet with reason: long schema change |
[production] |
| 05:57 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1193 T416107', diff saved to https://phabricator.wikimedia.org/P88353 and previous config saved to /var/cache/conftool/dbconfig/20260202-055755-marostegui.json |
[production] |
| 05:57 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db1209 to s8 primary T416107', diff saved to https://phabricator.wikimedia.org/P88352 and previous config saved to /var/cache/conftool/dbconfig/20260202-055717-marostegui.json |
[production] |
| 05:56 |
<marostegui> |
Starting s8 eqiad failover from db1193 to db1209 - T416107 |
[production] |
| 05:53 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Primary switchover s8 T416107 |
[production] |
| 05:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db1209 with weight 0 T416107', diff saved to https://phabricator.wikimedia.org/P88351 and previous config saved to /var/cache/conftool/dbconfig/20260202-055304-marostegui.json |
[production] |
| 02:14 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 13m 22s) |
[production] |
| 02:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
|
2026-01-31
§
|
| 21:49 |
<James_F> |
Deleted Jenkins's job entry for castor-save-workspace-cache 6193776 and this seems to have unstuck things for T416078? |
[releng] |
| 21:45 |
<James_F> |
Running `sudo systemctl restart jenkins` on contint for T416078 |
[releng] |
| 21:44 |
<James_F> |
Fighting T416078, took integration-castor-5 offline, disconnected, sshed in to kill threads, then reconnected; no change in aspect. |
[releng] |
| 19:03 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/1235380 |
[releng] |
| 02:23 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 22m 42s) |
[production] |
| 02:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
| 00:31 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
| 00:31 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226 (T415786)', diff saved to https://phabricator.wikimedia.org/P88349 and previous config saved to /var/cache/conftool/dbconfig/20260131-003142-marostegui.json |
[production] |
| 00:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P88348 and previous config saved to /var/cache/conftool/dbconfig/20260131-001634-marostegui.json |
[production] |
| 00:01 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P88347 and previous config saved to /var/cache/conftool/dbconfig/20260131-000125-marostegui.json |
[production] |