|
2025-11-04
§
|
| 07:52 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
| 07:52 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2176 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P84694 and previous config saved to /var/cache/conftool/dbconfig/20251104-075213-root.json |
[production] |
| 07:48 |
<ozge@deploy2002> |
helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply |
[production] |
| 07:47 |
<ozge@deploy2002> |
helmfile [staging] START helmfile.d/services/linkrecommendation: apply |
[production] |
| 07:37 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2176 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P84693 and previous config saved to /var/cache/conftool/dbconfig/20251104-073707-root.json |
[production] |
| 07:28 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db2176 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P84692 and previous config saved to /var/cache/conftool/dbconfig/20251104-072854-marostegui.json |
[production] |
| 07:28 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2176.codfw.wmnet with reason: Maintenance |
[production] |
| 07:22 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P84691 and previous config saved to /var/cache/conftool/dbconfig/20251104-072201-marostegui.json |
[production] |
| 07:06 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1161 (T407997)', diff saved to https://phabricator.wikimedia.org/P84690 and previous config saved to /var/cache/conftool/dbconfig/20251104-070653-marostegui.json |
[production] |
| 07:03 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1161 (T407997)', diff saved to https://phabricator.wikimedia.org/P84689 and previous config saved to /var/cache/conftool/dbconfig/20251104-070356-marostegui.json |
[production] |
| 07:03 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
| 07:03 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1161.eqiad.wmnet with reason: Maintenance |
[production] |
| 07:03 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T407997)', diff saved to https://phabricator.wikimedia.org/P84688 and previous config saved to /var/cache/conftool/dbconfig/20251104-070311-marostegui.json |
[production] |
| 06:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P84687 and previous config saved to /var/cache/conftool/dbconfig/20251104-064803-marostegui.json |
[production] |
| 06:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P84686 and previous config saved to /var/cache/conftool/dbconfig/20251104-063253-marostegui.json |
[production] |
| 06:17 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T407997)', diff saved to https://phabricator.wikimedia.org/P84685 and previous config saved to /var/cache/conftool/dbconfig/20251104-061745-marostegui.json |
[production] |
| 06:14 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1159 (T407997)', diff saved to https://phabricator.wikimedia.org/P84684 and previous config saved to /var/cache/conftool/dbconfig/20251104-061449-marostegui.json |
[production] |
| 06:14 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1159.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:12 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2204.codfw.wmnet with reason: Maintenance |
[production] |
| 05:02 |
<mwpresync@deploy2002> |
Pruned MediaWiki: 1.45.0-wmf.23 (duration: 02m 28s) |
[production] |
| 04:51 |
<eileen> |
civicrm upgraded from c9f9d2b5 to 77cad331 |
[production] |
| 03:03 |
<inflatador> |
bking@cumin2002 restart wdqs-blazegraph.service in CODFW to apply 1201326 T409132 |
[production] |
| 02:30 |
<eileen> |
civicrm upgraded from 1c0619b6 to c9f9d2b5 |
[production] |
| 00:58 |
<eileen> |
civicrm upgraded from 025f3ef3 to 1c0619b6 |
[production] |
| 00:32 |
<zabe@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1199522|Using Hadoop for MostTranscludedPages on enwiki (T309738)]] (duration: 09m 05s) |
[production] |
| 00:26 |
<zabe@deploy2002> |
zabe: Continuing with sync |
[production] |
| 00:25 |
<zabe@deploy2002> |
zabe: Backport for [[gerrit:1199522|Using Hadoop for MostTranscludedPages on enwiki (T309738)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 00:23 |
<zabe@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1199522|Using Hadoop for MostTranscludedPages on enwiki (T309738)]] |
[production] |
| 00:10 |
<cdanis@dns1004> |
END - running authdns-update |
[production] |
| 00:09 |
<cdanis@dns1004> |
START - running authdns-update |
[production] |
| 00:05 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) |
[production] |
| 00:04 |
<dzahn@dns1004> |
END - running authdns-update |
[production] |
| 00:03 |
<dzahn@dns1004> |
START - running authdns-update |
[production] |
|
2025-11-03
§
|
| 23:40 |
<eileen> |
civicrm upgraded from b0c68b4a to 025f3ef3 |
[production] |
| 23:01 |
<inflatador> |
bking@cumin2002 repool wdqs2008 and 2012 |
[production] |
| 22:56 |
<inflatador> |
bking@cumin2002 depool wdqs2008 and 2012 so they can catch up on lag |
[production] |
| 22:54 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.restart |
[production] |
| 22:54 |
<ryankemper@cumin2002> |
END (ERROR) - Cookbook sre.wdqs.restart (exit_code=97) |
[production] |
| 22:54 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.restart |
[production] |
| 22:51 |
<ryankemper> |
[WDQS] Restarting all codfw wdqs-main hosts; we're getting slammed by increased triple count (same issue we've been seeing intermittently for a week or two) |
[production] |
| 22:28 |
<eileen> |
civicrm upgraded from 29d3c24f to b0c68b4a |
[production] |
| 22:16 |
<arlolra@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1199880|Deploy Parsoid Read Views to 7 wikis (T408765)]] (duration: 08m 01s) |
[production] |
| 22:11 |
<arlolra@deploy2002> |
arlolra: Continuing with sync |
[production] |
| 22:10 |
<arlolra@deploy2002> |
arlolra: Backport for [[gerrit:1199880|Deploy Parsoid Read Views to 7 wikis (T408765)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 22:08 |
<arlolra@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1199880|Deploy Parsoid Read Views to 7 wikis (T408765)]] |
[production] |
| 22:07 |
<inflatador> |
bking@cumin2002 suppress wdqs2009 alerts for next 90 days T409117 |
[production] |
| 22:06 |
<bking@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 90 days, 0:00:00 on wdqs2009.codfw.wmnet with reason: no SLO for this endpoint |
[production] |
| 22:01 |
<arlolra@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1200475|[enwikivoyage] Enable block feature for AbuseFilter (T408885)]], [[gerrit:1200400|zhwiki: Add SecurePoll Rights to CheckUser (T408902)]] (duration: 07m 05s) |
[production] |
| 21:56 |
<arlolra@deploy2002> |
superpes, zhaofjx, arlolra: Continuing with sync |
[production] |
| 21:56 |
<arlolra@deploy2002> |
superpes, zhaofjx, arlolra: Backport for [[gerrit:1200475|[enwikivoyage] Enable block feature for AbuseFilter (T408885)]], [[gerrit:1200400|zhwiki: Add SecurePoll Rights to CheckUser (T408902)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |