|
2026-02-20
§
|
| 10:48 |
<jgiannelos@deploy2002> |
helmfile [codfw] START helmfile.d/services/mw-parsoid: apply |
[production] |
| 10:47 |
<jgiannelos@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply |
[production] |
| 10:47 |
<jgiannelos@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply |
[production] |
| 10:38 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reimage for host kubestage2002.codfw.wmnet with OS trixie |
[production] |
| 10:37 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on A:wikikube-staging-worker-codfw |
[production] |
| 10:34 |
<ammarpad@deploy2002> |
mwscript-k8s job started: extensions/Translate/scripts/moveTranslatableBundle.php --wiki=metawiki --reason 'Requested at [[phab:T417862]]' 'Wikimedia Foundation/Advancement/Community Growth/Community Resources and Partnerships' 'Wikimedia Foundation/Advancement/Community Growth/Community Investment and Partnerships' Ammarpad # T417862 |
[production] |
| 10:14 |
<jgiannelos@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply |
[production] |
| 10:13 |
<jgiannelos@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply |
[production] |
| 09:41 |
<hashar> |
Upgraded CI Jenkins from 2.528.3 to 2.541.2 # T417791 |
[production] |
| 08:29 |
<brouberol@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cephosd1003.eqiad.wmnet |
[production] |
| 08:19 |
<brouberol@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host cephosd1003.eqiad.wmnet |
[production] |
| 07:18 |
<arnaudb@cumin1003> |
END (PASS) - Cookbook sre.gerrit.sync-instances (exit_code=0) sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org |
[production] |
| 07:13 |
<arnaudb@cumin1003> |
START - Cookbook sre.gerrit.sync-instances sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org |
[production] |
| 01:49 |
<zabe@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] (duration: 07m 17s) |
[production] |
| 01:45 |
<zabe@deploy2002> |
zabe: Continuing with sync |
[production] |
| 01:44 |
<zabe@deploy2002> |
zabe: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 01:41 |
<zabe@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] |
[production] |
| 00:33 |
<ryankemper> |
[WDQS] Restarted blazegraph on `wdqs1014` as well. all 3 hosts were deadlocked |
[production] |
| 00:32 |
<ryankemper> |
[WDQS] Restarted blazegraph on `wdqs101[1,3]` |
[production] |
|
2026-02-19
§
|
| 23:41 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1241 (T415786)', diff saved to https://phabricator.wikimedia.org/P88911 and previous config saved to /var/cache/conftool/dbconfig/20260219-234101-marostegui.json |
[production] |
| 23:40 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1241.eqiad.wmnet with reason: Maintenance |
[production] |
| 23:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238 (T415786)', diff saved to https://phabricator.wikimedia.org/P88910 and previous config saved to /var/cache/conftool/dbconfig/20260219-234036-marostegui.json |
[production] |
| 23:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P88909 and previous config saved to /var/cache/conftool/dbconfig/20260219-232528-marostegui.json |
[production] |
| 23:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2210 (T415786)', diff saved to https://phabricator.wikimedia.org/P88908 and previous config saved to /var/cache/conftool/dbconfig/20260219-231101-marostegui.json |
[production] |
| 23:10 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2210.codfw.wmnet with reason: Maintenance |
[production] |
| 23:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2206 (T415786)', diff saved to https://phabricator.wikimedia.org/P88907 and previous config saved to /var/cache/conftool/dbconfig/20260219-231037-marostegui.json |
[production] |
| 23:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P88906 and previous config saved to /var/cache/conftool/dbconfig/20260219-231020-marostegui.json |
[production] |
| 23:00 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.hadoop.reboot-workers (exit_code=0) for Hadoop test cluster |
[production] |
| 22:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2206', diff saved to https://phabricator.wikimedia.org/P88904 and previous config saved to /var/cache/conftool/dbconfig/20260219-225529-marostegui.json |
[production] |
| 22:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238 (T415786)', diff saved to https://phabricator.wikimedia.org/P88903 and previous config saved to /var/cache/conftool/dbconfig/20260219-225512-marostegui.json |
[production] |
| 22:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2206', diff saved to https://phabricator.wikimedia.org/P88902 and previous config saved to /var/cache/conftool/dbconfig/20260219-224020-marostegui.json |
[production] |
| 22:32 |
<egardner@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1240814|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] (duration: 07m 31s) |
[production] |
| 22:27 |
<egardner@deploy2002> |
egardner: Continuing with sync |
[production] |
| 22:26 |
<egardner@deploy2002> |
egardner: Backport for [[gerrit:1240814|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 22:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2206 (T415786)', diff saved to https://phabricator.wikimedia.org/P88901 and previous config saved to /var/cache/conftool/dbconfig/20260219-222512-marostegui.json |
[production] |
| 22:24 |
<egardner@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1240814|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] |
[production] |
| 22:24 |
<ryankemper> |
T415696 Will be merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/1237142 shortly, which will permanently decom the LDF endpoint for wdqs services |
[production] |
| 22:16 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop test cluster |
[production] |
| 22:14 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.hadoop.reboot-workers (exit_code=99) for Hadoop test cluster |
[production] |
| 22:14 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop test cluster |
[production] |
| 21:35 |
<cscott@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1240782|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270|Deploy PRV to 19 wikis (T417349)]] (duration: 10m 28s) |
[production] |
| 21:33 |
<jhathaway@dns1004> |
END - running authdns-update |
[production] |
| 21:32 |
<jhathaway@dns1004> |
START - running authdns-update |
[production] |
| 21:31 |
<cscott@deploy2002> |
cscott, arlolra: Continuing with sync |
[production] |
| 21:26 |
<cscott@deploy2002> |
cscott, arlolra: Backport for [[gerrit:1240782|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270|Deploy PRV to 19 wikis (T417349)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:24 |
<cscott@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1240782|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270|Deploy PRV to 19 wikis (T417349)]] |
[production] |
| 21:16 |
<arlolra@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1240773|Update Qids according to communication with communities (v20260219) (T417902)]], [[gerrit:1240779|Fix finding joiner in the face of pwrapping (T411935)]] (duration: 07m 16s) |
[production] |
| 21:12 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2005.codfw.wmnet with OS bookworm |
[production] |
| 21:12 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2004.codfw.wmnet with OS bookworm |
[production] |
| 21:12 |
<arlolra@deploy2002> |
arlolra, aude: Continuing with sync |
[production] |