2026-02-20
10:48 <jgiannelos@deploy2002> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
10:47 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
10:47 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
10:38 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host kubestage2002.codfw.wmnet with OS trixie [production]
10:37 <jayme@cumin1003> START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on A:wikikube-staging-worker-codfw [production]
10:34 <ammarpad@deploy2002> mwscript-k8s job started: extensions/Translate/scripts/moveTranslatableBundle.php --wiki=metawiki --reason 'Requested at [[phab:T417862]]' 'Wikimedia Foundation/Advancement/Community Growth/Community Resources and Partnerships' 'Wikimedia Foundation/Advancement/Community Growth/Community Investment and Partnerships' Ammarpad # T417862 [production]
10:14 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
10:13 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
09:41 <hashar> Upgraded CI Jenkins from 2.528.3 to 2.541.2 # T417791 [production]
08:29 <brouberol@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cephosd1003.eqiad.wmnet [production]
08:19 <brouberol@cumin1003> START - Cookbook sre.hosts.reboot-single for host cephosd1003.eqiad.wmnet [production]
07:18 <arnaudb@cumin1003> END (PASS) - Cookbook sre.gerrit.sync-instances (exit_code=0) sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org [production]
07:13 <arnaudb@cumin1003> START - Cookbook sre.gerrit.sync-instances sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org [production]
01:49 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] (duration: 07m 17s) [production]
01:45 <zabe@deploy2002> zabe: Continuing with sync [production]
01:44 <zabe@deploy2002> zabe: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
01:41 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] [production]
00:33 <ryankemper> [WDQS] Restarted blazegraph on `wdqs1014` as well. all 3 hosts were deadlocked [production]
00:32 <ryankemper> [WDQS] Restarted blazegraph on `wdqs101[1,3]` [production]
2026-02-19
23:41 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1241 (T415786)', diff saved to https://phabricator.wikimedia.org/P88911 and previous config saved to /var/cache/conftool/dbconfig/20260219-234101-marostegui.json [production]
23:40 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1241.eqiad.wmnet with reason: Maintenance [production]
23:40 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1238 (T415786)', diff saved to https://phabricator.wikimedia.org/P88910 and previous config saved to /var/cache/conftool/dbconfig/20260219-234036-marostegui.json [production]
23:25 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P88909 and previous config saved to /var/cache/conftool/dbconfig/20260219-232528-marostegui.json [production]
23:11 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2210 (T415786)', diff saved to https://phabricator.wikimedia.org/P88908 and previous config saved to /var/cache/conftool/dbconfig/20260219-231101-marostegui.json [production]
23:10 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2210.codfw.wmnet with reason: Maintenance [production]
23:10 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2206 (T415786)', diff saved to https://phabricator.wikimedia.org/P88907 and previous config saved to /var/cache/conftool/dbconfig/20260219-231037-marostegui.json [production]
23:10 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P88906 and previous config saved to /var/cache/conftool/dbconfig/20260219-231020-marostegui.json [production]
23:00 <ryankemper@cumin2002> END (PASS) - Cookbook sre.hadoop.reboot-workers (exit_code=0) for Hadoop test cluster [production]
22:55 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2206', diff saved to https://phabricator.wikimedia.org/P88904 and previous config saved to /var/cache/conftool/dbconfig/20260219-225529-marostegui.json [production]
22:55 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1238 (T415786)', diff saved to https://phabricator.wikimedia.org/P88903 and previous config saved to /var/cache/conftool/dbconfig/20260219-225512-marostegui.json [production]
22:40 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2206', diff saved to https://phabricator.wikimedia.org/P88902 and previous config saved to /var/cache/conftool/dbconfig/20260219-224020-marostegui.json [production]
22:32 <egardner@deploy2002> Finished scap sync-world: Backport for [[gerrit:1240814|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] (duration: 07m 31s) [production]
22:27 <egardner@deploy2002> egardner: Continuing with sync [production]
22:26 <egardner@deploy2002> egardner: Backport for [[gerrit:1240814|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
22:25 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2206 (T415786)', diff saved to https://phabricator.wikimedia.org/P88901 and previous config saved to /var/cache/conftool/dbconfig/20260219-222512-marostegui.json [production]
22:24 <egardner@deploy2002> Started scap sync-world: Backport for [[gerrit:1240814|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] [production]
22:24 <ryankemper> T415696 Will be merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/1237142 shortly, which will permanently decom the LDF endpoint for wdqs services [production]
22:16 <ryankemper@cumin2002> START - Cookbook sre.hadoop.reboot-workers for Hadoop test cluster [production]
22:14 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hadoop.reboot-workers (exit_code=99) for Hadoop test cluster [production]
22:14 <ryankemper@cumin2002> START - Cookbook sre.hadoop.reboot-workers for Hadoop test cluster [production]
21:35 <cscott@deploy2002> Finished scap sync-world: Backport for [[gerrit:1240782|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270|Deploy PRV to 19 wikis (T417349)]] (duration: 10m 28s) [production]
21:33 <jhathaway@dns1004> END - running authdns-update [production]
21:32 <jhathaway@dns1004> START - running authdns-update [production]
21:31 <cscott@deploy2002> cscott, arlolra: Continuing with sync [production]
21:26 <cscott@deploy2002> cscott, arlolra: Backport for [[gerrit:1240782|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270|Deploy PRV to 19 wikis (T417349)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
21:24 <cscott@deploy2002> Started scap sync-world: Backport for [[gerrit:1240782|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270|Deploy PRV to 19 wikis (T417349)]] [production]
21:16 <arlolra@deploy2002> Finished scap sync-world: Backport for [[gerrit:1240773|Update Qids according to communication with communities (v20260219) (T417902)]], [[gerrit:1240779|Fix finding joiner in the face of pwrapping (T411935)]] (duration: 07m 16s) [production]
21:12 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2005.codfw.wmnet with OS bookworm [production]
21:12 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2004.codfw.wmnet with OS bookworm [production]
21:12 <arlolra@deploy2002> arlolra, aude: Continuing with sync [production]