1-50 of 10000 results (25ms)
2026-02-20 ยง
12:16 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host kubestage2004.codfw.wmnet with OS trixie [production]
12:15 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage2003.codfw.wmnet with OS trixie [production]
12:00 <dhinus> DROP DATABASE toollabs_p; (was used by updatetools.py, see T415383) [admin]
11:54 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestage2003.codfw.wmnet with reason: host reimage [production]
11:52 <jiji@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
11:51 <jiji@deploy2002> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
11:51 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
11:50 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
11:50 <jiji@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply [production]
11:49 <jiji@deploy2002> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
11:49 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
11:49 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
11:47 <jayme@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestage2003.codfw.wmnet with reason: host reimage [production]
11:44 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2219 (T415786)', diff saved to https://phabricator.wikimedia.org/P88918 and previous config saved to /var/cache/conftool/dbconfig/20260220-114437-marostegui.json [production]
11:44 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2219.codfw.wmnet with reason: Maintenance [production]
11:44 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2210 (T415786)', diff saved to https://phabricator.wikimedia.org/P88917 and previous config saved to /var/cache/conftool/dbconfig/20260220-114412-marostegui.json [production]
11:29 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2210', diff saved to https://phabricator.wikimedia.org/P88916 and previous config saved to /var/cache/conftool/dbconfig/20260220-112903-marostegui.json [production]
11:28 <jiji@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
11:28 <jiji@deploy2002> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
11:26 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host kubestage2003.codfw.wmnet with OS trixie [production]
11:20 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage2002.codfw.wmnet with OS trixie [production]
11:13 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2210', diff saved to https://phabricator.wikimedia.org/P88915 and previous config saved to /var/cache/conftool/dbconfig/20260220-111355-marostegui.json [production]
11:04 <wmftkbot> Test Kitchen edge-unique experiments (poll 157950) - adds: none; removes: mobile-toc-abc; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
11:03 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestage2002.codfw.wmnet with reason: host reimage [production]
10:58 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2210 (T415786)', diff saved to https://phabricator.wikimedia.org/P88914 and previous config saved to /var/cache/conftool/dbconfig/20260220-105847-marostegui.json [production]
10:57 <jayme@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestage2002.codfw.wmnet with reason: host reimage [production]
10:48 <jgiannelos@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
10:48 <jgiannelos@deploy2002> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
10:47 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
10:47 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
10:38 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host kubestage2002.codfw.wmnet with OS trixie [production]
10:37 <jayme@cumin1003> START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on A:wikikube-staging-worker-codfw [production]
10:36 <filippo@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) by 10 cores, 40 gigabytes, 20480 ram (T416803) [glamwikidashboard]
10:36 <filippo@cloudcumin1001> START - Cookbook wmcs.openstack.quota_increase by 10 cores, 40 gigabytes, 20480 ram (T416803) [glamwikidashboard]
10:34 <ammarpad@deploy2002> mwscript-k8s job started: extensions/Translate/scripts/moveTranslatableBundle.php --wiki=metawiki --reason 'Requested at [[phab:T417862]]' 'Wikimedia Foundation/Advancement/Community Growth/Community Resources and Partnerships' 'Wikimedia Foundation/Advancement/Community Growth/Community Investment and Partnerships' Ammarpad # T417862 [production]
10:14 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
10:13 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
09:41 <hashar> Upgraded CI Jenkins from 2.528.3 to 2.541.2 # T417791 [production]
08:34 <wm-bot2> Deployment completed: https://github.com/cluebotng/component-configs/actions/runs/22216991615 (https://github.com/cluebotng/component-configs/commits/64d521535aa35454c28900f70009efc0e9ff4a10) [tools.cluebotng-review]
08:29 <brouberol@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cephosd1003.eqiad.wmnet [production]
08:19 <brouberol@cumin1003> START - Cookbook sre.hosts.reboot-single for host cephosd1003.eqiad.wmnet [production]
07:18 <arnaudb@cumin1003> END (PASS) - Cookbook sre.gerrit.sync-instances (exit_code=0) sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org [production]
07:13 <arnaudb@cumin1003> START - Cookbook sre.gerrit.sync-instances sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org [production]
05:06 <wm-bot2> Deployment completed: https://github.com/cluebotng/component-configs/actions/runs/22212024170 (https://github.com/cluebotng/component-configs/commits/9b0508c1c5a875dd795c865e67f2a93d4f247597) [tools.cluebotng-review]
01:49 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] (duration: 07m 17s) [production]
01:45 <zabe@deploy2002> zabe: Continuing with sync [production]
01:44 <zabe@deploy2002> zabe: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
01:41 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1239497|Start reading from new file tables on mediawikiwiki (T416548)]] [production]
00:33 <ryankemper> [WDQS] Restarted blazegraph on `wdqs1014` as well. all 3 hosts were deadlocked [production]
00:32 <ryankemper> [WDQS] Restarted blazegraph on `wdqs101[1,3]` [production]