2025-10-07
07:12 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host es1050.eqiad.wmnet with OS bookworm [production]
07:11 <dcausse@deploy2002> dcausse: Backport for [[gerrit:1193052|cirrus: stop copying ores weighted_tags (T389053)]], [[gerrit:1193092|cirrus: test completion with default sort on simplewiki [2/3] (T404858)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:10 <marostegui@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host es1050.eqiad.wmnet with OS bookworm [production]
07:05 <dcausse@deploy2002> Started scap sync-world: Backport for [[gerrit:1193052|cirrus: stop copying ores weighted_tags (T389053)]], [[gerrit:1193092|cirrus: test completion with default sort on simplewiki [2/3] (T404858)]] [production]
06:58 <marostegui@cumin1003> dbctl commit (dc=all): 'db2219 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P83623 and previous config saved to /var/cache/conftool/dbconfig/20251007-065825-root.json [production]
06:50 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db2219 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83622 and previous config saved to /var/cache/conftool/dbconfig/20251007-065019-marostegui.json [production]
06:50 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2219.codfw.wmnet with reason: Maintenance [production]
06:44 <kart_> Updated cxserver to 2025-10-06-084053-production (T394982, T403574) [production]
06:42 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
06:42 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
06:41 <ryankemper@cumin2002> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
06:40 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
06:40 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
06:37 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host es1050.eqiad.wmnet with OS bookworm [production]
06:35 <marostegui@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1050.eqiad.wmnet with OS bookworm [production]
06:30 <marostegui@cumin1003> dbctl commit (dc=all): 'db2237 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P83621 and previous config saved to /var/cache/conftool/dbconfig/20251007-063014-root.json [production]
06:24 <moritzm> rebalance Ganeti eqiad/B following vmscape reboots [production]
06:24 <moritzm> rebalance Ganeti codfw/B following vmscape reboots [production]
06:15 <marostegui@cumin1003> dbctl commit (dc=all): 'db2237 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P83620 and previous config saved to /var/cache/conftool/dbconfig/20251007-061509-root.json [production]
06:07 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
06:06 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
06:00 <marostegui@cumin1003> dbctl commit (dc=all): 'db2237 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P83619 and previous config saved to /var/cache/conftool/dbconfig/20251007-060003-root.json [production]
05:52 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host es1050.eqiad.wmnet with OS bookworm [production]
05:44 <marostegui@cumin1003> dbctl commit (dc=all): 'db2237 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P83618 and previous config saved to /var/cache/conftool/dbconfig/20251007-054457-root.json [production]
05:36 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2237.codfw.wmnet with reason: Maintenance [production]
05:36 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db2237 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83617 and previous config saved to /var/cache/conftool/dbconfig/20251007-053628-root.json [production]
05:36 <root@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2237.codfw.wmnet with reason: Maintenance [production]
05:03 <ryankemper@cumin2002> START - Cookbook sre.wdqs.restart [production]
05:02 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1018.eqiad.wmnet with OS bullseye [production]
04:02 <mwpresync@deploy2002> Pruned MediaWiki: 1.45.0-wmf.19 (duration: 02m 32s) [production]
03:48 <mwpresync@deploy2002> Finished scap sync-world: testwikis to 1.45.0-wmf.22 refs T405678 (duration: 45m 18s) [production]
03:03 <mwpresync@deploy2002> Started scap sync-world: testwikis to 1.45.0-wmf.22 refs T405678 [production]
01:14 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 13m 28s) [production]
01:01 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
00:27 <andrew@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcontrol2005-dev.codfw.wmnet with OS trixie [production]
2025-10-06
23:35 <jdlrobson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1193932|tempUserBanner: Set `relative` position to enable `z-index` (T404122)]] (duration: 11m 30s) [production]
23:30 <jdlrobson@deploy2002> jdlrobson: Continuing with sync [production]
23:29 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
23:28 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
23:28 <jdlrobson@deploy2002> jdlrobson: Backport for [[gerrit:1193932|tempUserBanner: Set `relative` position to enable `z-index` (T404122)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
23:23 <jdlrobson@deploy2002> Started scap sync-world: Backport for [[gerrit:1193932|tempUserBanner: Set `relative` position to enable `z-index` (T404122)]] [production]
23:13 <jdlrobson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1193447|Remove old, unused ArticleSummaries Stream (T406361)]] (duration: 09m 47s) [production]
23:08 <jdlrobson@deploy2002> jdlrobson, lmora: Continuing with sync [production]
23:07 <jdlrobson@deploy2002> jdlrobson, lmora: Backport for [[gerrit:1193447|Remove old, unused ArticleSummaries Stream (T406361)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
23:03 <jhancock@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS bookworm [production]
23:03 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2017.codfw.wmnet with OS bullseye [production]
23:03 <jdlrobson@deploy2002> Started scap sync-world: Backport for [[gerrit:1193447|Remove old, unused ArticleSummaries Stream (T406361)]] [production]
22:49 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1020.eqiad.wmnet with OS bullseye [production]
22:48 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1020.eqiad.wmnet with OS bullseye [production]
22:43 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]