3251-3300 of 10000 results (129ms)
2025-03-04 ยง
09:41 <elukey@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
09:39 <elukey@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
09:37 <marostegui@cumin1002> dbctl commit (dc=all): 'db2147 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74028 and previous config saved to /var/cache/conftool/dbconfig/20250304-093723-root.json [production]
09:34 <elukey@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
09:33 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
09:33 <elukey@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
09:32 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs5006.eqsin.wmnet with OS bookworm [production]
09:32 <sgimeno@deploy2002> Finished scap sync-world: Backport for [[gerrit:1124362|analytics(HomepageHooks,BeforePageDisplayHandler): log experiment_enrollment interaction on new accounts (T387286)]] (duration: 12m 01s) [production]
09:28 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
09:28 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74027 and previous config saved to /var/cache/conftool/dbconfig/20250304-092839-root.json [production]
09:27 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1155.eqiad.wmnet with reason: Rebuilding index [production]
09:25 <sgimeno@deploy2002> sgimeno: Continuing with sync [production]
09:23 <sgimeno@deploy2002> sgimeno: Backport for [[gerrit:1124362|analytics(HomepageHooks,BeforePageDisplayHandler): log experiment_enrollment interaction on new accounts (T387286)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:23 <jayme@deploy1003> helmfile [staging] DONE helmfile.d/services/ipoid: apply [production]
09:23 <jayme@deploy1003> helmfile [staging] START helmfile.d/services/ipoid: apply [production]
09:22 <jayme@deploy1003> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:22 <marostegui@cumin1002> dbctl commit (dc=all): 'db2147 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74026 and previous config saved to /var/cache/conftool/dbconfig/20250304-092217-root.json [production]
09:21 <jayme@deploy1003> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
09:21 <jayme@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
09:20 <sgimeno@deploy2002> Started scap sync-world: Backport for [[gerrit:1124362|analytics(HomepageHooks,BeforePageDisplayHandler): log experiment_enrollment interaction on new accounts (T387286)]] [production]
09:19 <jayme@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
09:16 <sgimeno@deploy2002> Finished scap sync-world: Backport for [[gerrit:1123607|[Growth] Add mediawiki.product_metrics.growth_product_interaction stream config (T387286)]] (duration: 16m 01s) [production]
09:13 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74025 and previous config saved to /var/cache/conftool/dbconfig/20250304-091334-root.json [production]
09:08 <sgimeno@deploy2002> sgimeno: Continuing with sync [production]
09:07 <marostegui@cumin1002> dbctl commit (dc=all): 'db2147 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74024 and previous config saved to /var/cache/conftool/dbconfig/20250304-090712-root.json [production]
09:05 <sgimeno@deploy2002> sgimeno: Backport for [[gerrit:1123607|[Growth] Add mediawiki.product_metrics.growth_product_interaction stream config (T387286)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:00 <sgimeno@deploy2002> Started scap sync-world: Backport for [[gerrit:1123607|[Growth] Add mediawiki.product_metrics.growth_product_interaction stream config (T387286)]] [production]
09:00 <dcausse@deploy2002> helmfile [codfw] DONE helmfile.d/services/eventgate-main: sync [production]
08:59 <dcausse@deploy2002> helmfile [codfw] START helmfile.d/services/eventgate-main: sync [production]
08:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74023 and previous config saved to /var/cache/conftool/dbconfig/20250304-085829-root.json [production]
08:58 <aborrero@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcontrol1005.eqiad.wmnet [production]
08:57 <dcausse@deploy2002> helmfile [eqiad] DONE helmfile.d/services/eventgate-main: sync [production]
08:56 <dcausse@deploy2002> helmfile [eqiad] START helmfile.d/services/eventgate-main: sync [production]
08:55 <dcausse> restarting eventgate-main to pickup to new streams (T375821) [production]
08:54 <dcausse@deploy2002> Finished scap sync-world: Backport for [[gerrit:1114955|cirrus: add v1 stream for the search update pipeline (T375821)]] (duration: 41m 17s) [production]
08:52 <marostegui@cumin1002> dbctl commit (dc=all): 'db2147 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74022 and previous config saved to /var/cache/conftool/dbconfig/20250304-085207-root.json [production]
08:45 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
08:45 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
08:44 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
08:44 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs5006.eqsin.wmnet with reason: host reimage [production]
08:43 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74021 and previous config saved to /var/cache/conftool/dbconfig/20250304-084325-root.json [production]
08:40 <vgutierrez@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs5006.eqsin.wmnet with reason: host reimage [production]
08:40 <dcausse@deploy2002> dcausse: Continuing with sync [production]
08:29 <dcausse@deploy2002> dcausse: Backport for [[gerrit:1114955|cirrus: add v1 stream for the search update pipeline (T375821)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:28 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74020 and previous config saved to /var/cache/conftool/dbconfig/20250304-082819-root.json [production]
08:24 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on clouddb1015.eqiad.wmnet with reason: Rebuilding index [production]
08:17 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reimage for host lvs5006.eqsin.wmnet with OS bookworm [production]
08:13 <dcausse@deploy2002> Started scap sync-world: Backport for [[gerrit:1114955|cirrus: add v1 stream for the search update pipeline (T375821)]] [production]
08:08 <hashar@deploy2002> sync-world aborted: testwikis to 1.44.0-wmf.19 refs T386214 (duration: 05m 10s) [production]
08:03 <hashar@deploy2002> Started scap sync-world: testwikis to 1.44.0-wmf.19 refs T386214 [production]