301-350 of 10000 results (100ms)
2025-11-12 ยง
14:47 <fceratto@cumin1003> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host db-test1001.eqiad.wmnet [production]
14:47 <marostegui@cumin1003> conftool action : set/pooled=yes; selector: name=clouddb1013.eqiad.wmnet,service=s3 [production]
14:47 <fceratto@cumin1003> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host db-test1002.eqiad.wmnet [production]
14:46 <fceratto@cumin1003> START - Cookbook sre.ganeti.makevm for new host db-test1002.eqiad.wmnet [production]
14:46 <fceratto@cumin1003> START - Cookbook sre.ganeti.makevm for new host db-test1001.eqiad.wmnet [production]
14:45 <fceratto@cumin1003> START - Cookbook sre.dns.netbox [production]
14:45 <fceratto@cumin1003> START - Cookbook sre.ganeti.makevm for new host db-test1003.eqiad.wmnet [production]
14:33 <kharlan@deploy2002> kharlan: Continuing with sync [production]
14:27 <kharlan@deploy2002> kharlan: Backport for [[gerrit:1204367|hCaptcha instrumentation: Log editor_interface for editAttempStep (T409701)]], [[gerrit:1204576|Support an "always challenge" SiteKey when shouldForceShowCaptcha is enabled (T405595)]], [[gerrit:1204581|hCaptcha: Define configuration for "always challenge" mode (T405595)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can [production]
14:14 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db-test2001.codfw.wmnet with reason: Clone T400056 [production]
14:13 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2230.codfw.wmnet with reason: Clone T400056 [production]
14:01 <kharlan@deploy2002> Started scap sync-world: Backport for [[gerrit:1204367|hCaptcha instrumentation: Log editor_interface for editAttempStep (T409701)]], [[gerrit:1204576|Support an "always challenge" SiteKey when shouldForceShowCaptcha is enabled (T405595)]], [[gerrit:1204581|hCaptcha: Define configuration for "always challenge" mode (T405595)]] [production]
13:55 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host ml-serve1012 [production]
13:50 <elukey@cumin2002> START - Cookbook sre.hosts.powercycle for host ml-serve1012 [production]
13:48 <taavi> updating cr firewall policy with new caprica definitions, to pick up new clouddb hosts [production]
13:31 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 100%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85288 and previous config saved to /var/cache/conftool/dbconfig/20251112-133127-root.json [production]
13:24 <klausman@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve1012.eqiad.wmnet with OS trixie [production]
13:23 <moritzm> installing glib2.0 security updates [production]
13:16 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 75%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85287 and previous config saved to /var/cache/conftool/dbconfig/20251112-131621-root.json [production]
13:04 <taavi@deploy2002> Finished scap sync-world: Backport for [[gerrit:1204376|reverse-proxy: Add new eqiad/codfw per-rack subnets]], [[gerrit:1204377|Add script to update reverse-proxy.php]] (duration: 09m 51s) [production]
13:03 <klausman@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1012.eqiad.wmnet with reason: host reimage [production]
13:01 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 50%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85286 and previous config saved to /var/cache/conftool/dbconfig/20251112-130115-root.json [production]
13:00 <klausman@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1012.eqiad.wmnet with reason: host reimage [production]
12:59 <taavi@deploy2002> taavi: Continuing with sync [production]
12:57 <taavi@deploy2002> taavi: Backport for [[gerrit:1204376|reverse-proxy: Add new eqiad/codfw per-rack subnets]], [[gerrit:1204377|Add script to update reverse-proxy.php]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
12:54 <taavi@deploy2002> Started scap sync-world: Backport for [[gerrit:1204376|reverse-proxy: Add new eqiad/codfw per-rack subnets]], [[gerrit:1204377|Add script to update reverse-proxy.php]] [production]
12:47 <klausman@cumin1003> START - Cookbook sre.hosts.reimage for host ml-serve1012.eqiad.wmnet with OS trixie [production]
12:46 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 45%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85285 and previous config saved to /var/cache/conftool/dbconfig/20251112-124609-root.json [production]
12:43 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.clone (exit_code=99) of db2230.codfw.wmnet onto db-test2001.codfw.wmnet [production]
12:41 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1013,1022].eqiad.wmnet with reason: Cloning clouddb1022:s3 [production]
12:41 <marostegui@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1013.eqiad.wmnet,service=s3 [production]
12:31 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 40%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85282 and previous config saved to /var/cache/conftool/dbconfig/20251112-123103-root.json [production]
12:30 <kart_> Updated cxserver to 2025-11-12-114324-production (T408515) [production]
12:21 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
12:20 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
12:20 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
12:19 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
12:15 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 35%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85281 and previous config saved to /var/cache/conftool/dbconfig/20251112-121557-root.json [production]
12:14 <topranks> shut down link from ssw1-d8-eqiad ethernet-1/28 <-> asw2-c7-eqiad et-7/0/49 to observe results T409800 [production]
12:14 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
12:14 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
12:09 <mvolz@deploy2002> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
12:09 <mvolz@deploy2002> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
12:08 <mvolz@deploy2002> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
12:07 <mvolz@deploy2002> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
12:06 <mvolz@deploy2002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
12:05 <mvolz@deploy2002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
12:00 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 30%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85280 and previous config saved to /var/cache/conftool/dbconfig/20251112-120051-root.json [production]
11:45 <marostegui@cumin1003> dbctl commit (dc=all): 'es1033 (re)pooling @ 25%: Moved it to es7', diff saved to https://phabricator.wikimedia.org/P85279 and previous config saved to /var/cache/conftool/dbconfig/20251112-114545-root.json [production]
11:44 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]