751-800 of 10000 results (27ms)
2025-07-30 ยง
09:53 <jynus@cumin1003> START - Cookbook sre.hosts.remove-downtime for db[1204-1205].eqiad.wmnet [production]
09:44 <gkyziridis@deploy1003> helmfile [ml-serve-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
09:43 <gkyziridis@deploy1003> helmfile [ml-serve-eqiad] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
09:43 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
09:38 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P80284 and previous config saved to /var/cache/conftool/dbconfig/20250730-093839-fceratto.json [production]
09:37 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
09:30 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
09:23 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1199 (T399728)', diff saved to https://phabricator.wikimedia.org/P80283 and previous config saved to /var/cache/conftool/dbconfig/20250730-092332-fceratto.json [production]
09:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1199 (T399728)', diff saved to https://phabricator.wikimedia.org/P80282 and previous config saved to /var/cache/conftool/dbconfig/20250730-091829-fceratto.json [production]
09:18 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1199.eqiad.wmnet with reason: Maintenance [production]
09:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1190 (T399728)', diff saved to https://phabricator.wikimedia.org/P80281 and previous config saved to /var/cache/conftool/dbconfig/20250730-091817-fceratto.json [production]
09:16 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2010.codfw.wmnet with OS bookworm [production]
09:16 <elukey@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - elukey@cumin1003" [production]
09:16 <elukey@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - elukey@cumin1003" [production]
09:11 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:11 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
09:10 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db[1204-1205].eqiad.wmnet with reason: upgrade mariadb [production]
09:08 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component logging [toolsbeta]
09:03 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2010.codfw.wmnet with reason: host reimage [production]
09:03 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P80280 and previous config saved to /var/cache/conftool/dbconfig/20250730-090309-fceratto.json [production]
08:59 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for db[2183-2184].codfw.wmnet [production]
08:59 <jynus@cumin1003> START - Cookbook sre.hosts.remove-downtime for db[2183-2184].codfw.wmnet [production]
08:59 <elukey@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2010.codfw.wmnet with reason: host reimage [production]
08:53 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component logging [toolsbeta]
08:48 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P80279 and previous config saved to /var/cache/conftool/dbconfig/20250730-084800-fceratto.json [production]
08:38 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
08:38 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2184.codfw.wmnet with reason: replication will stop [production]
08:36 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2183.codfw.wmnet with reason: upgrade mariadb [production]
08:36 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS bookworm [production]
08:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1190 (T399728)', diff saved to https://phabricator.wikimedia.org/P80278 and previous config saved to /var/cache/conftool/dbconfig/20250730-083252-fceratto.json [production]
08:32 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [tools]
08:28 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
08:28 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
08:27 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1190 (T399728)', diff saved to https://phabricator.wikimedia.org/P80276 and previous config saved to /var/cache/conftool/dbconfig/20250730-082758-fceratto.json [production]
08:27 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1190.eqiad.wmnet with reason: Maintenance [production]
08:27 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1160 (T399728)', diff saved to https://phabricator.wikimedia.org/P80275 and previous config saved to /var/cache/conftool/dbconfig/20250730-082735-fceratto.json [production]
08:22 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [tools]
08:22 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
08:13 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]
08:12 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1160', diff saved to https://phabricator.wikimedia.org/P80274 and previous config saved to /var/cache/conftool/dbconfig/20250730-081228-fceratto.json [production]
08:09 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest2010.codfw.wmnet with OS bookworm [production]
08:05 <mlitn@deploy1003> Finished scap sync-world: Backport for [[gerrit:1171239|Add new MediaSearch config/coefficients (T385286)]] (duration: 09m 42s) [production]
08:03 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS bookworm [production]
08:03 <jelto@cumin1003> END (PASS) - Cookbook sre.gitlab.failover (exit_code=0) Failover of gitlab from gitlab2002.wikimedia.org to gitlab1004.wikimedia.org [production]
08:01 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2010.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
08:00 <mlitn@deploy1003> mlitn: Continuing with sync [production]
07:58 <mlitn@deploy1003> mlitn: Backport for [[gerrit:1171239|Add new MediaSearch config/coefficients (T385286)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:57 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1160', diff saved to https://phabricator.wikimedia.org/P80273 and previous config saved to /var/cache/conftool/dbconfig/20250730-075720-fceratto.json [production]
07:56 <jelto@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) 'https://gitlab.wikimedia.org/ https://gitlab-replica-b.wikimedia.org/' on all recursors [production]
07:56 <jelto@cumin1003> START - Cookbook sre.dns.wipe-cache 'https://gitlab.wikimedia.org/ https://gitlab-replica-b.wikimedia.org/' on all recursors [production]