2026-02-24
13:24 <fceratto@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:24 <fceratto@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Deploy manual changes from netbox - fceratto@cumin1003" [production]
13:21 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
13:20 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
13:20 <arnaudb@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) gerrit-replica.discovery.wmnet gerrit-spare.discovery.wmnet on all recursors [production]
13:20 <arnaudb@cumin1003> START - Cookbook sre.dns.wipe-cache gerrit-replica.discovery.wmnet gerrit-spare.discovery.wmnet on all recursors [production]
13:15 <fceratto@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Deploy manual changes from netbox - fceratto@cumin1003" [production]
13:14 <brouberol@deploy2002> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
13:14 <brouberol@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
13:11 <fceratto@cumin1003> START - Cookbook sre.dns.netbox [production]
13:07 <fceratto@dns1004> START - running authdns-update [production]
13:04 <fceratto@dns1004> START - running authdns-update [production]
13:02 <arnaudb@dns1004> START - running authdns-update [production]
13:01 <fceratto@dns1004> START - running authdns-update [production]
12:55 <slyngshede@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp2045.codfw.wmnet with OS trixie [production]
12:52 <fceratto@dns1004> START - running authdns-update [production]
12:40 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
12:38 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
12:38 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
12:38 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
12:37 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . [production]
12:37 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . [production]
12:37 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
12:37 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
12:36 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'readability' for release 'main' . [production]
12:36 <aikochou@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
12:36 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
12:35 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
12:35 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
12:35 <aikochou@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . [production]
12:33 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
12:32 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
12:32 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:30 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
12:30 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'article-descriptions' for release 'main' . [production]
12:30 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'article-models' for release 'main' . [production]
12:29 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'logo-detection' for release 'main' . [production]
12:29 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
12:23 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-cluster (exit_code=0) pool all services in codfw/ml-staging-codfw: maintenance [production]
12:23 <dpogorzelski@cumin1003> START - Cookbook sre.k8s.pool-depool-cluster pool all services in codfw/ml-staging-codfw: maintenance [production]
12:05 <slyngshede@cumin1003> START - Cookbook sre.hosts.reimage for host cp2045.codfw.wmnet with OS trixie [production]
12:05 <slyngshede@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2045.codfw.wmnet with OS trixie [production]
11:52 <slyngshede@cumin1003> START - Cookbook sre.hosts.reimage for host cp2045.codfw.wmnet with OS trixie [production]
11:52 <slyngshede@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2045.codfw.wmnet with OS trixie [production]
11:48 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1204.eqiad.wmnet [production]
11:42 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1260 (T415786)', diff saved to https://phabricator.wikimedia.org/P89008 and previous config saved to /var/cache/conftool/dbconfig/20260224-114242-marostegui.json [production]
11:42 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1260.eqiad.wmnet with reason: Maintenance [production]
11:42 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1252 (T415786)', diff saved to https://phabricator.wikimedia.org/P89007 and previous config saved to /var/cache/conftool/dbconfig/20260224-114217-marostegui.json [production]
11:40 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host an-worker1204.eqiad.wmnet [production]
11:38 <fceratto@cumin1003> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host dborch1003.eqiad.wmnet [production]