2026-04-15 §
11:27 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
11:24 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2158 (T419961)', diff saved to https://phabricator.wikimedia.org/P90754 and previous config saved to /var/cache/conftool/dbconfig/20260415-112445-fceratto.json [production]
11:24 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2158.codfw.wmnet with reason: Maintenance [production]
11:24 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2151 (T419961)', diff saved to https://phabricator.wikimedia.org/P90753 and previous config saved to /var/cache/conftool/dbconfig/20260415-112413-fceratto.json [production]
11:20 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:19 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P90752 and previous config saved to /var/cache/conftool/dbconfig/20260415-111905-fceratto.json [production]
11:18 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
11:17 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:14 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P90751 and previous config saved to /var/cache/conftool/dbconfig/20260415-111405-fceratto.json [production]
11:12 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
11:11 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:08 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P90750 and previous config saved to /var/cache/conftool/dbconfig/20260415-110856-fceratto.json [production]
11:05 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2069.codfw.wmnet with reason: host reimage [production]
11:03 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P90749 and previous config saved to /var/cache/conftool/dbconfig/20260415-110357-fceratto.json [production]
11:01 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
10:59 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2069.codfw.wmnet with reason: host reimage [production]
10:58 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1203 (T419635)', diff saved to https://phabricator.wikimedia.org/P90748 and previous config saved to /var/cache/conftool/dbconfig/20260415-105848-fceratto.json [production]
10:53 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2151 (T419961)', diff saved to https://phabricator.wikimedia.org/P90747 and previous config saved to /var/cache/conftool/dbconfig/20260415-105349-fceratto.json [production]
10:53 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1203 (T419635)', diff saved to https://phabricator.wikimedia.org/P90746 and previous config saved to /var/cache/conftool/dbconfig/20260415-105338-fceratto.json [production]
10:53 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1203.eqiad.wmnet with reason: Maintenance [production]
10:53 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1193 (T419635)', diff saved to https://phabricator.wikimedia.org/P90745 and previous config saved to /var/cache/conftool/dbconfig/20260415-105314-fceratto.json [production]
10:45 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2151 (T419961)', diff saved to https://phabricator.wikimedia.org/P90744 and previous config saved to /var/cache/conftool/dbconfig/20260415-104535-fceratto.json [production]
10:45 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2151.codfw.wmnet with reason: Maintenance [production]
10:44 <taavi@dns1004> END - running authdns-update [production]
10:43 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1193', diff saved to https://phabricator.wikimedia.org/P90743 and previous config saved to /var/cache/conftool/dbconfig/20260415-104306-fceratto.json [production]
10:42 <taavi@dns1004> START - running authdns-update [production]
10:39 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2069.codfw.wmnet with OS trixie [production]
10:37 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 365 days, 0:00:00 on dborch1001.wikimedia.org with reason: T416582 [production]
10:32 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1193', diff saved to https://phabricator.wikimedia.org/P90742 and previous config saved to /var/cache/conftool/dbconfig/20260415-103258-fceratto.json [production]
10:29 <mvernon@cumin2002> END (FAIL) - Cookbook sre.swift.convert-disks (exit_code=99) for host ms-be2069 [production]
10:29 <mvernon@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2069.codfw.wmnet with OS bullseye [production]
10:22 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1193 (T419635)', diff saved to https://phabricator.wikimedia.org/P90741 and previous config saved to /var/cache/conftool/dbconfig/20260415-102250-fceratto.json [production]
10:19 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1193 (T419635)', diff saved to https://phabricator.wikimedia.org/P90740 and previous config saved to /var/cache/conftool/dbconfig/20260415-101942-fceratto.json [production]
10:19 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1193.eqiad.wmnet with reason: Maintenance [production]
10:19 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1192 (T419635)', diff saved to https://phabricator.wikimedia.org/P90739 and previous config saved to /var/cache/conftool/dbconfig/20260415-101917-fceratto.json [production]
10:10 <jayme@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2280.codfw.wmnet [production]
10:10 <jayme@cumin2002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2280.codfw.wmnet [production]
10:10 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for wikikube-worker2280.codfw.wmnet [production]
10:10 <jayme@cumin2002> START - Cookbook sre.hosts.remove-downtime for wikikube-worker2280.codfw.wmnet [production]
10:10 <elukey> upgrade spicerack on cumin nodes [production]
10:09 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1192', diff saved to https://phabricator.wikimedia.org/P90738 and previous config saved to /var/cache/conftool/dbconfig/20260415-100908-fceratto.json [production]
10:08 <elukey> uploaded spicerack_12.4.0 to apt.wikimedia.org bookworm-wikimedia [production]
10:00 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
09:59 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2069.codfw.wmnet with reason: host reimage [production]
09:59 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1192', diff saved to https://phabricator.wikimedia.org/P90737 and previous config saved to /var/cache/conftool/dbconfig/20260415-095901-fceratto.json [production]
09:58 <jayme@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on wikikube-worker2280.codfw.wmnet with reason: hardware issues [production]
09:56 <jayme@cumin2002> END (FAIL) - Cookbook sre.k8s.pool-depool-node (exit_code=99) depool for host wikikube-worker2280.codfw.wmnet [production]
09:53 <jayme@cumin2002> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker2280.codfw.wmnet [production]
09:53 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2069.codfw.wmnet with reason: host reimage [production]
09:51 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]