151-200 of 10000 results (107ms)
2026-04-20 ยง
17:34 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-be1005.eqiad.wmnet with reason: host reimage [production]
17:27 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply [production]
17:27 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-be1005.eqiad.wmnet with reason: host reimage [production]
17:26 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox-video: apply [production]
17:26 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox-timeline: apply [production]
17:25 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox-timeline: apply [production]
17:25 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox-syntaxhighlight: apply [production]
17:24 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox-syntaxhighlight: apply [production]
17:24 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox-media: apply [production]
17:24 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox-media: apply [production]
17:23 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox-constraints: apply [production]
17:23 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox-constraints: apply [production]
17:22 <swfrench@deploy1003> helmfile [codfw] DONE helmfile.d/services/shellbox: apply [production]
17:21 <swfrench@deploy1003> helmfile [codfw] START helmfile.d/services/shellbox: apply [production]
17:18 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox-video: apply [production]
17:18 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudelastic1012.eqiad.wmnet with reason: host reimage [production]
17:18 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox-video: apply [production]
17:18 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox-timeline: apply [production]
17:18 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox-timeline: apply [production]
17:18 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox-syntaxhighlight: apply [production]
17:17 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox-syntaxhighlight: apply [production]
17:17 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox-media: apply [production]
17:17 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox-media: apply [production]
17:17 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox-constraints: apply [production]
17:17 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox-constraints: apply [production]
17:17 <swfrench@deploy1003> helmfile [staging] DONE helmfile.d/services/shellbox: apply [production]
17:16 <swfrench@deploy1003> helmfile [staging] START helmfile.d/services/shellbox: apply [production]
17:15 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host thanos-be1005.eqiad.wmnet with OS bullseye [production]
17:14 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudelastic1012.eqiad.wmnet with reason: host reimage [production]
17:02 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1012.eqiad.wmnet with OS trixie [production]
16:55 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1161 (T419635)', diff saved to https://phabricator.wikimedia.org/P91231 and previous config saved to /var/cache/conftool/dbconfig/20260420-165459-fceratto.json [production]
16:54 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
16:54 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
16:54 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T419635)', diff saved to https://phabricator.wikimedia.org/P91230 and previous config saved to /var/cache/conftool/dbconfig/20260420-165423-fceratto.json [production]
16:52 <moritzm> installing imagemagick security updates [production]
16:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti5006.eqsin.wmnet with OS bookworm [production]
16:48 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM aux-k8s-etcd1003.eqiad.wmnet [production]
16:44 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM aux-k8s-etcd1003.eqiad.wmnet [production]
16:44 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P91229 and previous config saved to /var/cache/conftool/dbconfig/20260420-164415-fceratto.json [production]
16:38 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM aux-k8s-etcd1003.eqiad.wmnet [production]
16:36 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es2036: Moving to another rack [production]
16:34 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM aux-k8s-etcd1003.eqiad.wmnet [production]
16:34 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P91227 and previous config saved to /var/cache/conftool/dbconfig/20260420-163407-fceratto.json [production]
16:33 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM aux-k8s-etcd1003.eqiad.wmnet [production]
16:29 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM aux-k8s-etcd1003.eqiad.wmnet [production]
16:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM backupmon1001.eqiad.wmnet [production]
16:27 <marostegui@dns1004> END - running authdns-update [production]
16:26 <marostegui> Switchover m3 proxy (phabricator) [production]
16:26 <marostegui@dns1004> START - running authdns-update [production]
16:25 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti5006.eqsin.wmnet with reason: host reimage [production]