2026-03-11
13:35 <bking@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
13:30 <jdlrobson@deploy2002> jdlrobson, sfaci: Continuing with sync [production]
13:29 <jdlrobson@deploy2002> jdlrobson, sfaci: Backport for [[gerrit:1247547|Remove `MetricsPlatform` configuration from production (T416865)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:22 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir4004.ulsfo.wmnet with reason: host reimage [production]
13:18 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum4003.ulsfo.wmnet with reason: host reimage [production]
13:18 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir4004.ulsfo.wmnet with reason: host reimage [production]
13:13 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum4003.ulsfo.wmnet with reason: host reimage [production]
13:08 <jdlrobson@deploy2002> Started scap sync-world: Backport for [[gerrit:1247547|Remove `MetricsPlatform` configuration from production (T416865)]] [production]
13:00 <moritzm> installing libcommons-lang3-java security updates [production]
12:57 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ncredir4004.ulsfo.wmnet with OS bookworm [production]
12:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir4003.ulsfo.wmnet with OS bookworm [production]
12:46 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host durum4003.ulsfo.wmnet with OS trixie [production]
12:46 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM durum4003.ulsfo.wmnet - jmm@cumin2002" [production]
12:46 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM durum4003.ulsfo.wmnet - jmm@cumin2002" [production]
12:46 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) durum4003.ulsfo.wmnet on all recursors [production]
12:45 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache durum4003.ulsfo.wmnet on all recursors [production]
12:45 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:45 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM durum4003.ulsfo.wmnet - jmm@cumin2002" [production]
12:41 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM durum4003.ulsfo.wmnet - jmm@cumin2002" [production]
12:37 <moritzm> installing inetutils security updates [production]
12:36 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
12:36 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host durum4003.ulsfo.wmnet [production]
12:35 <tappof> completed migration from prometheus4002 to prometheus4003 (ulsfo) (T419430) [production]
12:34 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir4003.ulsfo.wmnet with reason: host reimage [production]
12:28 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir4003.ulsfo.wmnet with reason: host reimage [production]
12:28 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1010.eqiad.wmnet with OS bookworm [production]
12:24 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2073.codfw.wmnet with OS bullseye [production]
12:23 <btullis@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host dse-k8s-worker1010.eqiad.wmnet with OS bookworm [production]
12:18 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow4003.ulsfo.wmnet [production]
12:18 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1011.eqiad.wmnet with OS bookworm [production]
12:17 <btullis@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dse-k8s-worker1011 [production]
12:17 <btullis@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host dse-k8s-worker1011 [production]
12:14 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2072.codfw.wmnet with OS bullseye [production]
12:11 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow4003.ulsfo.wmnet [production]
12:05 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2073.codfw.wmnet with reason: host reimage [production]
12:04 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ncredir4003.ulsfo.wmnet with OS bookworm [production]
12:01 <btullis@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dse-k8s-worker1010 [production]
11:59 <btullis@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host dse-k8s-worker1010 [production]
11:58 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2073.codfw.wmnet with reason: host reimage [production]
11:54 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2072.codfw.wmnet with reason: host reimage [production]
11:48 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2072.codfw.wmnet with reason: host reimage [production]
11:41 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239954|[Growth] Enable on every new Wikipedia by default (T304052)]] (duration: 06m 39s) [production]
11:38 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2073 [production]
11:38 <mvernon@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2073 [production]
11:37 <vgutierrez> upgrading to acme-chief 0.39 on acme-chief production instances - T419352 [production]
11:37 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
11:36 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:1239954|[Growth] Enable on every new Wikipedia by default (T304052)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
11:36 <mvernon@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ms-be2073 [production]
11:36 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be2073.codfw.wmnet 212.48.192.10.in-addr.arpa 2.1.2.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
11:36 <mvernon@cumin2002> START - Cookbook sre.dns.wipe-cache ms-be2073.codfw.wmnet 212.48.192.10.in-addr.arpa 2.1.2.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]