751-800 of 10000 results (21ms)
2026-03-18 ยง
17:12 <brett@cumin2002> cookbooks.sre.cdn.roll-reboot finished rebooting cp2047.codfw.wmnet [production]
17:11 <swfrench@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
17:09 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1017.eqiad.wmnet with OS bookworm [production]
17:09 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=cp3078.* [production]
17:08 <swfrench@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply [production]
17:08 <swfrench@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
17:08 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp3079.* [production]
17:08 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot finished rebooting dns3003.wikimedia.org [production]
17:08 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp3078.* [production]
17:07 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-eqiad and A:ncredir [production]
17:07 <swfrench@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-jobrunner: apply [production]
17:07 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-esams and A:ncredir [production]
17:06 <swfrench@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-jobrunner: apply [production]
17:06 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=ncredir2002.* [production]
17:05 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-drmrs and A:ncredir [production]
17:05 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir2002.codfw.wmnet [production]
17:05 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=0) rolling reboot on A:ncredir-eqsin and A:ncredir [production]
17:05 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-ulsfo and A:ncredir [production]
17:04 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=0) rolling reboot on A:ncredir-magru and A:ncredir [production]
17:03 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp3078.esams.wmnet with OS trixie [production]
17:02 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker1347 [production]
17:02 <jayme@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1347 [production]
17:02 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp3078.esams.wmnet [reason: trixie reimaging] [production]
17:01 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp3076.esams.wmnet [reason: trixie reimaging] [production]
17:01 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp3077.esams.wmnet with OS trixie [production]
17:01 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp3077.esams.wmnet [reason: trixie reimaging] [production]
17:00 <wmftkbot> Test Kitchen edge-unique experiments (poll 270190) - adds: none; removes: none; fields: synth-aa-test-traffic-impact-2, synth-aa-test-traffic-impact-3 - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
16:59 <wmftkbot> Test Kitchen edge-unique experiments (poll 270189) - adds: none; removes: none; fields: synth-aa-test-traffic-impact-1 - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
16:59 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host ncredir2002.codfw.wmnet [production]
16:58 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=ncredir2002.* [production]
16:56 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 8 hosts with reason: upgrade [production]
16:55 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=ncredir2001.* [production]
16:55 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ncredir2001.codfw.wmnet [production]
16:55 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for ncredir2001.codfw.wmnet [production]
16:55 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3076.esams.wmnet with OS trixie [production]
16:53 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2014.codfw.wmnet [production]
16:52 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-eqsin and A:ncredir [production]
16:52 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbproxy2008.codfw.wmnet with reason: kernel update [production]
16:51 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-magru and A:ncredir [production]
16:51 <klausman@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on ml-serve1013.eqiad.wmnet with reason: Reboot for security update [production]
16:50 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2013.codfw.wmnet [production]
16:49 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=ncredir2001.* [production]
16:49 <brett@cumin2002> END (ERROR) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=97) rolling reboot on A:ncredir and A:ncredir [production]
16:48 <jayme@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1347 [production]
16:48 <jayme@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1347.eqiad.wmnet 199.48.64.10.in-addr.arpa 9.9.1.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
16:48 <jayme@cumin1003> START - Cookbook sre.dns.wipe-cache wikikube-worker1347.eqiad.wmnet 199.48.64.10.in-addr.arpa 9.9.1.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
16:48 <jayme@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:47 <jayme@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker1347 - jayme@cumin1003" [production]
16:47 <jayme@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker1347 - jayme@cumin1003" [production]
16:47 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot begin reboot of dns3003.wikimedia.org [production]