2026-03-18
17:05 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir2002.codfw.wmnet [production]
17:05 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=0) rolling reboot on A:ncredir-eqsin and A:ncredir [production]
17:05 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-ulsfo and A:ncredir [production]
17:04 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=0) rolling reboot on A:ncredir-magru and A:ncredir [production]
17:03 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp3078.esams.wmnet with OS trixie [production]
17:02 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker1347 [production]
17:02 <jayme@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1347 [production]
17:02 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp3078.esams.wmnet [reason: trixie reimaging] [production]
17:01 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp3076.esams.wmnet [reason: trixie reimaging] [production]
17:01 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp3077.esams.wmnet with OS trixie [production]
17:01 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp3077.esams.wmnet [reason: trixie reimaging] [production]
16:59 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host ncredir2002.codfw.wmnet [production]
16:58 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=ncredir2002.* [production]
16:56 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 8 hosts with reason: upgrade [production]
16:55 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=ncredir2001.* [production]
16:55 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ncredir2001.codfw.wmnet [production]
16:55 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for ncredir2001.codfw.wmnet [production]
16:55 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3076.esams.wmnet with OS trixie [production]
16:53 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2014.codfw.wmnet [production]
16:52 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-eqsin and A:ncredir [production]
16:52 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbproxy2008.codfw.wmnet with reason: kernel update [production]
16:51 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir-magru and A:ncredir [production]
16:51 <klausman@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on ml-serve1013.eqiad.wmnet with reason: Reboot for security update [production]
16:50 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2013.codfw.wmnet [production]
16:49 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=ncredir2001.* [production]
16:49 <brett@cumin2002> END (ERROR) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=97) rolling reboot on A:ncredir and A:ncredir [production]
16:48 <jayme@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1347 [production]
16:48 <jayme@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker1347.eqiad.wmnet 199.48.64.10.in-addr.arpa 9.9.1.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
16:48 <jayme@cumin1003> START - Cookbook sre.dns.wipe-cache wikikube-worker1347.eqiad.wmnet 199.48.64.10.in-addr.arpa 9.9.1.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
16:48 <jayme@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:47 <jayme@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker1347 - jayme@cumin1003" [production]
16:47 <jayme@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker1347 - jayme@cumin1003" [production]
16:47 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot begin reboot of dns3003.wikimedia.org [production]
16:47 <klausman@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1012.eqiad.wmnet [production]
16:47 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling reboot on A:ncredir and A:ncredir [production]
16:47 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2012.codfw.wmnet [production]
16:47 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup2014.codfw.wmnet [production]
16:46 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp3075.esams.wmnet [reason: trixie reimaging] [production]
16:46 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2003.codfw.wmnet [production]
16:45 <fnegri@cumin1003> conftool action : set/pooled=yes; selector: name=clouddb1013.eqiad.wmnet [production]
16:44 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup2013.codfw.wmnet [production]
16:44 <jayme@cumin1003> START - Cookbook sre.dns.netbox [production]
16:43 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host backup2009.codfw.wmnet [production]
16:43 <jayme@cumin1003> START - Cookbook sre.hosts.move-vlan for host wikikube-worker1347 [production]
16:43 <brett@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=99) [production]
16:43 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host wikikube-worker1347.eqiad.wmnet with OS trixie [production]
16:43 <brett@cumin2002> START - Cookbook sre.hosts.reboot-cluster [production]
16:41 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbproxy2007.codfw.wmnet with reason: kernel update [production]
16:40 <jynus@cumin1003> START - Cookbook sre.hosts.reboot-single for host backup2012.codfw.wmnet [production]
16:39 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3079.esams.wmnet with OS trixie [production]