2024-08-06
18:28 <sukhe> sudo cumin "lvs6001*" 'disable-puppet "rebooting" && systemctl stop pybal.service' [production]
18:18 <brett> stop pybal on lvs5005 for server reboot [production]
18:13 <dancy@deploy1003> Started scap sync-world: testing T370934 [production]
17:53 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs5004.eqsin.wmnet [production]
17:50 <sukhe@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs5004.eqsin.wmnet [production]
17:47 <sukhe> stop pybal on lvs5004 for server reboot [production]
17:40 <mutante> CI - adding a new SSH key to jenkins - in the same file without removing the old key yet - this is expected to have no effect, but if CI breaks will revert - T177826 [production]
17:01 <fnegri@cumin1002> conftool action : set/pooled=yes; selector: name=clouddb1020.eqiad.wmnet,service=s5 [production]
17:01 <fnegri@cumin1002> conftool action : set/pooled=yes; selector: name=clouddb1020.eqiad.wmnet,service=s8 [production]
16:56 <fnegri@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host clouddb1020.eqiad.wmnet with OS bookworm [production]
16:44 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1023.eqiad.wmnet with OS bullseye [production]
16:39 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:39 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding payments200 to codfw - jhancock@cumin2002" [production]
16:39 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding payments200 to codfw - jhancock@cumin2002" [production]
16:35 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
16:23 <fnegri@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on clouddb1020.eqiad.wmnet with reason: host reimage [production]
16:21 <fnegri@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on clouddb1020.eqiad.wmnet with reason: host reimage [production]
16:08 <fnegri@cumin1002> START - Cookbook sre.hosts.reimage for host clouddb1020.eqiad.wmnet with OS bookworm [production]
16:08 <sukhe> sudo cumin "A:dnsbox" "run-puppet-agent --enable 'upgrading anycast-hc'": finish anycast-hc upgrade: T370068 [production]
16:08 <sukhe> sudo cumin "A:dnsbox" "run-puppet-agent --enable 'upgrading anycast-hc'": finish anycast-hc upgrade [production]
16:03 <fnegri@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on clouddb1020.eqiad.wmnet with reason: Reimaging clouddb1020 T365424 [production]
16:03 <fnegri@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on clouddb1020.eqiad.wmnet with reason: Reimaging clouddb1020 T365424 [production]
15:46 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:46 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2011 to codfw - jhancock@cumin2002" [production]
15:46 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2011 to codfw - jhancock@cumin2002" [production]
15:41 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
15:39 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:39 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2010 to codfw - jhancock@cumin2002" [production]
15:39 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ml-serve2010 to codfw - jhancock@cumin2002" [production]
15:35 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
15:30 <dcausse@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
15:30 <dcausse@deploy1003> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
15:26 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
15:26 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
15:25 <sukhe@cumin1002> conftool action : set/pooled=yes; selector: name=dns1006.wikimedia.org [reason: [done] anycast-healthchecker 0.9.8 upgrade] [production]
15:25 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker2035.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
15:23 <ryankemper@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1023.eqiad.wmnet with OS bullseye [production]
15:23 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host wikikube-worker2035.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
15:23 <sukhe@cumin1002> conftool action : set/pooled=no; selector: name=dns1006.wikimedia.org [reason: anycast-healthchecker 0.9.8 upgrade] [production]
15:21 <sukhe@cumin1002> conftool action : set/pooled=yes; selector: name=dns1005.wikimedia.org [reason: [done] anycast-healthchecker 0.9.8 upgrade] [production]
15:20 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
15:18 <sukhe@cumin1002> conftool action : set/pooled=no; selector: name=dns1005.wikimedia.org [reason: anycast-healthchecker 0.9.8 upgrade] [production]
15:16 <sukhe@cumin1002> conftool action : set/pooled=yes; selector: name=dns1004.wikimedia.org [reason: [done] anycast-healthchecker 0.9.8 upgrade] [production]
15:14 <sukhe@cumin1002> conftool action : set/pooled=no; selector: name=dns1004.wikimedia.org [reason: anycast-healthchecker 0.9.8 upgrade] [production]
15:12 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host sretest2002.mgmt.codfw.wmnet with reboot policy FORCED [production]
15:11 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest1001.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
15:10 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host sretest1001.mgmt.eqiad.wmnet with reboot policy GRACEFUL [production]
15:10 <cdanis> re-enabling puppet on cp nodes to deploy https://gerrit.wikimedia.org/r/1059126 [production]
15:02 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker1296.mgmt.eqiad.wmnet with reboot policy FORCED [production]
15:01 <cdanis> disabling puppet on cp nodes to deploy https://gerrit.wikimedia.org/r/1059126 [production]