2024-06-11
13:55 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd1036.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:55 <logmsgbot> lucaswerkmeister-wmde@deploy1002 lucaswerkmeister-wmde, cmelo: Backport for [[gerrit:1041096|Enable CampaignEvents on swahili wikipedia (T366502)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1035.eqiad.wmnet [production]
13:52 <logmsgbot> lucaswerkmeister-wmde@deploy1002 Started scap: Backport for [[gerrit:1041096|Enable CampaignEvents on swahili wikipedia (T366502)]] [production]
13:52 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1036.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:51 <logmsgbot> lucaswerkmeister-wmde@deploy1002 Finished scap: Backport for [[gerrit:1041094|Configures the necessary user rights for CampaignEvents on swahili (T366502)]] (duration: 44m 51s) [production]
13:50 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts stat1007.eqiad.wmnet [production]
13:50 <btullis@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:50 <fnegri@cumin1002> START - Cookbook sre.hosts.reboot-single for host clouddb1017.eqiad.wmnet [production]
13:49 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd1036.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:49 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1038.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:48 <btullis@cumin1002> START - Cookbook sre.dns.netbox [production]
13:47 <fnegri@cumin1002> conftool action : set/pooled=no; selector: name=clouddb1017.eqiad.wmnet,service=s3 [production]
13:47 <fnegri@cumin1002> conftool action : set/pooled=no; selector: name=clouddb1017.eqiad.wmnet,service=s1 [production]
13:46 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudcephosd1038.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:46 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1038.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:46 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1037.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:46 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1036.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:46 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host cloudcephosd1035.mgmt.eqiad.wmnet with reboot policy FORCED [production]
13:45 <jclark@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:45 <jclark@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt for cloudcephosd1035-38 - jclark@cumin1002" [production]
13:45 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1035.eqiad.wmnet [production]
13:45 <vgutierrez> rolling switch from tcp-mss-clamper to ferm based MSS clamping on A:ncredir - T365689 [production]
13:44 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt for cloudcephosd1035-38 - jclark@cumin1002" [production]
13:43 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1032.eqiad.wmnet [production]
13:43 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1032.eqiad.wmnet [production]
13:42 <jiji@cumin1002> END (ERROR) - Cookbook sre.k8s.reboot-nodes (exit_code=97) rolling reboot on A:wikikube-worker-eqiad [production]
13:40 <btullis@cumin1002> START - Cookbook sre.hosts.decommission for hosts stat1007.eqiad.wmnet [production]
13:40 <jclark@cumin1002> START - Cookbook sre.dns.netbox [production]
13:40 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts stat1006.eqiad.wmnet [production]
13:40 <btullis@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:40 <btullis@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: stat1006.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1002" [production]
13:36 <vgutierrez> repool ncredir6001 - T365689 [production]
13:36 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:restbase-codfw [production]
13:36 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1032.eqiad.wmnet [production]
13:33 <moritzm> failover ganeti cluster for esams01 to ganeti3005 [production]
13:32 <moritzm> failover ganeti cluster for esams02 to ganeti3006 [production]
13:26 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3006.esams.wmnet [production]
13:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3006.esams.wmnet [production]
13:22 <fnegri@cumin1002> conftool action : set/pooled=yes; selector: name=clouddb1016.eqiad.wmnet,service=s5 [production]
13:22 <fnegri@cumin1002> conftool action : set/pooled=yes; selector: name=clouddb1016.eqiad.wmnet,service=s8 [production]
13:21 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1032.eqiad.wmnet [production]
13:20 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2205 (T352010)', diff saved to https://phabricator.wikimedia.org/P64630 and previous config saved to /var/cache/conftool/dbconfig/20240611-132043-ladsgroup.json [production]
13:19 <logmsgbot> lucaswerkmeister-wmde@deploy1002 cmelo, lucaswerkmeister-wmde: Continuing with sync [production]
13:18 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3006.esams.wmnet [production]
13:18 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1031.eqiad.wmnet [production]
13:17 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1031.eqiad.wmnet [production]
13:15 <vgutierrez> depool ncredir6001 - T365689 [production]
13:11 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1031.eqiad.wmnet [production]
13:10 <logmsgbot> lucaswerkmeister-wmde@deploy1002 cmelo, lucaswerkmeister-wmde: Backport for [[gerrit:1041094|Configures the necessary user rights for CampaignEvents on swahili (T366502)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]