3401-3450 of 10000 results (41ms)
2024-11-06 ยง
16:08 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:08 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:01 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs4010.ulsfo.wmnet [production]
15:59 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:58 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:57 <mfossati@deploy2002> Finished deploy [airflow-dags/platform_eng@294093b]: remove section alignment image suggestions, now in section topics v1.0.0 (duration: 01m 23s) [production]
15:57 <vriley@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:57 <vriley@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt fransc1001 - vriley@cumin1002" [production]
15:57 <vriley@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt fransc1001 - vriley@cumin1002" [production]
15:57 <mfossati@deploy2002> Started deploy [airflow-dags/platform_eng@294093b]: remove section alignment image suggestions, now in section topics v1.0.0 [production]
15:55 <topranks> rebooting lvs4010 to verify new IPv6 sysctl's for RA processing work T358260 [production]
15:55 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:25:00 on cr[3-4]-ulsfo with reason: prevent bgp alerts firing while lvs4010 is rebooted [production]
15:55 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 0:25:00 on cr[3-4]-ulsfo with reason: prevent bgp alerts firing while lvs4010 is rebooted [production]
15:55 <cmooney@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs4010.ulsfo.wmnet [production]
15:53 <vriley@cumin1002> START - Cookbook sre.dns.netbox [production]
15:51 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:50 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:48 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component api-gateway [tools]
15:48 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:48 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:43 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component api-gateway [tools]
15:43 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:42 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host fransc1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:36 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component api-gateway [toolsbeta]
15:31 <moritzm> installing Linux 5.10.226 on bullseye hosts [production]
15:31 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component api-gateway [toolsbeta]
15:24 <arnaudb@cumin1002> START - Cookbook sre.mysql.pool db2136 gradually with 4 steps - cloned on db2236 [production]
15:18 <mutante> gitlab1004 - systemctl start wmf_auto_restart_ssh-gitlab (because it had failed with "Service ssh-gitlab not present or not running") but now it's just fine and exits with "No restart necessary" T379166 [production]
15:13 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host ms-be2083.codfw.wmnet with OS bullseye [production]
15:12 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1087877|Document available wbformatvalue options (T323778)]] (duration: 38m 45s) [production]
15:07 <arnaudb@cumin1002> END (PASS) - Cookbook sre.mysql.clone (exit_code=0) of db2136.codfw.wmnet onto db2236.codfw.wmnet [production]
15:00 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Continuing with sync [production]
14:59 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Backport for [[gerrit:1087877|Document available wbformatvalue options (T323778)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:51 <moritzm> installing php7.4 security updates [production]
14:50 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1046.eqiad.wmnet [production]
14:48 <moritzm> installing usb.ids updates from Bookworm point release [production]
14:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1046.eqiad.wmnet [production]
14:42 <jmm@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ganeti1046 [production]
14:36 <jmm@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ganeti1046 [production]
14:33 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1087877|Document available wbformatvalue options (T323778)]] [production]
14:31 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1085572|Cleanup for logo related file]] (duration: 15m 01s) [production]
14:31 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: pool site eqiad for service: ncredir-addrs [reason: no reason specified, T378453] [production]
14:31 <vgutierrez@cumin1002> START - Cookbook sre.dns.admin DNS admin: pool site eqiad for service: ncredir-addrs [reason: no reason specified, T378453] [production]
14:27 <lucaswerkmeister-wmde@deploy2002> hamishz, lucaswerkmeister-wmde: Continuing with sync [production]
14:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1045.eqiad.wmnet [production]
14:20 <sukhe@puppetserver1001> conftool action : set/pooled=no; selector: name=cp2031.codfw.wmnet [production]
14:19 <sukhe> depool cp2031 [production]
14:19 <lucaswerkmeister-wmde@deploy2002> hamishz, lucaswerkmeister-wmde: Backport for [[gerrit:1085572|Cleanup for logo related file]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:19 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1045.eqiad.wmnet [production]
14:16 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1085572|Cleanup for logo related file]] [production]