951-1000 of 10000 results (18ms)
2025-07-04 ยง
12:31 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp7006.magru.wmnet [production]
12:31 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for cp7006.magru.wmnet [production]
12:11 <mfossati@deploy1003> Finished deploy [airflow-dags/platform_eng@38ba3ec]: bump section topics to v1.8.0 (duration: 00m 49s) [production]
12:11 <mfossati@deploy1003> Started deploy [airflow-dags/platform_eng@38ba3ec]: bump section topics to v1.8.0 [production]
11:08 <cmooney@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Release v10.0.2 with ibgp function in plugin - cmooney@cumin1003 [production]
11:05 <cmooney@cumin1003> START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Release v10.0.2 with ibgp function in plugin - cmooney@cumin1003 [production]
10:56 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 32 hosts with reason: maintenance [production]
10:51 <elukey@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest2006.codfw.wmnet with OS bookworm [production]
10:43 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2203,2212].codfw.wmnet with reason: Maintenance [production]
10:41 <elukey@cumin2002> START - Cookbook sre.hosts.reimage for host sretest2006.codfw.wmnet with OS bookworm [production]
10:27 <cgoubert@deploy1003> Unlocked for deployment [ALL REPOSITORIES]: Dragonfly supernodes reboot (duration: 09m 07s) [production]
10:26 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode1001.eqiad.wmnet [production]
10:23 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode1001.eqiad.wmnet [production]
10:22 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode2001.codfw.wmnet [production]
10:18 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode2001.codfw.wmnet [production]
10:18 <cgoubert@deploy1003> Locking from deployment [ALL REPOSITORIES]: Dragonfly supernodes reboot [production]
10:13 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply [production]
10:13 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply [production]
10:01 <jynus@cumin1002> DONE (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for backupmon1001.eqiad.wmnet: Renew puppet certificate - jynus@cumin1002 [production]
09:16 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of install6002.wikimedia.org to drbd [production]
09:07 <jynus@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on backupmon1001.eqiad.wmnet with reason: Maintenance and reboot [production]
08:59 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of install6002.wikimedia.org to drbd [production]
08:58 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir6001.drmrs.wmnet to drbd [production]
08:48 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2077.codfw.wmnet [production]
08:48 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir6001.drmrs.wmnet to drbd [production]
08:37 <mvernon@cumin2002> START - Cookbook sre.hosts.reboot-single for host ms-be2077.codfw.wmnet [production]
08:33 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti6001.drmrs.wmnet to cluster drmrs01 and group B12 [production]
08:32 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti6001.drmrs.wmnet to cluster drmrs01 and group B12 [production]
08:25 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti6001.drmrs.wmnet [production]
08:16 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti6001.drmrs.wmnet [production]
08:04 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts ganeti2020.codfw.wmnet [production]
08:04 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:04 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ganeti2020.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin1003" [production]
08:03 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti6001.drmrs.wmnet with OS bookworm [production]
08:03 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ganeti2020.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin1003" [production]
07:58 <jmm@cumin1003> START - Cookbook sre.dns.netbox [production]
07:56 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp7006.magru.wmnet with reason: testing [production]
07:53 <jmm@cumin1003> START - Cookbook sre.hosts.decommission for hosts ganeti2020.codfw.wmnet [production]
07:53 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts ganeti2019.codfw.wmnet [production]
07:53 <jmm@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:53 <jmm@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ganeti2019.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin1003" [production]
07:53 <jmm@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: ganeti2019.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin1003" [production]
07:43 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti6001.drmrs.wmnet with reason: host reimage [production]
07:42 <vgutierrez> depooling cp7006 for testing purposes [production]
07:42 <jmm@cumin1003> START - Cookbook sre.dns.netbox [production]
07:39 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti6001.drmrs.wmnet with reason: host reimage [production]
07:36 <jmm@cumin1003> START - Cookbook sre.hosts.decommission for hosts ganeti2019.codfw.wmnet [production]
07:21 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti6001.drmrs.wmnet with OS bookworm [production]
07:19 <jmm@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ganeti6001.drmrs.wmnet with reason: reimage [production]
07:16 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetserver2003.codfw.wmnet [production]