2026-05-22
09:26 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/test-kitchen-next: apply [production]
09:26 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/test-kitchen-next: apply [production]
09:26 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3070.esams.wmnet [production]
09:21 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
09:16 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest2010.codfw.wmnet with OS trixie [production]
09:14 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp307[0-1].esams.wmnet} and A:cp [production]
09:11 <slyngshede@cumin1003> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp307[6-7].esams.wmnet} and A:cp [production]
09:11 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3077.esams.wmnet [production]
09:04 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
09:03 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest2010.codfw.wmnet with OS trixie [production]
08:47 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
08:46 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie [production]
08:40 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie [production]
08:33 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wikidata: apply [production]
08:33 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wikidata: apply [production]
08:30 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3076.esams.wmnet [production]
08:18 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp307[6-7].esams.wmnet} and A:cp [production]
08:15 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ganeti1058.eqiad.wmnet on all recursors [production]
08:15 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:15 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: change records for ganeti1058 - cmooney@cumin1003" [production]
08:15 <cmooney@cumin1003> START - Cookbook sre.dns.wipe-cache ganeti1058.eqiad.wmnet on all recursors [production]
08:15 <cmooney@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: change records for ganeti1058 - cmooney@cumin1003" [production]
08:09 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
08:07 <slyngshede@cumin1003> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp306[8-9].esams.wmnet} and A:cp [production]
08:07 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3069.esams.wmnet [production]
08:05 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wikidata: apply [production]
08:05 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wikidata: apply [production]
07:31 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1024.eqiad.wmnet [production]
07:26 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3068.esams.wmnet [production]
07:14 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp306[8-9].esams.wmnet} and A:cp [production]
07:11 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti1057.eqiad.wmnet to cluster eqiad and group A [production]
07:10 <slyngshede@cumin1003> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp3075.esams.wmnet} and A:cp [production]
07:10 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3075.esams.wmnet [production]
07:06 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti1057.eqiad.wmnet to cluster eqiad and group A [production]
07:04 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1057.eqiad.wmnet [production]
07:02 <jmm@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ganeti1057 [production]
07:01 <jmm@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ganeti1057 [production]
06:58 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp3075.esams.wmnet} and A:cp [production]
06:58 <slyngshede@cumin1003> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp3067.esams.wmnet} and A:cp [production]
06:58 <slyngshede@cumin1003> cookbooks.sre.cdn.roll-reboot finished rebooting cp3067.esams.wmnet [production]
06:56 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1057.eqiad.wmnet [production]
06:46 <slyngshede@cumin1003> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp3067.esams.wmnet} and A:cp [production]
06:13 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1024.eqiad.wmnet [production]
06:08 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1024.eqiad.wmnet [production]
06:07 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host bast3007.wikimedia.org [production]
06:01 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host bast3007.wikimedia.org [production]
05:25 <marostegui@dns1004> END - running authdns-update [production]
05:24 <marostegui@dns1004> START - running authdns-update [production]
05:23 <marostegui> Failover m5-master T426633 [production]
05:19 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbproxy1028.eqiad.wmnet with reason: Reboot [production]