401-450 of 10000 results (120ms)
2026-03-19 ยง
10:24 <btullis@cumin1003> END (FAIL) - Cookbook sre.ceph.roll-restart-reboot-server (exit_code=99) rolling reboot on A:cephosd-eqiad [production]
10:22 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4007.ulsfo.wmnet [production]
10:21 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host testvm2006.codfw.wmnet [production]
10:21 <fnegri@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddumps1001.wikimedia.org [production]
10:20 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2008.wikimedia.org [production]
10:19 <daniel@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
10:18 <daniel@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
10:16 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host testvm2008.wikimedia.org [production]
10:14 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2007.codfw.wmnet [production]
10:13 <fnegri@cumin1003> START - Cookbook sre.hosts.reboot-single for host clouddumps1001.wikimedia.org [production]
10:10 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host testvm2007.codfw.wmnet [production]
10:09 <btullis@cumin1003> START - Cookbook sre.opensearch.roll-restart-reboot rolling reboot on A:datahubsearch [production]
10:04 <daniel@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
10:03 <daniel@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
10:00 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti4007.ulsfo.wmnet with OS bookworm [production]
09:58 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 17 hosts with reason: upgrade [production]
09:57 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host schema1003.eqiad.wmnet [production]
09:53 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host schema1003.eqiad.wmnet [production]
09:46 <jnuche@deploy2002> Finished deploy [releng/jenkins-deploy@863e5c2] (releasing): T420477 (duration: 01m 07s) [production]
09:45 <jnuche@deploy2002> Started deploy [releng/jenkins-deploy@863e5c2] (releasing): T420477 [production]
09:43 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1017.eqiad.wmnet with OS bookworm [production]
09:43 <jnuche@deploy2002> Finished deploy [releng/jenkins-deploy@863e5c2] (releasing): T420477 (duration: 00m 59s) [production]
09:42 <jnuche@deploy2002> Started deploy [releng/jenkins-deploy@863e5c2] (releasing): T420477 [production]
09:39 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti4007.ulsfo.wmnet with reason: host reimage [production]
09:35 <moritzm> installing libnginx-mod-http-lua security updates [production]
09:35 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti4007.ulsfo.wmnet with reason: host reimage [production]
09:29 <btullis@cumin1003> START - Cookbook sre.ceph.roll-restart-reboot-server rolling reboot on A:cephosd-eqiad [production]
09:26 <klausman@cumin2002> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:ml-serve-worker-codfw [production]
09:26 <klausman@cumin2002> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:ml-serve-worker-codfw [production]
09:24 <klausman@cumin2002> END (ERROR) - Cookbook sre.k8s.reboot-nodes (exit_code=97) rolling reboot on A:ml-serve-worker-codfw [production]
09:21 <brouberol@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/kafka-mirrormaker: apply [production]
09:21 <brouberol@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/kafka-mirrormaker: apply [production]
09:19 <brouberol@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/kafka-mirrormaker: apply [production]
09:19 <brouberol@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/kafka-mirrormaker: apply [production]
09:13 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti4007.ulsfo.wmnet with OS bookworm [production]
09:11 <klausman@cumin2002> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:ml-serve-worker-codfw [production]
09:01 <moritzm> remove ganeti4007 from classic Ganeti cluster in ulsfo T418993 [production]
08:56 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh4001.wikimedia.org to plain [production]
08:54 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of doh4001.wikimedia.org to plain [production]
08:48 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh4002.wikimedia.org to plain [production]
08:46 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of doh4002.wikimedia.org to plain [production]
08:46 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of hcaptcha-proxy4001.wikimedia.org to plain [production]
08:45 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of hcaptcha-proxy4001.wikimedia.org to plain [production]
08:45 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of hcaptcha-proxy4002.wikimedia.org to plain [production]
08:44 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of hcaptcha-proxy4002.wikimedia.org to plain [production]
08:43 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of install4003.wikimedia.org to plain [production]
08:42 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of install4003.wikimedia.org to plain [production]
08:40 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti4007.ulsfo.wmnet [production]
08:39 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4007.ulsfo.wmnet [production]
08:38 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host doh4003.wikimedia.org [production]