2026-05-20 §
15:37 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host wdqs2023.codfw.wmnet [production]
15:36 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1306-1309].eqiad.wmnet [production]
15:36 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1302-1305].eqiad.wmnet [production]
15:36 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1302-1305].eqiad.wmnet [production]
15:35 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host wdqs2016.codfw.wmnet [production]
15:32 <brett@cumin2002> START - Cookbook sre.hosts.decommission for hosts cp[2041-2042].codfw.wmnet [production]
15:32 <atsuko@deploy1003> helmfile [staging] DONE helmfile.d/services/eventstreams-internal: apply [production]
15:30 <atsuko@deploy1003> helmfile [staging] START helmfile.d/services/eventstreams-internal: apply [production]
15:30 <brouberol@cumin1003> START - Cookbook sre.hosts.reimage for host kafka-jumbo1015.eqiad.wmnet with OS trixie [production]
15:29 <moritzm> failover Ganeti master in codfw02 to ganeti2033 [production]
15:29 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1302-1305].eqiad.wmnet [production]
15:27 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.reboot (exit_code=99) [production]
15:27 <cwilliams@cumin1003> START - Cookbook sre.mysql.pool pool db1257: Migration of db1257.eqiad.wmnet completed [production]
15:26 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1302-1305].eqiad.wmnet [production]
15:26 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1298-1301].eqiad.wmnet [production]
15:26 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1298-1301].eqiad.wmnet [production]
15:25 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: T426560 - bking@cumin2002 [production]
15:25 <bking@cumin2002> START - Cookbook sre.wdqs.reboot [production]
15:25 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1257.eqiad.wmnet with OS trixie [production]
15:23 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: T426560 - bking@cumin2002 [production]
15:23 <brouberol@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-jumbo1014.eqiad.wmnet with OS trixie [production]
15:20 <hashar> Restarted Jenkins CI due to Java upgrade which causes integration/pipelinelib to not be loadable. [production]
15:17 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: T426560 - bking@cumin2002 [production]
15:15 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1298-1301].eqiad.wmnet [production]
15:15 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2033.codfw.wmnet [production]
15:14 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2033.codfw.wmnet [production]
15:13 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1298-1301].eqiad.wmnet [production]
15:13 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1294-1297].eqiad.wmnet [production]
15:13 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1294-1297].eqiad.wmnet [production]
15:09 <bking@cumin2002> conftool action : set/pooled=false; selector: dnsdisc=wdqs-scholarly,name=codfw [production]
15:08 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1257.eqiad.wmnet with reason: host reimage [production]
15:08 <btullis@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host dse-k8s-worker1026.eqiad.wmnet [production]
15:08 <btullis@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host dse-k8s-worker1025.eqiad.wmnet [production]
15:08 <btullis@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host dse-k8s-worker1025.eqiad.wmnet [production]
15:07 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:06 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2033.codfw.wmnet [production]
15:04 <btullis@cumin1003> END (PASS) - Cookbook sre.druid.reboot-workers (exit_code=0) for Druid analytics cluster: Reboot Druid nodes [production]
15:04 <brouberol@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-jumbo1014.eqiad.wmnet with reason: host reimage [production]
15:02 <blake@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1294-1297].eqiad.wmnet [production]
15:02 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1257.eqiad.wmnet with reason: host reimage [production]
15:00 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2033.codfw.wmnet [production]
15:00 <blake@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1294-1297].eqiad.wmnet [production]
15:00 <blake@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on P{wikikube-worker[1294-1327].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
15:00 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti2033.codfw.wmnet [production]
15:00 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2033.codfw.wmnet [production]
14:57 <jynus@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on 14 hosts with reason: restart [production]
14:57 <brouberol@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-jumbo1014.eqiad.wmnet with reason: host reimage [production]
14:56 <btullis@cumin1003> END (FAIL) - Cookbook sre.k8s.pool-depool-node (exit_code=99) depool for host dse-k8s-worker1025.eqiad.wmnet [production]
14:56 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:54 <atsuko@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/turnilo: apply [production]