1051-1100 of 10000 results (27ms)
2025-09-23 ยง
17:18 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply [production]
17:18 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
17:16 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1044.eqiad.wmnet}' [admin]
17:16 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1043.eqiad.wmnet}' [admin]
16:58 <andrewbogott> test [admin]
16:57 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1037.eqiad.wmnet with OS bookworm [production]
16:56 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1036.eqiad.wmnet with OS bookworm [production]
16:56 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1043.eqiad.wmnet}' [admin]
16:55 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1042.eqiad.wmnet}' [admin]
16:55 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
16:55 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
16:53 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=97) [admin]
16:50 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1042.eqiad.wmnet}' [admin]
16:50 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
16:42 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
16:40 <denisse> Upgrade Envoy to v1.29.12 on titan hosts - T403663 [production]
16:39 <denisse> Upgrade Envoy to v1.29.12 on prometheus::pop hosts - T403663 [production]
16:37 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage [production]
16:37 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:37 <denisse> Upgrade Envoy to v1.29.12 on prometheus hosts - T403663 [production]
16:32 <denisse> Upgrade Envoy to v1.29.12 on graphite hosts - T403663 [production]
16:31 <jasmine@cumin1003> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 [production]
16:31 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage [production]
16:27 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
16:26 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' [admin]
16:25 <bking@cumin1002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: cirrussearch2093.codfw.wmnet for thread pool rejections - bking@cumin1002 - T399891 [production]
16:25 <bking@cumin1002> START - Cookbook sre.elasticsearch.ban Banning hosts: cirrussearch2093.codfw.wmnet for thread pool rejections - bking@cumin1002 - T399891 [production]
16:22 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' [admin]
16:22 <denisse> Upgrade Envoy to v1.29.12 on logstash hosts - T403663 [production]
16:20 <denisse> Upgrade Envoy to v1.29.12 on grafana hosts - T403663 [production]
16:19 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1025.eqiad.wmnet with reason: host reimage [production]
16:15 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1025.eqiad.wmnet with reason: host reimage [production]
16:09 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1036.eqiad.wmnet with OS bookworm [production]
16:03 <jasmine@cumin1003> START - Cookbook sre.discovery.datacenter depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 [production]
16:03 <stevemunene@cumin1003> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. [production]
16:03 <jasmine@cumin1003> END (ERROR) - Cookbook sre.discovery.datacenter (exit_code=93) depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 [production]
15:57 <stevemunene@cumin1003> START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. [production]
15:56 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
15:55 <andrew@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "cloudcephosd1025 is no longer failed, I think - andrew@cumin2002" [production]
15:55 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
15:52 <jasmine@cumin1003> START - Cookbook sre.discovery.datacenter depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 [production]
15:51 <stevemunene@cumin1003> END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid jvm daemons. [production]
15:49 <andrew@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "cloudcephosd1025 is no longer failed, I think - andrew@cumin2002" [production]
15:46 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
15:43 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1044.eqiad.wmnet with OS bookworm [production]
15:31 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component jobs-api [toolsbeta]
15:30 <jasmine@cumin1003> END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: depool site eqiad [reason: Moving traffic to codfw, Southward DC Switchover Day 1, T399891] [production]
15:30 <jasmine@cumin1003> START - Cookbook sre.dns.admin DNS admin: depool site eqiad [reason: Moving traffic to codfw, Southward DC Switchover Day 1, T399891] [production]
15:28 <Emperor> restart swift-proxy ms-fe1010 ms-fe2010 ms-fe2011 ms-fe2015 T360913 [production]
15:27 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]