2025-09-23
18:02 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) |
[admin]
17:53 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.reactivate |
[admin] |
17:43 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1037.eqiad.wmnet with OS bookworm |
[production] |
17:42 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1047.eqiad.wmnet}' |
[admin] |
17:42 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1046.eqiad.wmnet}' |
[admin] |
17:37 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1046.eqiad.wmnet}' |
[admin] |
17:37 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1045.eqiad.wmnet}' |
[admin] |
17:24 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1037.eqiad.wmnet with reason: host reimage |
[production] |
17:22 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1045.eqiad.wmnet}' |
[admin] |
17:22 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1044.eqiad.wmnet}' |
[admin] |
17:22 |
<MacFan4000> |
cleared excess apache logs and set logrotate to be a bit more aggressive |
[wm-bot] |
17:20 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1037.eqiad.wmnet with reason: host reimage |
[production] |
17:18 |
<sfaci@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply |
[production] |
17:18 |
<sfaci@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply |
[production] |
17:16 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1044.eqiad.wmnet}' |
[admin] |
17:16 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1043.eqiad.wmnet}' |
[admin] |
16:58 |
<andrewbogott> |
test |
[admin] |
16:57 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1037.eqiad.wmnet with OS bookworm |
[production] |
16:56 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1036.eqiad.wmnet with OS bookworm |
[production] |
16:56 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1043.eqiad.wmnet}' |
[admin] |
16:55 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1042.eqiad.wmnet}' |
[admin] |
16:55 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) |
[admin] |
16:55 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.reactivate |
[admin] |
16:53 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=97) |
[admin] |
16:50 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1042.eqiad.wmnet}' |
[admin] |
16:50 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' |
[admin] |
16:42 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.bootstrap_and_add |
[admin] |
16:40 |
<denisse> |
Upgrade Envoy to v1.29.12 on titan hosts - T403663 |
[production] |
16:39 |
<denisse> |
Upgrade Envoy to v1.29.12 on prometheus::pop hosts - T403663 |
[production] |
16:37 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage |
[production] |
16:37 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1025.eqiad.wmnet with OS bookworm |
[production] |
16:37 |
<denisse> |
Upgrade Envoy to v1.29.12 on prometheus hosts - T403663 |
[production] |
16:32 |
<denisse> |
Upgrade Envoy to v1.29.12 on graphite hosts - T403663 |
[production] |
16:31 |
<jasmine@cumin1003> |
END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 |
[production] |
16:31 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage |
[production] |
16:27 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' |
[admin] |
16:26 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' |
[admin] |
16:25 |
<bking@cumin1002> |
END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: cirrussearch2093.codfw.wmnet for thread pool rejections - bking@cumin1002 - T399891 |
[production] |
16:25 |
<bking@cumin1002> |
START - Cookbook sre.elasticsearch.ban Banning hosts: cirrussearch2093.codfw.wmnet for thread pool rejections - bking@cumin1002 - T399891 |
[production] |
16:22 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1040.eqiad.wmnet}' |
[admin] |
16:22 |
<denisse> |
Upgrade Envoy to v1.29.12 on logstash hosts - T403663 |
[production] |
16:20 |
<denisse> |
Upgrade Envoy to v1.29.12 on grafana hosts - T403663 |
[production] |
16:19 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1025.eqiad.wmnet with reason: host reimage |
[production] |
16:15 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1025.eqiad.wmnet with reason: host reimage |
[production] |
16:09 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1036.eqiad.wmnet with OS bookworm |
[production] |
16:03 |
<jasmine@cumin1003> |
START - Cookbook sre.discovery.datacenter depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 |
[production] |
16:03 |
<stevemunene@cumin1003> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. |
[production] |
16:03 |
<jasmine@cumin1003> |
END (ERROR) - Cookbook sre.discovery.datacenter (exit_code=93) depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 |
[production] |
15:57 |
<stevemunene@cumin1003> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. |
[production] |
15:56 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm |
[production] |