1601-1650 of 10000 results (35ms)
2025-09-23 ยง
18:32 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1038.eqiad.wmnet with reason: host reimage [production]
18:32 <ebernhardson@deploy1003> ebernhardson: Backport for [[gerrit:1190737|cirrus: Send more_like traffic to eqiad (T405394)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
18:32 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1049.eqiad.wmnet}' [admin]
18:31 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1048.eqiad.wmnet}' [admin]
18:28 <jgleeson> SmashPig upgraded from f805ba74 to 96afe81c [production]
18:28 <ebernhardson@deploy1003> Started scap sync-world: Backport for [[gerrit:1190737|cirrus: Send more_like traffic to eqiad (T405394)]] [production]
18:27 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1038.eqiad.wmnet with reason: host reimage [production]
18:10 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
18:08 <bking@cumin1002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in search_codfw [production]
18:08 <bking@cumin1002> START - Cookbook sre.elasticsearch.ban Unbanning all hosts in search_codfw [production]
18:05 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1038.eqiad.wmnet with OS bookworm [production]
18:05 <aokoth@cumin1003> END (PASS) - Cookbook sre.vrts.upgrade (exit_code=0) on VRTS host vrts1003.eqiad.wmnet [production]
18:04 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1048.eqiad.wmnet}' [admin]
18:04 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1047.eqiad.wmnet}' [admin]
18:03 <aokoth@cumin1003> START - Cookbook sre.vrts.upgrade on VRTS host vrts1003.eqiad.wmnet [production]
18:02 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
17:53 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
17:43 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1037.eqiad.wmnet with OS bookworm [production]
17:42 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1047.eqiad.wmnet}' [admin]
17:42 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1046.eqiad.wmnet}' [admin]
17:37 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1046.eqiad.wmnet}' [admin]
17:37 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1045.eqiad.wmnet}' [admin]
17:24 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1037.eqiad.wmnet with reason: host reimage [production]
17:22 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1045.eqiad.wmnet}' [admin]
17:22 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1044.eqiad.wmnet}' [admin]
17:22 <MacFan4000> cleared excess apache logs and set logrotate to be a bit more aggressive [wm-bot]
17:20 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1037.eqiad.wmnet with reason: host reimage [production]
17:18 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply [production]
17:18 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
17:16 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1044.eqiad.wmnet}' [admin]
17:16 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1043.eqiad.wmnet}' [admin]
16:58 <andrewbogott> test [admin]
16:57 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1037.eqiad.wmnet with OS bookworm [production]
16:56 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1036.eqiad.wmnet with OS bookworm [production]
16:56 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1043.eqiad.wmnet}' [admin]
16:55 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1042.eqiad.wmnet}' [admin]
16:55 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
16:55 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
16:53 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=97) [admin]
16:50 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1042.eqiad.wmnet}' [admin]
16:50 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1041.eqiad.wmnet}' [admin]
16:42 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
16:40 <denisse> Upgrade Envoy to v1.29.12 on titan hosts - T403663 [production]
16:39 <denisse> Upgrade Envoy to v1.29.12 on prometheus::pop hosts - T403663 [production]
16:37 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage [production]
16:37 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:37 <denisse> Upgrade Envoy to v1.29.12 on prometheus hosts - T403663 [production]
16:32 <denisse> Upgrade Envoy to v1.29.12 on graphite hosts - T403663 [production]
16:31 <jasmine@cumin1003> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all services in eqiad: Moving services to codfw, Southward DC Switchover Day 1 - T399891 [production]
16:31 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1036.eqiad.wmnet with reason: host reimage [production]