4451-4500 of 10000 results (38ms)
2025-09-23 ยง
22:41 <Krinkle> Create deployment-poolcounter07 host (debian-12.0-bookworm with 2GB RAM, same as prod; 1 cpu instead of 2 cpu, unlike prod). ref T380881 [releng]
22:31 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1059.eqiad.wmnet}' [admin]
22:09 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1059.eqiad.wmnet}' [admin]
22:09 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1058.eqiad.wmnet}' [admin]
22:03 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1046.eqiad.wmnet with OS bookworm [production]
22:00 <jgleeson> civicrm upgraded from 4304c138 to 8cdce9e0 [production]
21:47 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1058.eqiad.wmnet}' [admin]
21:46 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1057.eqiad.wmnet}' [admin]
21:44 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1046.eqiad.wmnet with reason: host reimage [production]
21:38 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1046.eqiad.wmnet with reason: host reimage [production]
21:25 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1057.eqiad.wmnet}' [admin]
21:25 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1056.eqiad.wmnet}' [admin]
21:18 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1046.eqiad.wmnet with OS bookworm [production]
21:17 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1045.eqiad.wmnet with OS bookworm [production]
21:17 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
21:16 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
21:14 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.reactivate (exit_code=99) [admin]
21:14 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
21:11 <tgr_> UTC late deploys done [production]
21:10 <tgr@deploy1003> Finished scap sync-world: Backport for [[gerrit:1190712|session: Fix date handling for JWT cookies (T399243 T399200)]], [[gerrit:1190713|session: Fix date handling for JWT cookies (T399243 T399200)]] (duration: 41m 51s) [production]
21:06 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1056.eqiad.wmnet}' [admin]
21:06 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1055.eqiad.wmnet}' [admin]
20:59 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1045.eqiad.wmnet with reason: host reimage [production]
20:58 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:57 <tgr@deploy1003> tgr: Continuing with sync [production]
20:55 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:55 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1045.eqiad.wmnet with reason: host reimage [production]
20:55 <tgr@deploy1003> tgr: Backport for [[gerrit:1190712|session: Fix date handling for JWT cookies (T399243 T399200)]], [[gerrit:1190713|session: Fix date handling for JWT cookies (T399243 T399200)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:52 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:51 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:48 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:47 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:46 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment codfw1dev for all services [admin]
20:45 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1055.eqiad.wmnet}' [admin]
20:45 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1054.eqiad.wmnet}' [admin]
20:44 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment codfw1dev for all services [admin]
20:43 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:41 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'P{P:openstack::codfw1dev::nova::compute::service}' [admin]
20:36 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=97) [admin]
20:36 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.undrain_node [admin]
20:35 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1045.eqiad.wmnet with OS bookworm [production]
20:29 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
20:29 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
20:29 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1040.eqiad.wmnet with OS bookworm [production]
20:28 <tgr@deploy1003> Started scap sync-world: Backport for [[gerrit:1190712|session: Fix date handling for JWT cookies (T399243 T399200)]], [[gerrit:1190713|session: Fix date handling for JWT cookies (T399243 T399200)]] [production]
20:28 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1054.eqiad.wmnet}' [admin]
20:28 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1053.eqiad.wmnet}' [admin]
20:09 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1040.eqiad.wmnet with reason: host reimage [production]
20:08 <andrewbogott> creating puppetdbpostgres and adding it to tools-puppetdb-2 to store postgres data; the root volume of that VM was filling up and causing widespread puppet issues [tools]
20:06 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1040.eqiad.wmnet with reason: host reimage [production]