1-50 of 10000 results (75ms)
2025-09-05 ยง
23:58 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1049.eqiad.wmnet with reason: host reimage [production]
23:53 <vriley@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on es1049.eqiad.wmnet with reason: host reimage [production]
23:40 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host es1050.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
23:39 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host es1050 [production]
23:37 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host es1050 [production]
23:37 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
23:37 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt es1050 - vriley@cumin1003" [production]
23:37 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt es1050 - vriley@cumin1003" [production]
23:31 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
23:25 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host es1049.eqiad.wmnet with OS bookworm [production]
22:50 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host es1049.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
22:24 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host es1049.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
22:23 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host es1049 [production]
22:22 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host es1049 [production]
22:22 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:22 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt es1049 - vriley@cumin1003" [production]
22:21 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt es1049 - vriley@cumin1003" [production]
22:18 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
22:13 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2239.codfw.wmnet with reason: Maintenance [production]
22:12 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2237 (T402925)', diff saved to https://phabricator.wikimedia.org/P82667 and previous config saved to /var/cache/conftool/dbconfig/20250905-221244-ladsgroup.json [production]
21:57 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2237', diff saved to https://phabricator.wikimedia.org/P82666 and previous config saved to /var/cache/conftool/dbconfig/20250905-215736-ladsgroup.json [production]
21:47 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dse-k8s-worker1014.eqiad.wmnet with OS bookworm [production]
21:42 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2237', diff saved to https://phabricator.wikimedia.org/P82665 and previous config saved to /var/cache/conftool/dbconfig/20250905-214229-ladsgroup.json [production]
21:27 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2237 (T402925)', diff saved to https://phabricator.wikimedia.org/P82664 and previous config saved to /var/cache/conftool/dbconfig/20250905-212721-ladsgroup.json [production]
21:02 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1014.eqiad.wmnet with OS bookworm [production]
20:59 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:48 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:46 <jclark@cumin1002> END (ERROR) - Cookbook sre.hosts.provision (exit_code=97) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:41 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:32 <kemayo@deploy1003> Finished scap sync-world: Backport for [[gerrit:1185192|Revert "Edit: Split footer lists into columns" (T401066 T403856)]] (duration: 15m 31s) [production]
20:24 <kemayo@deploy1003> kemayo: Continuing with sync [production]
20:23 <kemayo@deploy1003> kemayo: Backport for [[gerrit:1185192|Revert "Edit: Split footer lists into columns" (T401066 T403856)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:18 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance [production]
20:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231 (T401906)', diff saved to https://phabricator.wikimedia.org/P82663 and previous config saved to /var/cache/conftool/dbconfig/20250905-201818-fceratto.json [production]
20:17 <kemayo@deploy1003> Started scap sync-world: Backport for [[gerrit:1185192|Revert "Edit: Split footer lists into columns" (T401066 T403856)]] [production]
20:03 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P82662 and previous config saved to /var/cache/conftool/dbconfig/20250905-200311-fceratto.json [production]
19:48 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P82661 and previous config saved to /var/cache/conftool/dbconfig/20250905-194804-fceratto.json [production]
19:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1231 (T401906)', diff saved to https://phabricator.wikimedia.org/P82660 and previous config saved to /var/cache/conftool/dbconfig/20250905-193256-fceratto.json [production]
19:30 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1231 (T401906)', diff saved to https://phabricator.wikimedia.org/P82659 and previous config saved to /var/cache/conftool/dbconfig/20250905-193047-fceratto.json [production]
19:30 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1231.eqiad.wmnet with reason: Maintenance [production]
19:30 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]
19:30 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T401906)', diff saved to https://phabricator.wikimedia.org/P82658 and previous config saved to /var/cache/conftool/dbconfig/20250905-193007-fceratto.json [production]
19:20 <mutante> pooled ulsfo again - Lumen back up - Arelion still working [production]
19:19 <dzahn@cumin2002> END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: pool site ulsfo [reason: no reason specified, ] [production]
19:19 <dzahn@cumin2002> START - Cookbook sre.dns.admin DNS admin: pool site ulsfo [reason: no reason specified, ] [production]
19:15 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P82657 and previous config saved to /var/cache/conftool/dbconfig/20250905-191500-fceratto.json [production]
18:59 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P82656 and previous config saved to /var/cache/conftool/dbconfig/20250905-185952-fceratto.json [production]
18:44 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T401906)', diff saved to https://phabricator.wikimedia.org/P82655 and previous config saved to /var/cache/conftool/dbconfig/20250905-184445-fceratto.json [production]
18:42 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2237 (T402925)', diff saved to https://phabricator.wikimedia.org/P82654 and previous config saved to /var/cache/conftool/dbconfig/20250905-184245-ladsgroup.json [production]
18:42 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2237.codfw.wmnet with reason: Maintenance [production]