101-150 of 10000 results (79ms)
2025-09-04 ยง
15:22 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:22 <moritzm> upgrade Envoyproxy on cloudweb servers T402584 [production]
15:22 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:20 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:18 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:17 <moritzm> installing apache2 security updates [production]
15:17 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P82576 and previous config saved to /var/cache/conftool/dbconfig/20250904-151729-fceratto.json [production]
15:16 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:13 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1236.eqiad.wmnet with OS bullseye [production]
15:12 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2155 (T402925)', diff saved to https://phabricator.wikimedia.org/P82575 and previous config saved to /var/cache/conftool/dbconfig/20250904-151235-ladsgroup.json [production]
15:12 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
15:12 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147 (T402925)', diff saved to https://phabricator.wikimedia.org/P82574 and previous config saved to /var/cache/conftool/dbconfig/20250904-151223-ladsgroup.json [production]
15:11 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1235.eqiad.wmnet with OS bullseye [production]
15:06 <tappof@dns1004> END - running authdns-update [production]
15:05 <tappof@dns1004> START - running authdns-update [production]
15:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82573 and previous config saved to /var/cache/conftool/dbconfig/20250904-150221-fceratto.json [production]
15:02 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:00 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:00 <jhancock@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82572 and previous config saved to /var/cache/conftool/dbconfig/20250904-150011-fceratto.json [production]
15:00 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2188.codfw.wmnet with reason: Maintenance [production]
14:59 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82571 and previous config saved to /var/cache/conftool/dbconfig/20250904-145948-fceratto.json [production]
14:57 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P82570 and previous config saved to /var/cache/conftool/dbconfig/20250904-145716-ladsgroup.json [production]
14:54 <jhancock@cumin1002> START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
14:52 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:51 <moritzm> upgrade Envoyproxy on Puppet servers T402584 [production]
14:51 <XioNoX> disable OSPF on mr1-ulsfo to test BGP [production]
14:46 <pt1979@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mr1-ulsfo with reason: Bgp testing [production]
14:44 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P82569 and previous config saved to /var/cache/conftool/dbconfig/20250904-144441-fceratto.json [production]
14:44 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir3005.esams.wmnet to drbd [production]
14:42 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P82567 and previous config saved to /var/cache/conftool/dbconfig/20250904-144208-ladsgroup.json [production]
14:41 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:34 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir3005.esams.wmnet to drbd [production]
14:31 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti3007.esams.wmnet to cluster esams03 and group B [production]
14:29 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P82566 and previous config saved to /var/cache/conftool/dbconfig/20250904-142933-fceratto.json [production]
14:28 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti3007.esams.wmnet to cluster esams03 and group B [production]
14:27 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2147 (T402925)', diff saved to https://phabricator.wikimedia.org/P82565 and previous config saved to /var/cache/conftool/dbconfig/20250904-142701-ladsgroup.json [production]
14:25 <moritzm> upgrade Envoyproxy on webperf* T402584 [production]
14:25 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:23 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3007.esams.wmnet [production]
14:14 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:14 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82564 and previous config saved to /var/cache/conftool/dbconfig/20250904-141426-fceratto.json [production]
14:13 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti3007.esams.wmnet [production]
14:12 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82562 and previous config saved to /var/cache/conftool/dbconfig/20250904-141215-fceratto.json [production]
14:12 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2176.codfw.wmnet with reason: Maintenance [production]
14:11 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174 (T401906)', diff saved to https://phabricator.wikimedia.org/P82561 and previous config saved to /var/cache/conftool/dbconfig/20250904-141152-fceratto.json [production]
14:11 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephosd1052.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
14:02 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 270735 [production]
14:01 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'configure' for AS: 270735 [production]
14:00 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]