2025-09-04
ยง
|
15:12 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2155 (T402925)', diff saved to https://phabricator.wikimedia.org/P82575 and previous config saved to /var/cache/conftool/dbconfig/20250904-151235-ladsgroup.json |
[production] |
15:12 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2155.codfw.wmnet with reason: Maintenance |
[production] |
15:12 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2147 (T402925)', diff saved to https://phabricator.wikimedia.org/P82574 and previous config saved to /var/cache/conftool/dbconfig/20250904-151223-ladsgroup.json |
[production] |
15:11 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reimage for host an-worker1235.eqiad.wmnet with OS bullseye |
[production] |
15:06 |
<tappof@dns1004> |
END - running authdns-update |
[production] |
15:05 |
<tappof@dns1004> |
START - running authdns-update |
[production] |
15:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82573 and previous config saved to /var/cache/conftool/dbconfig/20250904-150221-fceratto.json |
[production] |
15:02 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
15:00 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
15:00 |
<jhancock@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
15:00 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2188 (T401906)', diff saved to https://phabricator.wikimedia.org/P82572 and previous config saved to /var/cache/conftool/dbconfig/20250904-150011-fceratto.json |
[production] |
15:00 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2188.codfw.wmnet with reason: Maintenance |
[production] |
14:59 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82571 and previous config saved to /var/cache/conftool/dbconfig/20250904-145948-fceratto.json |
[production] |
14:57 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P82570 and previous config saved to /var/cache/conftool/dbconfig/20250904-145716-ladsgroup.json |
[production] |
14:54 |
<jhancock@cumin1002> |
START - Cookbook sre.hosts.provision for host cp2044.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
14:52 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
14:51 |
<moritzm> |
upgrade Envoyproxy on Puppet servers T402584 |
[production] |
14:51 |
<XioNoX> |
disable OSPF on mr1-ulsfo to test BGP |
[production] |
14:46 |
<pt1979@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mr1-ulsfo with reason: Bgp testing |
[production] |
14:44 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P82569 and previous config saved to /var/cache/conftool/dbconfig/20250904-144441-fceratto.json |
[production] |
14:44 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir3005.esams.wmnet to drbd |
[production] |
14:42 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P82567 and previous config saved to /var/cache/conftool/dbconfig/20250904-144208-ladsgroup.json |
[production] |
14:41 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
14:34 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir3005.esams.wmnet to drbd |
[production] |
14:31 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti3007.esams.wmnet to cluster esams03 and group B |
[production] |
14:29 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P82566 and previous config saved to /var/cache/conftool/dbconfig/20250904-142933-fceratto.json |
[production] |
14:28 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti3007.esams.wmnet to cluster esams03 and group B |
[production] |
14:27 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2147 (T402925)', diff saved to https://phabricator.wikimedia.org/P82565 and previous config saved to /var/cache/conftool/dbconfig/20250904-142701-ladsgroup.json |
[production] |
14:25 |
<moritzm> |
upgrade Envoyproxy on webperf* T402584 |
[production] |
14:25 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
14:23 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3007.esams.wmnet |
[production] |
14:14 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
14:14 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82564 and previous config saved to /var/cache/conftool/dbconfig/20250904-141426-fceratto.json |
[production] |
14:13 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti3007.esams.wmnet |
[production] |
14:12 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2176 (T401906)', diff saved to https://phabricator.wikimedia.org/P82562 and previous config saved to /var/cache/conftool/dbconfig/20250904-141215-fceratto.json |
[production] |
14:12 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2176.codfw.wmnet with reason: Maintenance |
[production] |
14:11 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2174 (T401906)', diff saved to https://phabricator.wikimedia.org/P82561 and previous config saved to /var/cache/conftool/dbconfig/20250904-141152-fceratto.json |
[production] |
14:11 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudcephosd1052.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
14:02 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 270735 |
[production] |
14:01 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'configure' for AS: 270735 |
[production] |
14:00 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
13:57 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from dumpsdata1007 to an-worker1236 |
[production] |
13:57 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1236 |
[production] |
13:56 |
<btullis@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host an-worker1236 |
[production] |
13:56 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) an-worker1236 on all recursors |
[production] |
13:56 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P82560 and previous config saved to /var/cache/conftool/dbconfig/20250904-135645-fceratto.json |
[production] |
13:56 |
<btullis@cumin1003> |
START - Cookbook sre.dns.wipe-cache an-worker1236 on all recursors |
[production] |
13:56 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
13:56 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming dumpsdata1007 to an-worker1236 - btullis@cumin1003" |
[production] |
13:56 |
<btullis@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming dumpsdata1007 to an-worker1236 - btullis@cumin1003" |
[production] |