|
2026-05-21
ยง
|
| 09:28 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode2001.codfw.wmnet |
[production] |
| 09:27 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance es1037', diff saved to https://phabricator.wikimedia.org/P92741 and previous config saved to /var/cache/conftool/dbconfig/20260521-092746-fceratto.json |
[production] |
| 09:27 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wikikube-worker1380.eqiad.wmnet |
[production] |
| 09:27 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wikikube-worker1379.eqiad.wmnet |
[production] |
| 09:27 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dragonfly-supernode1001.eqiad.wmnet |
[production] |
| 09:26 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti1023.eqiad.wmnet |
[production] |
| 09:25 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host chartmuseum2001.codfw.wmnet |
[production] |
| 09:24 |
<jayme@cumin1003> |
conftool action : set/pooled=false; selector: dnsdisc=helm-charts.*,name=codfw |
[production] |
| 09:23 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti1056.eqiad.wmnet to cluster eqiad and group A |
[production] |
| 09:23 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode1001.eqiad.wmnet |
[production] |
| 09:22 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-ctrl1002.eqiad.wmnet |
[production] |
| 09:22 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-ctrl1002.eqiad.wmnet |
[production] |
| 09:22 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:wikikube-master-eqiad |
[production] |
| 09:22 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wikikube-worker1379.eqiad.wmnet |
[production] |
| 09:22 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wikikube-worker1378.eqiad.wmnet |
[production] |
| 09:21 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-ctrl2001.codfw.wmnet |
[production] |
| 09:21 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-ctrl2001.codfw.wmnet |
[production] |
| 09:21 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:wikikube-master-codfw |
[production] |
| 09:21 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti1056.eqiad.wmnet to cluster eqiad and group A |
[production] |
| 09:20 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reimage for host kafka-jumbo1016.eqiad.wmnet with OS trixie |
[production] |
| 09:18 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-jumbo1016.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 09:18 |
<moritzm> |
remove ganeti1023 foom eqiad Ganeti cluster T424680 |
[production] |
| 09:17 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance es1037 (T426633)', diff saved to https://phabricator.wikimedia.org/P92740 and previous config saved to /var/cache/conftool/dbconfig/20260521-091738-fceratto.json |
[production] |
| 09:16 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wikikube-worker1378.eqiad.wmnet |
[production] |
| 09:16 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wikikube-worker1377.eqiad.wmnet |
[production] |
| 09:12 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wikikube-worker1377.eqiad.wmnet |
[production] |
| 09:12 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wikikube-worker1376.eqiad.wmnet |
[production] |
| 09:07 |
<fceratto@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es1036: Repooling |
[production] |
| 09:07 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wikikube-worker1376.eqiad.wmnet |
[production] |
| 09:07 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wikikube-worker1375.eqiad.wmnet |
[production] |
| 09:06 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling es1037 (T426633)', diff saved to https://phabricator.wikimedia.org/P92738 and previous config saved to /var/cache/conftool/dbconfig/20260521-090609-fceratto.json |
[production] |
| 09:06 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1037.eqiad.wmnet with reason: Maintenance |
[production] |
| 09:02 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wikikube-worker1375.eqiad.wmnet |
[production] |
| 09:01 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.provision for host kafka-jumbo1016.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 08:55 |
<slyngshede@cumin1003> |
cookbooks.sre.cdn.roll-reboot finished rebooting cp6011.drmrs.wmnet |
[production] |
| 08:49 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1023.eqiad.wmnet |
[production] |
| 08:47 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) |
[production] |
| 08:47 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1256: Migration of db1256.eqiad.wmnet completed |
[production] |
| 08:44 |
<slyngshede@cumin1003> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp601[1-2].drmrs.wmnet} and A:cp |
[production] |
| 08:42 |
<slyngshede@cumin1003> |
END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp600[3-4].drmrs.wmnet} and A:cp |
[production] |
| 08:42 |
<slyngshede@cumin1003> |
cookbooks.sre.cdn.roll-reboot finished rebooting cp6004.drmrs.wmnet |
[production] |
| 08:37 |
<fceratto@cumin1003> |
START - Cookbook sre.mysql.pool pool es1036: Repooling |
[production] |
| 08:29 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance es1036 (T426633)', diff saved to https://phabricator.wikimedia.org/P92733 and previous config saved to /var/cache/conftool/dbconfig/20260521-082951-fceratto.json |
[production] |
| 08:29 |
<hashar@deploy1003> |
rebuilt and synchronized wikiversions files: group2 to 1.47.0-wmf.3 refs T423912 |
[production] |
| 08:16 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling es1036 (T426633)', diff saved to https://phabricator.wikimedia.org/P92731 and previous config saved to /var/cache/conftool/dbconfig/20260521-081642-fceratto.json |
[production] |
| 08:16 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1036.eqiad.wmnet with reason: Maintenance |
[production] |
| 08:02 |
<cwilliams@cumin1003> |
START - Cookbook sre.mysql.pool pool db1256: Migration of db1256.eqiad.wmnet completed |
[production] |
| 08:01 |
<slyngshede@cumin1003> |
cookbooks.sre.cdn.roll-reboot finished rebooting cp6003.drmrs.wmnet |
[production] |
| 08:00 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1256.eqiad.wmnet with OS trixie |
[production] |
| 07:52 |
<slyngshede@cumin1003> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp600[3-4].drmrs.wmnet} and A:cp |
[production] |