|
2026-03-03
ยง
|
| 11:43 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1354.eqiad.wmnet with reason: host reimage |
[production] |
| 11:43 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2210', diff saved to https://phabricator.wikimedia.org/P89684 and previous config saved to /var/cache/conftool/dbconfig/20260303-114341-marostegui.json |
[production] |
| 11:43 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1353.eqiad.wmnet with reason: host reimage |
[production] |
| 11:42 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1352.eqiad.wmnet with reason: host reimage |
[production] |
| 11:40 |
<fabfur@cumin1003> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_eqsin and A:cp - 3.0 upgrade () |
[production] |
| 11:36 |
<fabfur@cumin1003> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_eqsin and A:cp - 3.0 upgrade () |
[production] |
| 11:31 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1355.eqiad.wmnet with OS trixie |
[production] |
| 11:31 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1354.eqiad.wmnet with OS trixie |
[production] |
| 11:30 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1353.eqiad.wmnet with OS trixie |
[production] |
| 11:30 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1352.eqiad.wmnet with OS trixie |
[production] |
| 11:28 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2210 (T418465)', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20260303-112828-marostegui.json |
[production] |
| 11:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1241 (T418465)', diff saved to https://phabricator.wikimedia.org/P89683 and previous config saved to /var/cache/conftool/dbconfig/20260303-112535-marostegui.json |
[production] |
| 11:25 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1241.eqiad.wmnet with reason: Maintenance |
[production] |
| 11:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238 (T418465)', diff saved to https://phabricator.wikimedia.org/P89682 and previous config saved to /var/cache/conftool/dbconfig/20260303-112511-marostegui.json |
[production] |
| 11:21 |
<jayme@deploy1003> |
helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:18 |
<jayme@deploy1003> |
helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:18 |
<jayme@deploy1003> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:17 |
<jayme@deploy1003> |
helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 11:17 |
<jayme@deploy1003> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:16 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1350-1351].eqiad.wmnet |
[production] |
| 11:16 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1350-1351].eqiad.wmnet |
[production] |
| 11:15 |
<jayme@deploy1003> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:15 |
<jayme@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:15 |
<jayme@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 11:15 |
<jayme@deploy1003> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:14 |
<jayme@deploy1003> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:14 |
<jayme@deploy1003> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:13 |
<jayme@deploy1003> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:13 |
<jayme@deploy1003> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:13 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host an-worker1172.eqiad.wmnet |
[production] |
| 11:13 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1171.eqiad.wmnet |
[production] |
| 11:13 |
<jayme@deploy1003> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 11:13 |
<jayme@deploy1003> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:12 |
<jayme@deploy1003> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:11 |
<jayme@deploy1003> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P89681 and previous config saved to /var/cache/conftool/dbconfig/20260303-111003-marostegui.json |
[production] |
| 11:09 |
<jayme@deploy1003> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:08 |
<jayme@deploy1003> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:08 |
<jayme@deploy1003> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:07 |
<jayme@deploy1003> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 11:07 |
<jayme@deploy1003> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:06 |
<jayme@deploy1003> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 11:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2210 (T418465)', diff saved to https://phabricator.wikimedia.org/P89680 and previous config saved to /var/cache/conftool/dbconfig/20260303-110551-marostegui.json |
[production] |
| 11:05 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2210.codfw.wmnet with reason: Maintenance |
[production] |
| 11:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2206 (T418465)', diff saved to https://phabricator.wikimedia.org/P89679 and previous config saved to /var/cache/conftool/dbconfig/20260303-110527-marostegui.json |
[production] |
| 10:59 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host an-worker1171.eqiad.wmnet |
[production] |
| 10:59 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1170.eqiad.wmnet |
[production] |
| 10:57 |
<slyngshede@dns1004> |
END - running authdns-update |
[production] |
| 10:55 |
<slyngshede@dns1004> |
START - running authdns-update |
[production] |
| 10:54 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P89678 and previous config saved to /var/cache/conftool/dbconfig/20260303-105455-marostegui.json |
[production] |