|
2026-03-03
|
| 09:43 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
| 09:40 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'article-models' for release 'main' . |
[production] |
| 09:38 |
<fceratto@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1176.eqiad.wmnet with reason: host reimage |
[production] |
| 09:35 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2199.codfw.wmnet with reason: Maintenance |
[production] |
| 09:35 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172 (T418465)', diff saved to https://phabricator.wikimedia.org/P89668 and previous config saved to /var/cache/conftool/dbconfig/20260303-093542-marostegui.json |
[production] |
| 09:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1221 (T418465)', diff saved to https://phabricator.wikimedia.org/P89667 and previous config saved to /var/cache/conftool/dbconfig/20260303-093224-marostegui.json |
[production] |
| 09:32 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'article-descriptions' for release 'main' . |
[production] |
| 09:23 |
<fabfur@cumin1003> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_ulsfo and A:cp - 3.0 upgrade () |
[production] |
| 09:23 |
<fceratto@cumin1003> |
START - Cookbook sre.hosts.reimage for host db1176.eqiad.wmnet with OS trixie |
[production] |
| 09:21 |
<fceratto@cumin1003> |
END (PASS) - Cookbook sre.mysql.update-replication (exit_code=0) |
[production] |
| 09:20 |
<fceratto@cumin1003> |
START - Cookbook sre.mysql.update-replication |
[production] |
| 09:20 |
<fabfur@cumin1003> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_ulsfo and A:cp - 3.0 upgrade () |
[production] |
| 09:20 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P89666 and previous config saved to /var/cache/conftool/dbconfig/20260303-092034-marostegui.json |
[production] |
| 09:19 |
<arnaudb@dns1004> |
END - running authdns-update |
[production] |
| 09:18 |
<arnaudb@dns1004> |
START - running authdns-update |
[production] |
| 09:17 |
<moritzm> |
installing libbpf updates from Bookworm point release |
[production] |
| 09:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1221 (T418465)', diff saved to https://phabricator.wikimedia.org/P89665 and previous config saved to /var/cache/conftool/dbconfig/20260303-090818-marostegui.json |
[production] |
| 09:08 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on 6 hosts with reason: Maintenance |
[production] |
| 09:07 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1221.eqiad.wmnet with reason: Maintenance |
[production] |
| 09:07 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1199 (T418465)', diff saved to https://phabricator.wikimedia.org/P89664 and previous config saved to /var/cache/conftool/dbconfig/20260303-090731-marostegui.json |
[production] |
| 09:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P89663 and previous config saved to /var/cache/conftool/dbconfig/20260303-090526-marostegui.json |
[production] |
| 08:54 |
<dpogorzelski@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-cluster (exit_code=0) pool all services in codfw/ml-serve-codfw: maintenance |
[production] |
| 08:53 |
<dpogorzelski@cumin1003> |
START - Cookbook sre.k8s.pool-depool-cluster pool all services in codfw/ml-serve-codfw: maintenance |
[production] |
| 08:52 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P89662 and previous config saved to /var/cache/conftool/dbconfig/20260303-085224-marostegui.json |
[production] |
| 08:50 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172 (T418465)', diff saved to https://phabricator.wikimedia.org/P89661 and previous config saved to /var/cache/conftool/dbconfig/20260303-085019-marostegui.json |
[production] |
| 08:47 |
<moritzm> |
powercycling lvs1013 |
[production] |
| 08:41 |
<fabfur@cumin1003> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_ulsfo and A:cp - 3.0 upgrade () |
[production] |
| 08:41 |
<fabfur@cumin1003> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_ulsfo and A:cp - 3.0 upgrade () |
[production] |
| 08:37 |
<fabfur> |
start upgrading haproxy to 3.0 on A:cp-ulsfo (T417253) |
[production] |
| 08:37 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P89660 and previous config saved to /var/cache/conftool/dbconfig/20260303-083716-marostegui.json |
[production] |
| 08:32 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . |
[production] |
| 08:32 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 08:31 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 08:30 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 08:28 |
<dpogorzelski@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-cluster (exit_code=0) depool all services in codfw/ml-serve-codfw: maintenance |
[production] |
| 08:27 |
<dpogorzelski@cumin1003> |
START - Cookbook sre.k8s.pool-depool-cluster depool all services in codfw/ml-serve-codfw: maintenance |
[production] |
| 08:24 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2172 (T418465)', diff saved to https://phabricator.wikimedia.org/P89659 and previous config saved to /var/cache/conftool/dbconfig/20260303-082424-marostegui.json |
[production] |
| 08:24 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2172.codfw.wmnet with reason: Maintenance |
[production] |
| 08:24 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2155 (T418465)', diff saved to https://phabricator.wikimedia.org/P89658 and previous config saved to /var/cache/conftool/dbconfig/20260303-082400-marostegui.json |
[production] |
| 08:22 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1199 (T418465)', diff saved to https://phabricator.wikimedia.org/P89657 and previous config saved to /var/cache/conftool/dbconfig/20260303-082209-marostegui.json |
[production] |
| 08:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P89656 and previous config saved to /var/cache/conftool/dbconfig/20260303-080853-marostegui.json |
[production] |
| 08:07 |
<moritzm> |
installing PAM security updates on Bookworm |
[production] |
| 07:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1199 (T418465)', diff saved to https://phabricator.wikimedia.org/P89655 and previous config saved to /var/cache/conftool/dbconfig/20260303-075526-marostegui.json |
[production] |
| 07:55 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1199.eqiad.wmnet with reason: Maintenance |
[production] |
| 07:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190 (T418465)', diff saved to https://phabricator.wikimedia.org/P89654 and previous config saved to /var/cache/conftool/dbconfig/20260303-075502-marostegui.json |
[production] |
| 07:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P89653 and previous config saved to /var/cache/conftool/dbconfig/20260303-075345-marostegui.json |
[production] |
| 07:39 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P89652 and previous config saved to /var/cache/conftool/dbconfig/20260303-073955-marostegui.json |
[production] |
| 07:38 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2155 (T418465)', diff saved to https://phabricator.wikimedia.org/P89651 and previous config saved to /var/cache/conftool/dbconfig/20260303-073838-marostegui.json |
[production] |
| 07:24 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P89650 and previous config saved to /var/cache/conftool/dbconfig/20260303-072447-marostegui.json |
[production] |
| 07:20 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1051.eqiad.wmnet |
[production] |