|
2025-11-26
ยง
|
| 13:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s3 codfw T411088', diff saved to https://phabricator.wikimedia.org/P85741 and previous config saved to /var/cache/conftool/dbconfig/20251126-131803-marostegui.json |
[production] |
| 13:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in x3 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85740 and previous config saved to /var/cache/conftool/dbconfig/20251126-131606-marostegui.json |
[production] |
| 13:15 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s7 and s8 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85739 and previous config saved to /var/cache/conftool/dbconfig/20251126-131512-marostegui.json |
[production] |
| 13:13 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s6 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85738 and previous config saved to /var/cache/conftool/dbconfig/20251126-131304-marostegui.json |
[production] |
| 13:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s5 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85736 and previous config saved to /var/cache/conftool/dbconfig/20251126-131110-marostegui.json |
[production] |
| 13:11 |
<fceratto@cumin1003> |
START - Cookbook sre.mysql.pool db2166 gradually with 4 steps - Repooling |
[production] |
| 13:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s4 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85735 and previous config saved to /var/cache/conftool/dbconfig/20251126-131018-marostegui.json |
[production] |
| 13:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s2 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85734 and previous config saved to /var/cache/conftool/dbconfig/20251126-130856-marostegui.json |
[production] |
| 13:07 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s1 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85733 and previous config saved to /var/cache/conftool/dbconfig/20251126-130757-marostegui.json |
[production] |
| 13:06 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P85731 and previous config saved to /var/cache/conftool/dbconfig/20251126-130630-marostegui.json |
[production] |
| 13:06 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Unify weights in s3 codfw T408663', diff saved to https://phabricator.wikimedia.org/P85730 and previous config saved to /var/cache/conftool/dbconfig/20251126-130620-marostegui.json |
[production] |
| 13:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s8 T411088', diff saved to https://phabricator.wikimedia.org/P85729 and previous config saved to /var/cache/conftool/dbconfig/20251126-130255-marostegui.json |
[production] |
| 13:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s7 T411088', diff saved to https://phabricator.wikimedia.org/P85728 and previous config saved to /var/cache/conftool/dbconfig/20251126-130237-marostegui.json |
[production] |
| 13:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s6 T411088', diff saved to https://phabricator.wikimedia.org/P85727 and previous config saved to /var/cache/conftool/dbconfig/20251126-130220-marostegui.json |
[production] |
| 13:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s5 T411088', diff saved to https://phabricator.wikimedia.org/P85726 and previous config saved to /var/cache/conftool/dbconfig/20251126-130202-marostegui.json |
[production] |
| 12:53 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
| 12:51 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply |
[production] |
| 12:51 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply |
[production] |
| 12:50 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1231 (T410531)', diff saved to https://phabricator.wikimedia.org/P85725 and previous config saved to /var/cache/conftool/dbconfig/20251126-125049-marostegui.json |
[production] |
| 12:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1231 (T410531)', diff saved to https://phabricator.wikimedia.org/P85724 and previous config saved to /var/cache/conftool/dbconfig/20251126-124838-marostegui.json |
[production] |
| 12:48 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1039.eqiad.wmnet |
[production] |
| 12:48 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1231.eqiad.wmnet with reason: Maintenance |
[production] |
| 12:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1227 (T410531)', diff saved to https://phabricator.wikimedia.org/P85723 and previous config saved to /var/cache/conftool/dbconfig/20251126-124815-marostegui.json |
[production] |
| 12:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1039.eqiad.wmnet |
[production] |
| 12:46 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s4 T411088', diff saved to https://phabricator.wikimedia.org/P85722 and previous config saved to /var/cache/conftool/dbconfig/20251126-124609-marostegui.json |
[production] |
| 12:45 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1039.eqiad.wmnet |
[production] |
| 12:44 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s2 T411088', diff saved to https://phabricator.wikimedia.org/P85721 and previous config saved to /var/cache/conftool/dbconfig/20251126-124441-marostegui.json |
[production] |
| 12:43 |
<root@cumin2002> |
DONE (FAIL) - Cookbook sre.puppet.renew-cert (exit_code=99) for backup2014.codfw.wmnet: Renew puppet certificate - root@cumin2002 |
[production] |
| 12:38 |
<root@cumin2002> |
DONE (FAIL) - Cookbook sre.puppet.renew-cert (exit_code=99) for backup2014.codfw.wmnet: Renew puppet certificate - root@cumin2002 |
[production] |
| 12:35 |
<mvolz@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/citoid: apply |
[production] |
| 12:35 |
<mvolz@deploy2002> |
helmfile [eqiad] START helmfile.d/services/citoid: apply |
[production] |
| 12:33 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P85720 and previous config saved to /var/cache/conftool/dbconfig/20251126-123307-marostegui.json |
[production] |
| 12:31 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s1 T411088', diff saved to https://phabricator.wikimedia.org/P85719 and previous config saved to /var/cache/conftool/dbconfig/20251126-123131-marostegui.json |
[production] |
| 12:29 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
| 12:27 |
<cmooney@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 29357 |
[production] |
| 12:27 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'. |
[production] |
| 12:27 |
<cmooney@cumin1003> |
START - Cookbook sre.network.peering with action 'configure' for AS: 29357 |
[production] |
| 12:27 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove vslow/dump from s3 T411088', diff saved to https://phabricator.wikimedia.org/P85717 and previous config saved to /var/cache/conftool/dbconfig/20251126-122703-marostegui.json |
[production] |
| 12:22 |
<mvolz@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/citoid: apply |
[production] |
| 12:21 |
<mvolz@deploy2002> |
helmfile [eqiad] START helmfile.d/services/citoid: apply |
[production] |
| 12:20 |
<mvolz@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/citoid: apply |
[production] |
| 12:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P85716 and previous config saved to /var/cache/conftool/dbconfig/20251126-121759-marostegui.json |
[production] |
| 12:15 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.wdqs.restart-nginx-envoy (exit_code=0) rolling restart_daemons on A:wdqs-all |
[production] |
| 12:12 |
<mvolz@deploy2002> |
helmfile [codfw] START helmfile.d/services/citoid: apply |
[production] |
| 12:10 |
<root@cumin2002> |
DONE (FAIL) - Cookbook sre.puppet.renew-cert (exit_code=99) for backup2014.codfw.wmnet: Renew puppet certificate - root@cumin2002 |
[production] |
| 12:09 |
<mvolz@deploy2002> |
helmfile [staging] DONE helmfile.d/services/citoid: apply |
[production] |
| 12:09 |
<mvolz@deploy2002> |
helmfile [staging] START helmfile.d/services/citoid: apply |
[production] |
| 12:06 |
<claime> |
Starting kafka-main rebalance with 30MB/s throttle - T407185 |
[production] |
| 12:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1227 (T410531)', diff saved to https://phabricator.wikimedia.org/P85713 and previous config saved to /var/cache/conftool/dbconfig/20251126-120252-marostegui.json |
[production] |
| 12:02 |
<jmm@cumin2002> |
START - Cookbook sre.wdqs.restart-nginx-envoy rolling restart_daemons on A:wdqs-all |
[production] |