2025-10-01
ยง
|
12:14 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83552 and previous config saved to /var/cache/conftool/dbconfig/20251001-121429-fceratto.json |
[production] |
12:13 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depool db1258 T406116', diff saved to https://phabricator.wikimedia.org/P83551 and previous config saved to /var/cache/conftool/dbconfig/20251001-121339-ladsgroup.json |
[production] |
12:12 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: sync |
[production] |
12:11 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: sync |
[production] |
12:08 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
12:08 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
12:06 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Promote db1255 to x3 primary T406116', diff saved to https://phabricator.wikimedia.org/P83550 and previous config saved to /var/cache/conftool/dbconfig/20251001-120629-ladsgroup.json |
[production] |
12:06 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
12:06 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
12:05 |
<Amir1> |
Starting x3 eqiad failover from db1258 to db1255 - T406116 |
[production] |
12:05 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
12:04 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
12:01 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Set db1255 with weight 0 T406116', diff saved to https://phabricator.wikimedia.org/P83549 and previous config saved to /var/cache/conftool/dbconfig/20251001-120140-ladsgroup.json |
[production] |
12:00 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 16 hosts with reason: Primary switchover x3 T406116 |
[production] |
11:59 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83548 and previous config saved to /var/cache/conftool/dbconfig/20251001-115922-fceratto.json |
[production] |
11:59 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
11:59 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
11:58 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
11:49 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
11:48 |
<cgoubert@cumin1003> |
START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster wikikube-eqiad: eqiad Wikikube kubernetes cluster upgrade to 1.31 - T405703 |
[production] |
11:44 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83547 and previous config saved to /var/cache/conftool/dbconfig/20251001-114414-fceratto.json |
[production] |
11:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83546 and previous config saved to /var/cache/conftool/dbconfig/20251001-114259-fceratto.json |
[production] |
11:42 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1229.eqiad.wmnet with reason: Maintenance |
[production] |
11:42 |
<hnowlan> |
manually bumped thumbor replicas in codfw to 140 |
[production] |
11:42 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1225.eqiad.wmnet with reason: Maintenance |
[production] |
11:42 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1197 (T401906)', diff saved to https://phabricator.wikimedia.org/P83545 and previous config saved to /var/cache/conftool/dbconfig/20251001-114214-fceratto.json |
[production] |
11:41 |
<cgoubert@cumin1003> |
conftool action : set/pooled=false; selector: dnsdisc=thumbor.*,name=eqiad |
[production] |
11:39 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
11:39 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
11:37 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
11:37 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
11:35 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
11:35 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
11:29 |
<cgoubert@cumin1003> |
conftool action : set/pooled=false; selector: dnsdisc=swift.*,name=eqiad |
[production] |
11:27 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P83544 and previous config saved to /var/cache/conftool/dbconfig/20251001-112707-fceratto.json |
[production] |
11:25 |
<Amir1> |
dropping two unused tables in phabricator db (T403542) |
[production] |
11:18 |
<cgoubert@cumin1003> |
conftool action : set/pooled=true; selector: dnsdisc=thumbor.*,name=codfw |
[production] |
11:12 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P83542 and previous config saved to /var/cache/conftool/dbconfig/20251001-111159-fceratto.json |
[production] |
11:05 |
<cgoubert@cumin1003> |
conftool action : set/pooled=false; selector: dnsdisc=toolhub.* |
[production] |
11:04 |
<cgoubert@cumin1003> |
END (FAIL) - Cookbook sre.discovery.service-route (exit_code=99) depool toolhub in eqiad: maintenance |
[production] |
11:04 |
<cgoubert@cumin1003> |
START - Cookbook sre.discovery.service-route depool toolhub in eqiad: maintenance |
[production] |
11:03 |
<cgoubert@deploy2002> |
Locking from deployment [ALL REPOSITORIES]: eqiad Wikikube kubernetes cluster upgrade to 1.31 - T405703 |
[production] |
11:03 |
<cgoubert@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/zotero: apply |
[production] |
11:03 |
<cgoubert@deploy2002> |
helmfile [codfw] START helmfile.d/services/zotero: apply |
[production] |
11:03 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/zotero: apply |
[production] |
11:03 |
<cgoubert@deploy2002> |
helmfile [staging] DONE helmfile.d/services/zotero: apply |
[production] |
11:02 |
<cgoubert@deploy2002> |
helmfile [staging] START helmfile.d/services/zotero: apply |
[production] |
11:02 |
<cgoubert@deploy2002> |
helmfile [eqiad] START helmfile.d/services/zotero: apply |
[production] |
11:02 |
<cgoubert@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/ratelimit: apply |
[production] |
11:01 |
<cgoubert@deploy2002> |
helmfile [codfw] START helmfile.d/services/ratelimit: apply |
[production] |