301-350 of 10000 results (29ms)
2025-10-01 ยง
12:29 <cgoubert@cumin1003> conftool action : set/pooled=true; selector: dnsdisc=thumbor.*,name=eqiad [production]
12:29 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83554 and previous config saved to /var/cache/conftool/dbconfig/20251001-122936-fceratto.json [production]
12:27 <mvernon@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw [production]
12:21 <ladsgroup@cumin1003> START - Cookbook sre.mysql.pool db1258* gradually with 4 steps - Work done [production]
12:21 <ladsgroup@cumin1003> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1258.eqiad.wmnet [production]
12:19 <mvernon@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-codfw [production]
12:19 <mvernon@cumin2002> END (ERROR) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=97) rolling restart_daemons on A:swift-fe-eqiad [production]
12:19 <mvernon@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad [production]
12:15 <ladsgroup@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1258 - Upgrading db1258.eqiad.wmnet [production]
12:15 <ladsgroup@cumin1003> START - Cookbook sre.mysql.depool db1258 - Upgrading db1258.eqiad.wmnet [production]
12:15 <ladsgroup@cumin1003> START - Cookbook sre.mysql.upgrade for db1258.eqiad.wmnet [production]
12:14 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83552 and previous config saved to /var/cache/conftool/dbconfig/20251001-121429-fceratto.json [production]
12:13 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depool db1258 T406116', diff saved to https://phabricator.wikimedia.org/P83551 and previous config saved to /var/cache/conftool/dbconfig/20251001-121339-ladsgroup.json [production]
12:12 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: sync [production]
12:11 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: sync [production]
12:08 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
12:08 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
12:06 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Promote db1255 to x3 primary T406116', diff saved to https://phabricator.wikimedia.org/P83550 and previous config saved to /var/cache/conftool/dbconfig/20251001-120629-ladsgroup.json [production]
12:06 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
12:06 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
12:05 <Amir1> Starting x3 eqiad failover from db1258 to db1255 - T406116 [production]
12:05 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
12:04 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
12:01 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Set db1255 with weight 0 T406116', diff saved to https://phabricator.wikimedia.org/P83549 and previous config saved to /var/cache/conftool/dbconfig/20251001-120140-ladsgroup.json [production]
12:00 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 16 hosts with reason: Primary switchover x3 T406116 [production]
11:59 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83548 and previous config saved to /var/cache/conftool/dbconfig/20251001-115922-fceratto.json [production]
11:59 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
11:59 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
11:58 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
11:49 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
11:48 <cgoubert@cumin1003> START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster wikikube-eqiad: eqiad Wikikube kubernetes cluster upgrade to 1.31 - T405703 [production]
11:44 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83547 and previous config saved to /var/cache/conftool/dbconfig/20251001-114414-fceratto.json [production]
11:43 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83546 and previous config saved to /var/cache/conftool/dbconfig/20251001-114259-fceratto.json [production]
11:42 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1229.eqiad.wmnet with reason: Maintenance [production]
11:42 <hnowlan> manually bumped thumbor replicas in codfw to 140 [production]
11:42 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]
11:42 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1197 (T401906)', diff saved to https://phabricator.wikimedia.org/P83545 and previous config saved to /var/cache/conftool/dbconfig/20251001-114214-fceratto.json [production]
11:41 <cgoubert@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=thumbor.*,name=eqiad [production]
11:39 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
11:39 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
11:37 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
11:37 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
11:35 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
11:35 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
11:29 <cgoubert@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=swift.*,name=eqiad [production]
11:27 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P83544 and previous config saved to /var/cache/conftool/dbconfig/20251001-112707-fceratto.json [production]
11:25 <Amir1> dropping two unused tables in phabricator db (T403542) [production]
11:18 <cgoubert@cumin1003> conftool action : set/pooled=true; selector: dnsdisc=thumbor.*,name=codfw [production]
11:12 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P83542 and previous config saved to /var/cache/conftool/dbconfig/20251001-111159-fceratto.json [production]
11:05 <cgoubert@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=toolhub.* [production]