| 2025-10-01 |
  | 12:19 | <mvernon@cumin2002> | START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad | [production] | 
            
  | 12:15 | <ladsgroup@cumin1003> | END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1258 - Upgrading db1258.eqiad.wmnet | [production] | 
            
  | 12:15 | <ladsgroup@cumin1003> | START - Cookbook sre.mysql.depool db1258 - Upgrading db1258.eqiad.wmnet | [production] | 
            
  | 12:15 | <ladsgroup@cumin1003> | START - Cookbook sre.mysql.upgrade for db1258.eqiad.wmnet | [production] | 
            
  | 12:14 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83552 and previous config saved to /var/cache/conftool/dbconfig/20251001-121429-fceratto.json | [production] | 
            
  | 12:13 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Depool db1258 T406116', diff saved to https://phabricator.wikimedia.org/P83551 and previous config saved to /var/cache/conftool/dbconfig/20251001-121339-ladsgroup.json | [production] | 
            
  | 12:12 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: sync | [production] | 
            
  | 12:11 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: sync | [production] | 
            
  | 12:08 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 12:08 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 12:06 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Promote db1255 to x3 primary T406116', diff saved to https://phabricator.wikimedia.org/P83550 and previous config saved to /var/cache/conftool/dbconfig/20251001-120629-ladsgroup.json | [production] | 
            
  | 12:06 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 12:06 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 12:05 | <Amir1> | Starting x3 eqiad failover from db1258 to db1255 - T406116 | [production] | 
            
  | 12:05 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 12:04 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 12:01 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Set db1255 with weight 0 T406116', diff saved to https://phabricator.wikimedia.org/P83549 and previous config saved to /var/cache/conftool/dbconfig/20251001-120140-ladsgroup.json | [production] | 
            
  | 12:00 | <ladsgroup@cumin1003> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 16 hosts with reason: Primary switchover x3 T406116 | [production] | 
            
  | 11:59 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83548 and previous config saved to /var/cache/conftool/dbconfig/20251001-115922-fceratto.json | [production] | 
            
  | 11:59 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:59 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:58 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:49 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:48 | <cgoubert@cumin1003> | START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster wikikube-eqiad: eqiad Wikikube kubernetes cluster upgrade to 1.31 - T405703 | [production] | 
            
  | 11:44 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83547 and previous config saved to /var/cache/conftool/dbconfig/20251001-114414-fceratto.json | [production] | 
            
  | 11:43 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Depooling db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83546 and previous config saved to /var/cache/conftool/dbconfig/20251001-114259-fceratto.json | [production] | 
            
  | 11:42 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1229.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 11:42 | <hnowlan> | manually bumped thumbor replicas in codfw to 140 | [production] | 
            
  | 11:42 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1225.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 11:42 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1197 (T401906)', diff saved to https://phabricator.wikimedia.org/P83545 and previous config saved to /var/cache/conftool/dbconfig/20251001-114214-fceratto.json | [production] | 
            
  | 11:41 | <cgoubert@cumin1003> | conftool action : set/pooled=false; selector: dnsdisc=thumbor.*,name=eqiad | [production] | 
            
  | 11:39 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:39 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:37 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:37 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:35 | <hnowlan@deploy2002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:35 | <hnowlan@deploy2002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:29 | <cgoubert@cumin1003> | conftool action : set/pooled=false; selector: dnsdisc=swift.*,name=eqiad | [production] | 
            
  | 11:27 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P83544 and previous config saved to /var/cache/conftool/dbconfig/20251001-112707-fceratto.json | [production] | 
            
  | 11:25 | <Amir1> | dropping two unused tables in phabricator db (T403542) | [production] | 
            
  | 11:18 | <cgoubert@cumin1003> | conftool action : set/pooled=true; selector: dnsdisc=thumbor.*,name=codfw | [production] | 
            
  | 11:12 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P83542 and previous config saved to /var/cache/conftool/dbconfig/20251001-111159-fceratto.json | [production] | 
            
  | 11:05 | <cgoubert@cumin1003> | conftool action : set/pooled=false; selector: dnsdisc=toolhub.* | [production] | 
            
  | 11:04 | <cgoubert@cumin1003> | END (FAIL) - Cookbook sre.discovery.service-route (exit_code=99) depool toolhub in eqiad: maintenance | [production] | 
            
  | 11:04 | <cgoubert@cumin1003> | START - Cookbook sre.discovery.service-route depool toolhub in eqiad: maintenance | [production] | 
            
  | 11:03 | <cgoubert@deploy2002> | Locking from deployment [ALL REPOSITORIES]: eqiad Wikikube kubernetes cluster upgrade to 1.31 - T405703 | [production] | 
            
  | 11:03 | <cgoubert@deploy2002> | helmfile [codfw] DONE helmfile.d/services/zotero: apply | [production] | 
            
  | 11:03 | <cgoubert@deploy2002> | helmfile [codfw] START helmfile.d/services/zotero: apply | [production] | 
            
  | 11:03 | <cgoubert@deploy2002> | helmfile [eqiad] DONE helmfile.d/services/zotero: apply | [production] | 
            
  | 11:03 | <cgoubert@deploy2002> | helmfile [staging] DONE helmfile.d/services/zotero: apply | [production] |