| 2025-01-20 |
  | 15:18 | <jelto@cumin1002> | END (FAIL) - Cookbook sre.k8s.pool-depool-node (exit_code=99) depool for host mw2282.codfw.wmnet | [production] | 
            
  | 15:18 | <jelto@cumin1002> | START - Cookbook sre.k8s.pool-depool-node depool for host mw2282.codfw.wmnet | [production] | 
            
  | 15:18 | <jelto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on mw2282.codfw.wmnet with reason: decommissioning host | [production] | 
            
  | 15:18 | <volans> | issues power off via mgmt UI for db2131 (failed to power off during decommissioning) | [production] | 
            
  | 15:14 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2212 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72162 and previous config saved to /var/cache/conftool/dbconfig/20250120-151402-root.json | [production] | 
            
  | 15:04 | <Lucas_WMDE> | UTC afternoon backport+config window done | [production] | 
            
  | 15:02 | <taavi@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1112333|wikitech: Drop obsolete oauthadmin group (T384122)]], [[gerrit:1112335|wikitech: Drop oathauth group (T384123)]] (duration: 12m 21s) | [production] | 
            
  | 14:58 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2212 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72161 and previous config saved to /var/cache/conftool/dbconfig/20250120-145856-root.json | [production] | 
            
  | 14:56 | <taavi@deploy2002> | taavi: Continuing with sync | [production] | 
            
  | 14:55 | <taavi@deploy2002> | taavi: Backport for [[gerrit:1112333|wikitech: Drop obsolete oauthadmin group (T384122)]], [[gerrit:1112335|wikitech: Drop oathauth group (T384123)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:54 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2024.codfw.wmnet | [production] | 
            
  | 14:53 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2024.codfw.wmnet | [production] | 
            
  | 14:51 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2024.codfw.wmnet | [production] | 
            
  | 14:50 | <taavi@deploy2002> | Started scap sync-world: Backport for [[gerrit:1112333|wikitech: Drop obsolete oauthadmin group (T384122)]], [[gerrit:1112335|wikitech: Drop oathauth group (T384123)]] | [production] | 
            
  | 14:48 | <lucaswerkmeister-wmde@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1112707|Add dedicated experimentation lab test module (T373715)]] (duration: 12m 12s) | [production] | 
            
  | 14:45 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1120.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2212 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72160 and previous config saved to /var/cache/conftool/dbconfig/20250120-144351-root.json | [production] | 
            
  | 14:42 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1119.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:41 | <lucaswerkmeister-wmde@deploy2002> | lucaswerkmeister-wmde, cjming: Continuing with sync | [production] | 
            
  | 14:40 | <lucaswerkmeister-wmde@deploy2002> | lucaswerkmeister-wmde, cjming: Backport for [[gerrit:1112707|Add dedicated experimentation lab test module (T373715)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:40 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1122.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:38 | <vgutierrez@dns1004> | END - running authdns-update | [production] | 
            
  | 14:36 | <vgutierrez@dns1004> | START - running authdns-update | [production] | 
            
  | 14:36 | <lucaswerkmeister-wmde@deploy2002> | Started scap sync-world: Backport for [[gerrit:1112707|Add dedicated experimentation lab test module (T373715)]] | [production] | 
            
  | 14:33 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1118.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:30 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1121.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:30 | <oblivian@deploy2002> | Finished scap sync-world: Backport for [[gerrit:1109109|Use a bespoke database configuration for dumps (T382947)]] (duration: 18m 47s) | [production] | 
            
  | 14:29 | <volans@cumin2002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on sretest1002.eqiad.wmnet with reason: testing cumin | [production] | 
            
  | 14:28 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2212 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72159 and previous config saved to /var/cache/conftool/dbconfig/20250120-142846-root.json | [production] | 
            
  | 14:28 | <volans> | upgraded cumin to v5.0.0 on cumin2002 | [production] | 
            
  | 14:27 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1117.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:27 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1120.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:23 | <oblivian@deploy2002> | oblivian: Continuing with sync | [production] | 
            
  | 14:22 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti2023.codfw.wmnet to cluster codfw and group A | [production] | 
            
  | 14:22 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1119.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:22 | <jmm@cumin2002> | START - Cookbook sre.ganeti.addnode for new host ganeti2023.codfw.wmnet to cluster codfw and group A | [production] | 
            
  | 14:20 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2023.codfw.wmnet | [production] | 
            
  | 14:18 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1122.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:16 | <oblivian@deploy2002> | oblivian: Backport for [[gerrit:1109109|Use a bespoke database configuration for dumps (T382947)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:14 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1118.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:13 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db2212 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P72158 and previous config saved to /var/cache/conftool/dbconfig/20250120-141340-root.json | [production] | 
            
  | 14:12 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2023.codfw.wmnet | [production] | 
            
  | 14:11 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1121.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:11 | <oblivian@deploy2002> | Started scap sync-world: Backport for [[gerrit:1109109|Use a bespoke database configuration for dumps (T382947)]] | [production] | 
            
  | 14:09 | <kamila@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1117.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:07 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti2023.codfw.wmnet with OS bookworm | [production] | 
            
  | 14:06 | <kamila@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1120.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:06 | <kamila@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1121.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:06 | <kamila@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1119.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:06 | <kamila@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1122.eqiad.wmnet with reason: host reimage | [production] |