| 2024-10-15
      
      ยง | 
    
  | 16:41 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2188 (T367781)', diff saved to https://phabricator.wikimedia.org/P70010 and previous config saved to /var/cache/conftool/dbconfig/20241015-164050-arnaudb.json | [production] | 
            
  | 16:40 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P70009 and previous config saved to /var/cache/conftool/dbconfig/20241015-164032-ladsgroup.json | [production] | 
            
  | 16:39 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2188 (T367781)', diff saved to https://phabricator.wikimedia.org/P70008 and previous config saved to /var/cache/conftool/dbconfig/20241015-163834-arnaudb.json | [production] | 
            
  | 16:39 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2188.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:38 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2188.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:38 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2176 (T367781)', diff saved to https://phabricator.wikimedia.org/P70007 and previous config saved to /var/cache/conftool/dbconfig/20241015-163812-arnaudb.json | [production] | 
            
  | 16:35 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db1227 (T376905)', diff saved to https://phabricator.wikimedia.org/P70006 and previous config saved to /var/cache/conftool/dbconfig/20241015-163419-ladsgroup.json | [production] | 
            
  | 16:35 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1227.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 16:34 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1227.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 16:34 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1202 (T376905)', diff saved to https://phabricator.wikimedia.org/P70005 and previous config saved to /var/cache/conftool/dbconfig/20241015-163404-ladsgroup.json | [production] | 
            
  | 16:25 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P70004 and previous config saved to /var/cache/conftool/dbconfig/20241015-162525-ladsgroup.json | [production] | 
            
  | 16:23 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P70003 and previous config saved to /var/cache/conftool/dbconfig/20241015-162305-arnaudb.json | [production] | 
            
  | 16:21 | <ladsgroup@cumin1002> | START - Cookbook sre.mysql.clone of db2194.codfw.wmnet onto db2205.codfw.wmnet | [production] | 
            
  | 16:20 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depool for reclone (T375652)', diff saved to https://phabricator.wikimedia.org/P70002 and previous config saved to /var/cache/conftool/dbconfig/20241015-161934-ladsgroup.json | [production] | 
            
  | 16:18 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P70001 and previous config saved to /var/cache/conftool/dbconfig/20241015-161858-ladsgroup.json | [production] | 
            
  | 16:10 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154 (T371742)', diff saved to https://phabricator.wikimedia.org/P70000 and previous config saved to /var/cache/conftool/dbconfig/20241015-161018-ladsgroup.json | [production] | 
            
  | 16:07 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P69999 and previous config saved to /var/cache/conftool/dbconfig/20241015-160758-arnaudb.json | [production] | 
            
  | 16:03 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P69998 and previous config saved to /var/cache/conftool/dbconfig/20241015-160351-ladsgroup.json | [production] | 
            
  | 16:01 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depool db2205 T377164', diff saved to https://phabricator.wikimedia.org/P69997 and previous config saved to /var/cache/conftool/dbconfig/20241015-160106-ladsgroup.json | [production] | 
            
  | 15:53 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2176 (T367781)', diff saved to https://phabricator.wikimedia.org/P69996 and previous config saved to /var/cache/conftool/dbconfig/20241015-155251-arnaudb.json | [production] | 
            
  | 15:52 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Promote db2209 to s3 primary and set section read-write T377164', diff saved to https://phabricator.wikimedia.org/P69995 and previous config saved to /var/cache/conftool/dbconfig/20241015-155240-ladsgroup.json | [production] | 
            
  | 15:50 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1202 (T376905)', diff saved to https://phabricator.wikimedia.org/P69994 and previous config saved to /var/cache/conftool/dbconfig/20241015-154844-ladsgroup.json | [production] | 
            
  | 15:48 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Set s3 codfw as read-only for maintenance - T377164', diff saved to https://phabricator.wikimedia.org/P69993 and previous config saved to /var/cache/conftool/dbconfig/20241015-154834-ladsgroup.json | [production] | 
            
  | 15:48 | <Amir1> | Starting s3 codfw failover from db2205 to db2209 - T377164 | [production] | 
            
  | 15:46 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2176 (T367781)', diff saved to https://phabricator.wikimedia.org/P69992 and previous config saved to /var/cache/conftool/dbconfig/20241015-154318-arnaudb.json | [production] | 
            
  | 15:46 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2176.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:45 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2176.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 15:45 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2174 (T367781)', diff saved to https://phabricator.wikimedia.org/P69991 and previous config saved to /var/cache/conftool/dbconfig/20241015-154256-arnaudb.json | [production] | 
            
  | 15:44 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Set db2209 with weight 0 T377164', diff saved to https://phabricator.wikimedia.org/P69990 and previous config saved to /var/cache/conftool/dbconfig/20241015-154228-ladsgroup.json | [production] | 
            
  | 15:43 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 24 hosts with reason: Primary switchover s3 T377164 | [production] | 
            
  | 15:42 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on 24 hosts with reason: Primary switchover s3 T377164 | [production] | 
            
  | 15:42 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db1202 (T376905)', diff saved to https://phabricator.wikimedia.org/P69989 and previous config saved to /var/cache/conftool/dbconfig/20241015-154027-ladsgroup.json | [production] | 
            
  | 15:41 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1202.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:40 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1202.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 15:40 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1194 (T376905)', diff saved to https://phabricator.wikimedia.org/P69988 and previous config saved to /var/cache/conftool/dbconfig/20241015-154002-ladsgroup.json | [production] | 
            
  | 15:27 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P69987 and previous config saved to /var/cache/conftool/dbconfig/20241015-152749-arnaudb.json | [production] | 
            
  | 15:26 | <akosiaris> | run gnt-cluster verify-disks after ganeti1034 forceful reboot | [production] | 
            
  | 15:24 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P69986 and previous config saved to /var/cache/conftool/dbconfig/20241015-152456-ladsgroup.json | [production] | 
            
  | 15:22 | <volans> | force-rebooting ganeti1034 stuck due to drbd traces via mgmt | [production] | 
            
  | 15:19 | <akosiaris@cumin1002> | END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1034.eqiad.wmnet | [production] | 
            
  | 15:17 | <akosiaris> | drain ganeti1034 of VMs, hardware might be misbehaving | [production] | 
            
  | 15:16 | <akosiaris@cumin1002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet | [production] | 
            
  | 15:12 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P69985 and previous config saved to /var/cache/conftool/dbconfig/20241015-151243-arnaudb.json | [production] | 
            
  | 15:09 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P69984 and previous config saved to /var/cache/conftool/dbconfig/20241015-150948-ladsgroup.json | [production] | 
            
  | 14:57 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2174 (T367781)', diff saved to https://phabricator.wikimedia.org/P69983 and previous config saved to /var/cache/conftool/dbconfig/20241015-145734-arnaudb.json | [production] | 
            
  | 14:56 | <herron@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host titan1001.eqiad.wmnet | [production] | 
            
  | 14:56 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2174 (T367781)', diff saved to https://phabricator.wikimedia.org/P69982 and previous config saved to /var/cache/conftool/dbconfig/20241015-145517-arnaudb.json | [production] | 
            
  | 14:55 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2174.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:55 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2174.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:55 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2173 (T367781)', diff saved to https://phabricator.wikimedia.org/P69981 and previous config saved to /var/cache/conftool/dbconfig/20241015-145453-arnaudb.json | [production] |