| 2024-07-11
      
      § | 
    
  | 01:50 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:50 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:48 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:48 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:47 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:47 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:46 | <andrew@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1060.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 01:44 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:44 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:43 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:43 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:43 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:43 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:43 | <andrew@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1060.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 01:37 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2163', diff saved to https://phabricator.wikimedia.org/P66225 and previous config saved to /var/cache/conftool/dbconfig/20240711-013723-arnaudb.json | [production] | 
            
  | 01:36 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=99) on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:36 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1061.eqiad.wmnet' | [admin] | 
            
  | 01:27 | <andrew@cumin1002> | START - Cookbook sre.hosts.reimage for host cloudvirt1060.eqiad.wmnet with OS bookworm | [production] | 
            
  | 01:22 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2163 (T367781)', diff saved to https://phabricator.wikimedia.org/P66224 and previous config saved to /var/cache/conftool/dbconfig/20240711-012216-arnaudb.json | [production] | 
            
  | 01:21 | <mutante> | gerrit-replica.wikimedia.org (gerrit2002) - switched firewall provider from iptables to nftables - all seems fine to me but just in case: gerrit:1053068 can be reverted to go back | [production] | 
            
  | 01:20 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2163 (T367781)', diff saved to https://phabricator.wikimedia.org/P66223 and previous config saved to /var/cache/conftool/dbconfig/20240711-012006-arnaudb.json | [production] | 
            
  | 01:19 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2163.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 01:19 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2163.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 01:19 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2162 (T367781)', diff saved to https://phabricator.wikimedia.org/P66222 and previous config saved to /var/cache/conftool/dbconfig/20240711-011944-arnaudb.json | [production] | 
            
  | 01:04 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P66221 and previous config saved to /var/cache/conftool/dbconfig/20240711-010437-arnaudb.json | [production] | 
            
  | 00:55 | <mutante> | gerrit-replica.wikimedia.org (gerrit2002) - maintenance | [production] | 
            
  | 00:49 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P66220 and previous config saved to /var/cache/conftool/dbconfig/20240711-004930-arnaudb.json | [production] | 
            
  | 00:49 | <dzahn@cumin1002> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1:00:00 on gerrit-replica.wikimedia.org with reason: switch firewall provider | [production] | 
            
  | 00:49 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on gerrit-replica.wikimedia.org with reason: switch firewall provider | [production] | 
            
  | 00:49 | <dzahn@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit2002.wikimedia.org with reason: switch firewall provider | [production] | 
            
  | 00:48 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on gerrit2002.wikimedia.org with reason: switch firewall provider | [production] | 
            
  | 00:34 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2162 (T367781)', diff saved to https://phabricator.wikimedia.org/P66219 and previous config saved to /var/cache/conftool/dbconfig/20240711-003423-arnaudb.json | [production] | 
            
  | 00:32 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2162 (T367781)', diff saved to https://phabricator.wikimedia.org/P66218 and previous config saved to /var/cache/conftool/dbconfig/20240711-003212-arnaudb.json | [production] | 
            
  | 00:32 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2162.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 00:32 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2162.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 00:31 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154 (T367781)', diff saved to https://phabricator.wikimedia.org/P66217 and previous config saved to /var/cache/conftool/dbconfig/20240711-003150-arnaudb.json | [production] | 
            
  | 00:16 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P66216 and previous config saved to /var/cache/conftool/dbconfig/20240711-001643-arnaudb.json | [production] | 
            
  | 00:01 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P66215 and previous config saved to /var/cache/conftool/dbconfig/20240711-000136-arnaudb.json | [production] | 
            
  
    | 2024-07-10
      
      § | 
    
  | 23:46 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2154 (T367781)', diff saved to https://phabricator.wikimedia.org/P66214 and previous config saved to /var/cache/conftool/dbconfig/20240710-234629-arnaudb.json | [production] | 
            
  | 23:44 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2154 (T367781)', diff saved to https://phabricator.wikimedia.org/P66213 and previous config saved to /var/cache/conftool/dbconfig/20240710-234418-arnaudb.json | [production] | 
            
  | 23:44 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2154.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 23:44 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2154.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 23:43 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2152 (T367781)', diff saved to https://phabricator.wikimedia.org/P66212 and previous config saved to /var/cache/conftool/dbconfig/20240710-234356-arnaudb.json | [production] | 
            
  | 23:35 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db2182 (T367856)', diff saved to https://phabricator.wikimedia.org/P66211 and previous config saved to /var/cache/conftool/dbconfig/20240710-233558-marostegui.json | [production] | 
            
  | 23:35 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 23:35 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 23:35 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2168 (T367856)', diff saved to https://phabricator.wikimedia.org/P66210 and previous config saved to /var/cache/conftool/dbconfig/20240710-233535-marostegui.json | [production] | 
            
  | 23:35 | <rzl> | $ sudo cumin A:all-mw enable-puppet T367012 | [production] | 
            
  | 23:34 | <rzl@deploy1002> | Finished scap: T367012 (duration: 07m 45s) | [production] | 
            
  | 23:30 | <rzl@deploy1002> | rzl: Continuing with sync | [production] |