| 2024-05-03
      
      ยง | 
    
  | 15:26 | <dcausse> | depooled wdqs1012 (lagged) | [production] | 
            
  | 15:23 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2181', diff saved to https://phabricator.wikimedia.org/P61854 and previous config saved to /var/cache/conftool/dbconfig/20240503-152354-marostegui.json | [production] | 
            
  | 15:08 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2181 (T361627)', diff saved to https://phabricator.wikimedia.org/P61853 and previous config saved to /var/cache/conftool/dbconfig/20240503-150846-marostegui.json | [production] | 
            
  | 14:48 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "add install7001 - jmm@cumin2002" | [production] | 
            
  | 14:44 | <jnuche@deploy1002> | Finished deploy [releng/jenkins-deploy@5d3a06d] (releasing): update plugins to address vulnerabilities (duration: 00m 39s) | [production] | 
            
  | 14:44 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db2181 (T361627)', diff saved to https://phabricator.wikimedia.org/P61852 and previous config saved to /var/cache/conftool/dbconfig/20240503-144419-marostegui.json | [production] | 
            
  | 14:44 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2181.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:44 | <jnuche@deploy1002> | Started deploy [releng/jenkins-deploy@5d3a06d] (releasing): update plugins to address vulnerabilities | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2181.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2167 (T361627)', diff saved to https://phabricator.wikimedia.org/P61851 and previous config saved to /var/cache/conftool/dbconfig/20240503-144356-marostegui.json | [production] | 
            
  | 14:39 | <jnuche@deploy1002> | Finished deploy [releng/jenkins-deploy@5d3a06d] (releasing): test plugin update in secondary host (duration: 00m 22s) | [production] | 
            
  | 14:39 | <jnuche@deploy1002> | Started deploy [releng/jenkins-deploy@5d3a06d] (releasing): test plugin update in secondary host | [production] | 
            
  | 14:28 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2167', diff saved to https://phabricator.wikimedia.org/P61850 and previous config saved to /var/cache/conftool/dbconfig/20240503-142848-marostegui.json | [production] | 
            
  | 14:26 | <jmm@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "add install7001 - jmm@cumin2002" | [production] | 
            
  | 14:26 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host install7001.wikimedia.org | [production] | 
            
  | 14:26 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host install7001.wikimedia.org with OS bookworm | [production] | 
            
  | 14:16 | <sukhe@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 14:15 | <sukhe@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 14:14 | <sukhe> | sudo homer asw*magru* commit "add durum and doh hosts in magru" | [production] | 
            
  | 14:13 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2167', diff saved to https://phabricator.wikimedia.org/P61849 and previous config saved to /var/cache/conftool/dbconfig/20240503-141341-marostegui.json | [production] | 
            
  | 14:11 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on install7001.wikimedia.org with reason: host reimage | [production] | 
            
  | 14:08 | <jmm@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on install7001.wikimedia.org with reason: host reimage | [production] | 
            
  | 14:07 | <herron> | alert1001:~# systemctl restart prometheus-alertmanager.service | [production] | 
            
  | 13:58 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2167 (T361627)', diff saved to https://phabricator.wikimedia.org/P61848 and previous config saved to /var/cache/conftool/dbconfig/20240503-135834-marostegui.json | [production] | 
            
  | 13:43 | <jmm@cumin2002> | START - Cookbook sre.hosts.reimage for host install7001.wikimedia.org with OS bookworm | [production] | 
            
  | 13:36 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db2167 (T361627)', diff saved to https://phabricator.wikimedia.org/P61847 and previous config saved to /var/cache/conftool/dbconfig/20240503-133601-marostegui.json | [production] | 
            
  | 13:35 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2167.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 13:35 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2167.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 13:35 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2166 (T361627)', diff saved to https://phabricator.wikimedia.org/P61846 and previous config saved to /var/cache/conftool/dbconfig/20240503-133538-marostegui.json | [production] | 
            
  | 13:30 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM install7001.wikimedia.org - jmm@cumin2002" | [production] | 
            
  | 13:29 | <jmm@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM install7001.wikimedia.org - jmm@cumin2002" | [production] | 
            
  | 13:28 | <jmm@cumin2002> | END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) install7001.wikimedia.org on all recursors | [production] | 
            
  | 13:28 | <jmm@cumin2002> | START - Cookbook sre.dns.wipe-cache install7001.wikimedia.org on all recursors | [production] | 
            
  | 13:28 | <jmm@cumin2002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 13:28 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM install7001.wikimedia.org - jmm@cumin2002" | [production] | 
            
  | 13:26 | <elukey> | restart karma on alert1001 to verify if probe down alerts shown are stale | [production] | 
            
  | 13:26 | <jmm@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM install7001.wikimedia.org - jmm@cumin2002" | [production] | 
            
  | 13:23 | <cmooney@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 13:22 | <cmooney@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 13:20 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2166', diff saved to https://phabricator.wikimedia.org/P61845 and previous config saved to /var/cache/conftool/dbconfig/20240503-132030-marostegui.json | [production] | 
            
  | 13:05 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2166', diff saved to https://phabricator.wikimedia.org/P61844 and previous config saved to /var/cache/conftool/dbconfig/20240503-130523-marostegui.json | [production] | 
            
  | 13:04 | <cmooney@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 13:03 | <cmooney@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:51 | <cmooney@cumin1002> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 12:50 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2166 (T361627)', diff saved to https://phabricator.wikimedia.org/P61843 and previous config saved to /var/cache/conftool/dbconfig/20240503-125015-marostegui.json | [production] | 
            
  | 12:47 | <cmooney@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:26 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1203 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P61841 and previous config saved to /var/cache/conftool/dbconfig/20240503-122659-root.json | [production] | 
            
  | 12:25 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db2166 (T361627)', diff saved to https://phabricator.wikimedia.org/P61840 and previous config saved to /var/cache/conftool/dbconfig/20240503-122510-marostegui.json | [production] | 
            
  | 12:25 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2166.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 12:24 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db2166.codfw.wmnet with reason: Maintenance | [production] |