| 2021-04-20
      
      § | 
    
  | 10:20 | <moritzm> | drain ganeti5001 | [production] | 
            
  | 10:11 | <hnowlan> | opening access to cassandra on new AQS hosts (aqs101*) to analytics-in4 filter | [production] | 
            
  | 10:05 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aphlict1001.eqiad.wmnet | [production] | 
            
  | 10:04 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host aphlict1001.eqiad.wmnet | [production] | 
            
  | 09:42 | <volans@cumin2001> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cumin2001.codfw.wmnet,cumin1001.eqiad.wmnet | [production] | 
            
  | 09:42 | <volans@cumin2001> | START - Cookbook sre.hosts.remove-downtime for cumin2001.codfw.wmnet,cumin1001.eqiad.wmnet | [production] | 
            
  | 09:42 | <aborrero@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1002.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 09:40 | <aborrero@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 09:38 | <aborrero@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1002.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 09:38 | <aborrero@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 09:20 | <kharlan@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . | [production] | 
            
  | 09:20 | <kharlan@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . | [production] | 
            
  | 08:58 | <kharlan@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . | [production] | 
            
  | 08:58 | <kharlan@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . | [production] | 
            
  | 08:54 | <filippo@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-fe1003.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 08:51 | <filippo@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-fe1003.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 08:50 | <kharlan@deploy1002> | helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . | [production] | 
            
  | 08:17 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host orespoolcounter1003.eqiad.wmnet | [production] | 
            
  | 08:15 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host orespoolcounter1003.eqiad.wmnet | [production] | 
            
  | 08:14 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host orespoolcounter1004.eqiad.wmnet | [production] | 
            
  | 08:12 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host orespoolcounter1004.eqiad.wmnet | [production] | 
            
  | 08:12 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2128.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 08:10 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host orespoolcounter2004.codfw.wmnet | [production] | 
            
  | 08:10 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2128.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 08:09 | <dcaro> | reprepro updating thirdparty/ceph-octopus repo | [production] | 
            
  | 08:08 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host orespoolcounter2004.codfw.wmnet | [production] | 
            
  | 08:07 | <filippo@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-fe1002.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 08:06 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host orespoolcounter2003.codfw.wmnet | [production] | 
            
  | 08:05 | <filippo@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-fe1002.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 08:04 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host orespoolcounter2003.codfw.wmnet | [production] | 
            
  | 07:59 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Remove db1086 from dbctl T278229', diff saved to https://phabricator.wikimedia.org/P15482 and previous config saved to /var/cache/conftool/dbconfig/20210420-075949-marostegui.json | [production] | 
            
  | 07:38 | <XioNoX> | BGP: prioritize directly connected peers - T280054 | [production] | 
            
  | 07:38 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db1161 (re)pooling @ 100%: Repool db1161', diff saved to https://phabricator.wikimedia.org/P15480 and previous config saved to /var/cache/conftool/dbconfig/20210420-073808-root.json | [production] | 
            
  | 07:35 | <filippo@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-fe2003.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 07:33 | <filippo@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-fe2003.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 07:23 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db1161 (re)pooling @ 75%: Repool db1161', diff saved to https://phabricator.wikimedia.org/P15479 and previous config saved to /var/cache/conftool/dbconfig/20210420-072305-root.json | [production] | 
            
  | 07:08 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db1161 (re)pooling @ 50%: Repool db1161', diff saved to https://phabricator.wikimedia.org/P15478 and previous config saved to /var/cache/conftool/dbconfig/20210420-070801-root.json | [production] | 
            
  | 07:05 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2074.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 07:03 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2074.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:52 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db1161 (re)pooling @ 25%: Repool db1161', diff saved to https://phabricator.wikimedia.org/P15477 and previous config saved to /var/cache/conftool/dbconfig/20210420-065257-root.json | [production] | 
            
  | 06:38 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2127.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:36 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2127.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:16 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2073.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:14 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2074.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:13 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2073.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:12 | <marostegui@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2105.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:11 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2074.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 06:10 | <marostegui@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db2105.codfw.wmnet with reason: REIMAGE | [production] |