| 2022-02-21 |
  
    
  | 16:46 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1135 (T300381)', diff saved to https://phabricator.wikimedia.org/P21192 and previous config saved to /var/cache/conftool/dbconfig/20220221-164608-marostegui.json | 
  [production] | 
            
  | 16:44 | 
  <mforns@deploy1002> | 
  Finished deploy [analytics/refinery@ed5c9f9] (hadoop-test): Deploy Aqs Hourly for Airflow THIN [analytics/refinery@ed5c9f9] (duration: 07m 12s) | 
  [production] | 
            
  | 16:38 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P21191 and previous config saved to /var/cache/conftool/dbconfig/20220221-163847-kormat.json | 
  [production] | 
            
  | 16:38 | 
  <elukey@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve2002.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:37 | 
  <mforns@deploy1002> | 
  Started deploy [analytics/refinery@ed5c9f9] (hadoop-test): Deploy Aqs Hourly for Airflow THIN [analytics/refinery@ed5c9f9] | 
  [production] | 
            
  | 16:37 | 
  <mforns@deploy1002> | 
  Finished deploy [analytics/refinery@ed5c9f9] (thin): Deploy Aqs Hourly for Airflow THIN [analytics/refinery@ed5c9f9] (duration: 00m 07s) | 
  [production] | 
            
  | 16:36 | 
  <mforns@deploy1002> | 
  Started deploy [analytics/refinery@ed5c9f9] (thin): Deploy Aqs Hourly for Airflow THIN [analytics/refinery@ed5c9f9] | 
  [production] | 
            
  | 16:36 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Depooling db1135 (T300381)', diff saved to https://phabricator.wikimedia.org/P21190 and previous config saved to /var/cache/conftool/dbconfig/20220221-163555-marostegui.json | 
  [production] | 
            
  | 16:35 | 
  <marostegui@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 16:35 | 
  <marostegui@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 16:35 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1134 (T300381)', diff saved to https://phabricator.wikimedia.org/P21189 and previous config saved to /var/cache/conftool/dbconfig/20220221-163548-marostegui.json | 
  [production] | 
            
  | 16:35 | 
  <elukey@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve2002.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:30 | 
  <cmooney@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1093.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 16:23 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P21188 and previous config saved to /var/cache/conftool/dbconfig/20220221-162342-kormat.json | 
  [production] | 
            
  | 16:21 | 
  <cmooney@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on elastic1093.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:20 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P21187 and previous config saved to /var/cache/conftool/dbconfig/20220221-162043-marostegui.json | 
  [production] | 
            
  | 16:18 | 
  <elukey@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host ml-serve2002.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 16:17 | 
  <cmooney@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1093.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:08 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1174 (T300774)', diff saved to https://phabricator.wikimedia.org/P21186 and previous config saved to /var/cache/conftool/dbconfig/20220221-160838-kormat.json | 
  [production] | 
            
  | 16:05 | 
  <elukey@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=ml-serve200[5-8].codfw.wmnet | 
  [production] | 
            
  | 16:05 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P21185 and previous config saved to /var/cache/conftool/dbconfig/20220221-160538-marostegui.json | 
  [production] | 
            
  | 16:04 | 
  <elukey@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: dc=codfw,cluster=ml_serve,service=kubesvc | 
  [production] | 
            
  | 16:03 | 
  <elukey@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: dc=codfw,cluster=ml-serve,service=kubesvc | 
  [production] | 
            
  | 16:01 | 
  <cmooney@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host elastic1093.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 16:01 | 
  <elukey@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve2001.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 15:59 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Depooling db1174 (T300774)', diff saved to https://phabricator.wikimedia.org/P21184 and previous config saved to /var/cache/conftool/dbconfig/20220221-155924-kormat.json | 
  [production] | 
            
  | 15:59 | 
  <kormat@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:59 | 
  <kormat@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:52 | 
  <mforns@deploy1002> | 
  Finished deploy [analytics/refinery@ed5c9f9]: Deploy Aqs Hourly for Airflow [analytics/refinery@ed5c9f9] (duration: 21m 23s) | 
  [production] | 
            
  | 15:50 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1134 (T300381)', diff saved to https://phabricator.wikimedia.org/P21183 and previous config saved to /var/cache/conftool/dbconfig/20220221-155034-marostegui.json | 
  [production] | 
            
  | 15:47 | 
  <elukey@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve2001.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:45 | 
  <kormat@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on 10 hosts with reason: Maintenance | 
  [production] | 
            
  | 15:45 | 
  <kormat@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 12:00:00 on 10 hosts with reason: Maintenance | 
  [production] | 
            
  | 15:45 | 
  <kormat@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2121.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:45 | 
  <elukey@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve2001.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:45 | 
  <kormat@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 6:00:00 on db2121.codfw.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:45 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1127 (T300774)', diff saved to https://phabricator.wikimedia.org/P21182 and previous config saved to /var/cache/conftool/dbconfig/20220221-154518-kormat.json | 
  [production] | 
            
  | 15:41 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Depooling db1134 (T300381)', diff saved to https://phabricator.wikimedia.org/P21181 and previous config saved to /var/cache/conftool/dbconfig/20220221-154118-marostegui.json | 
  [production] | 
            
  | 15:41 | 
  <marostegui@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1134.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:41 | 
  <marostegui@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 6:00:00 on db1134.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:41 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1163 (T300381)', diff saved to https://phabricator.wikimedia.org/P21180 and previous config saved to /var/cache/conftool/dbconfig/20220221-154110-marostegui.json | 
  [production] | 
            
  | 15:30 | 
  <mforns@deploy1002> | 
  Started deploy [analytics/refinery@ed5c9f9]: Deploy Aqs Hourly for Airflow [analytics/refinery@ed5c9f9] | 
  [production] | 
            
  | 15:30 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P21179 and previous config saved to /var/cache/conftool/dbconfig/20220221-153013-kormat.json | 
  [production] | 
            
  | 15:28 | 
  <elukey@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host ml-serve2001.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 15:26 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P21178 and previous config saved to /var/cache/conftool/dbconfig/20220221-152606-marostegui.json | 
  [production] | 
            
  | 15:19 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1129 (re)pooling @ 100%: repooling after schema change', diff saved to https://phabricator.wikimedia.org/P21177 and previous config saved to /var/cache/conftool/dbconfig/20220221-151945-root.json | 
  [production] | 
            
  | 15:15 | 
  <kormat@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P21176 and previous config saved to /var/cache/conftool/dbconfig/20220221-151509-kormat.json | 
  [production] | 
            
  | 15:11 | 
  <hnowlan@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync | 
  [production] | 
            
  | 15:11 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P21175 and previous config saved to /var/cache/conftool/dbconfig/20220221-151101-marostegui.json | 
  [production] | 
            
  | 15:10 | 
  <hnowlan@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: sync | 
  [production] |