| 
      
        2024-04-10
      
      ยง
     | 
  
    
  | 16:05 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1219', diff saved to https://phabricator.wikimedia.org/P60265 and previous config saved to /var/cache/conftool/dbconfig/20240410-160531-arnaudb.json | 
  [production] | 
            
  | 15:50 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1219', diff saved to https://phabricator.wikimedia.org/P60264 and previous config saved to /var/cache/conftool/dbconfig/20240410-155024-arnaudb.json | 
  [production] | 
            
  | 15:35 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1219 (T360332)', diff saved to https://phabricator.wikimedia.org/P60262 and previous config saved to /var/cache/conftool/dbconfig/20240410-153516-arnaudb.json | 
  [production] | 
            
  | 15:32 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1219 (T360332)', diff saved to https://phabricator.wikimedia.org/P60261 and previous config saved to /var/cache/conftool/dbconfig/20240410-153229-arnaudb.json | 
  [production] | 
            
  | 15:32 | 
  <arnaudb@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1219.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:32 | 
  <arnaudb@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 12:00:00 on db1219.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:32 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1218 (T360332)', diff saved to https://phabricator.wikimedia.org/P60260 and previous config saved to /var/cache/conftool/dbconfig/20240410-153207-arnaudb.json | 
  [production] | 
            
  | 15:17 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P60259 and previous config saved to /var/cache/conftool/dbconfig/20240410-151659-arnaudb.json | 
  [production] | 
            
  | 15:14 | 
  <hnowlan@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 15:14 | 
  <hnowlan@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 15:13 | 
  <hnowlan@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 15:13 | 
  <hnowlan@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 15:03 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1243 (T356166)', diff saved to https://phabricator.wikimedia.org/P60258 and previous config saved to /var/cache/conftool/dbconfig/20240410-150327-marostegui.json | 
  [production] | 
            
  | 15:03 | 
  <marostegui@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1243.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:03 | 
  <marostegui@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 12:00:00 on db1243.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 15:03 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1242 (T356166)', diff saved to https://phabricator.wikimedia.org/P60257 and previous config saved to /var/cache/conftool/dbconfig/20240410-150304-marostegui.json | 
  [production] | 
            
  | 15:01 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P60256 and previous config saved to /var/cache/conftool/dbconfig/20240410-150152-arnaudb.json | 
  [production] | 
            
  | 14:58 | 
  <moritzm> | 
  installing debian-archive-keyring updates on buster | 
  [production] | 
            
  | 14:55 | 
  <akosiaris> | 
  kill all ffmpegs on mw1437 and increase weight of mw1347 from 10 to 30 to direct most queries to it while the other 3 videoscalers serve the backlog | 
  [production] | 
            
  | 14:54 | 
  <akosiaris@cumin1002> | 
  conftool action : set/weight=30; selector: name=mw1437.*.wmnet,dc=eqiad | 
  [production] | 
            
  | 14:51 | 
  <hnowlan@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 14:51 | 
  <hnowlan@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 14:50 | 
  <hnowlan@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 14:50 | 
  <hnowlan@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply | 
  [production] | 
            
  | 14:47 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1242', diff saved to https://phabricator.wikimedia.org/P60255 and previous config saved to /var/cache/conftool/dbconfig/20240410-144757-marostegui.json | 
  [production] | 
            
  | 14:46 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1218 (T360332)', diff saved to https://phabricator.wikimedia.org/P60254 and previous config saved to /var/cache/conftool/dbconfig/20240410-144644-arnaudb.json | 
  [production] | 
            
  | 14:44 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1218 (T360332)', diff saved to https://phabricator.wikimedia.org/P60253 and previous config saved to /var/cache/conftool/dbconfig/20240410-144400-arnaudb.json | 
  [production] | 
            
  | 14:43 | 
  <arnaudb@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1218.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 14:43 | 
  <arnaudb@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 12:00:00 on db1218.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 14:43 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1207 (T360332)', diff saved to https://phabricator.wikimedia.org/P60252 and previous config saved to /var/cache/conftool/dbconfig/20240410-144336-arnaudb.json | 
  [production] | 
            
  | 14:32 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1242', diff saved to https://phabricator.wikimedia.org/P60251 and previous config saved to /var/cache/conftool/dbconfig/20240410-143249-marostegui.json | 
  [production] | 
            
  | 14:28 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P60250 and previous config saved to /var/cache/conftool/dbconfig/20240410-142829-arnaudb.json | 
  [production] | 
            
  | 14:21 | 
  <sukhe@cumin1002> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cp4052.ulsfo.wmnet | 
  [production] | 
            
  | 14:20 | 
  <sukhe@cumin1002> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cp4052.ulsfo.wmnet | 
  [production] | 
            
  | 14:18 | 
  <wmbot~deltaquad@tools-sgebastion-10> | 
  ./stewardbots/StewardBot/manage.sh restart # RC reader not reading RC | 
  [tools.stewardbots] | 
            
  | 14:17 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1242 (T356166)', diff saved to https://phabricator.wikimedia.org/P60249 and previous config saved to /var/cache/conftool/dbconfig/20240410-141742-marostegui.json | 
  [production] | 
            
  | 14:17 | 
  <sukhe@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=cp1112.eqiad.wmnet,service=(cdn|ats-be) | 
  [production] | 
            
  | 14:13 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P60248 and previous config saved to /var/cache/conftool/dbconfig/20240410-141322-arnaudb.json | 
  [production] | 
            
  | 14:13 | 
  <wmbot~anticomposite@tools-sgebastion-10> | 
  ./stewardbots/StewardBot/manage.sh restart # RC reader not reading RC | 
  [tools.stewardbots] | 
            
  | 14:11 | 
  <wmbot~anticomposite@tools-sgebastion-10> | 
  SULWatcher/manage.sh restart # SULWatchers disconnected | 
  [tools.stewardbots] | 
            
  | 14:07 | 
  <sukhe@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1112.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 13:58 | 
  <sukhe@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=cp4052.ulsfo.wmnet,service=(cdn|ats-be) | 
  [production] | 
            
  | 13:58 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1207 (T360332)', diff saved to https://phabricator.wikimedia.org/P60246 and previous config saved to /var/cache/conftool/dbconfig/20240410-135814-arnaudb.json | 
  [production] | 
            
  | 13:55 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1207 (T360332)', diff saved to https://phabricator.wikimedia.org/P60245 and previous config saved to /var/cache/conftool/dbconfig/20240410-135525-arnaudb.json | 
  [production] | 
            
  | 13:55 | 
  <arnaudb@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1207.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 13:55 | 
  <arnaudb@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 12:00:00 on db1207.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 13:55 | 
  <arnaudb@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1206 (T360332)', diff saved to https://phabricator.wikimedia.org/P60244 and previous config saved to /var/cache/conftool/dbconfig/20240410-135502-arnaudb.json | 
  [production] | 
            
  | 13:54 | 
  <sukhe@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4052.ulsfo.wmnet with OS bullseye | 
  [production] | 
            
  | 13:49 | 
  <denisse> | 
  Delete unused Prometheus TLS certificates - T360414 | 
  [production] | 
            
  | 13:47 | 
  <moritzm> | 
  installing unbound security updates | 
  [production] |