| 2022-04-04 |
  | 11:12 | <mmandere> | depool cp4028 for reimage - T290005 | [production] | 
            
  | 11:11 | <volans> | deploying python3-wmflib 1.2.0 fleet-wide | [production] | 
            
  | 11:09 | <jforrester@deploy1002> | Finished deploy [integration/docroot@63b762d]: Id56cd5bf64ed Adding WikiLambda doc block (duration: 00m 08s) | [production] | 
            
  | 11:09 | <jforrester@deploy1002> | Started deploy [integration/docroot@63b762d]: Id56cd5bf64ed Adding WikiLambda doc block | [production] | 
            
  | 11:07 | <moritzm> | installing cups security updates on buster (client side tools/libs) | [production] | 
            
  | 11:04 | <mmandere@cumin1001> | START - Cookbook sre.hosts.reimage for host cp3054.esams.wmnet with OS buster | [production] | 
            
  | 10:53 | <mmandere> | depool cp3054 for reimage - T290005 | [production] | 
            
  | 10:39 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-druid1003.eqiad.wmnet | [production] | 
            
  | 10:38 | <volans> | uploaded python3-wmflib_1.2.0 to apt.wikimedia.org buster-wikimedia,bullseye-wikimedia | [production] | 
            
  | 10:32 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-druid1003.eqiad.wmnet | [production] | 
            
  | 10:26 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depooling db1119 (T298565)', diff saved to https://phabricator.wikimedia.org/P24035 and previous config saved to /var/cache/conftool/dbconfig/20220404-102616-ladsgroup.json | [production] | 
            
  | 10:26 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:26 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:26 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24034 and previous config saved to /var/cache/conftool/dbconfig/20220404-102609-ladsgroup.json | [production] | 
            
  | 10:26 | <moritzm> | installing libxml2 security updates | [production] | 
            
  | 10:14 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-druid1004.eqiad.wmnet | [production] | 
            
  | 10:11 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1106', diff saved to https://phabricator.wikimedia.org/P24033 and previous config saved to /var/cache/conftool/dbconfig/20220404-101104-ladsgroup.json | [production] | 
            
  | 10:09 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-druid1004.eqiad.wmnet | [production] | 
            
  | 10:08 | <moritzm> | installing icu bugfix updates from buster 10.12 point release | [production] | 
            
  | 09:58 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-druid1005.eqiad.wmnet | [production] | 
            
  | 09:55 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1106', diff saved to https://phabricator.wikimedia.org/P24032 and previous config saved to /var/cache/conftool/dbconfig/20220404-095558-ladsgroup.json | [production] | 
            
  | 09:55 | <jelto@cumin1001> | END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM gitlab1001.wikimedia.org | [production] | 
            
  | 09:54 | <btullis@deploy1002> | helmfile [staging] START helmfile.d/services/datahub: apply on main | [production] | 
            
  | 09:52 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-druid1005.eqiad.wmnet | [production] | 
            
  | 09:51 | <mmandere> | pool cp6008 with HAProxy as TLS termination layer - T290005 | [production] | 
            
  | 09:48 | <jelto@cumin1001> | START - Cookbook sre.ganeti.reboot-vm for VM gitlab1001.wikimedia.org | [production] | 
            
  | 09:47 | <moritzm> | installing zlib security updates | [production] | 
            
  | 09:44 | <mmandere> | pool cp5003 with HAProxy as TLS termination layer - T290005 | [production] | 
            
  | 09:40 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1106 (T298565)', diff saved to https://phabricator.wikimedia.org/P24031 and previous config saved to /var/cache/conftool/dbconfig/20220404-094053-ladsgroup.json | [production] | 
            
  | 09:31 | <moritzm> | rolling restart of FPM/Apache on mw canaries to pick up updated zlib/glibc/openssl/libxml | [production] | 
            
  | 09:29 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-test-presto1001.eqiad.wmnet | [production] | 
            
  | 09:26 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-test-presto1001.eqiad.wmnet | [production] | 
            
  | 09:26 | <mmandere@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6008.drmrs.wmnet with OS buster | [production] | 
            
  | 09:26 | <btullis@cumin1001> | END (PASS) - Cookbook sre.presto.roll-restart-workers (exit_code=0) for Presto analytics cluster: Roll restart of all Presto's jvm daemons. | [production] | 
            
  | 09:25 | <mmandere@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5003.eqsin.wmnet with OS buster | [production] | 
            
  | 09:16 | <btullis@cumin1001> | START - Cookbook sre.presto.roll-restart-workers for Presto analytics cluster: Roll restart of all Presto's jvm daemons. | [production] | 
            
  | 09:12 | <moritzm> | installing openssl updates from Buster 10.12 point release | [production] | 
            
  | 09:03 | <mmandere@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6008.drmrs.wmnet with reason: host reimage | [production] | 
            
  | 08:59 | <mmandere@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on cp6008.drmrs.wmnet with reason: host reimage | [production] | 
            
  | 08:59 | <mmandere@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5003.eqsin.wmnet with reason: host reimage | [production] | 
            
  | 08:56 | <moritzm> | installing glibc updates from buster 10.12 point release | [production] | 
            
  | 08:55 | <mmandere@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on cp5003.eqsin.wmnet with reason: host reimage | [production] | 
            
  | 08:45 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db1130 (re)pooling @ 5%: After reimage', diff saved to https://phabricator.wikimedia.org/P24030 and previous config saved to /var/cache/conftool/dbconfig/20220404-084523-root.json | [production] | 
            
  | 08:43 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . | [production] | 
            
  | 08:42 | <mmandere@cumin1001> | START - Cookbook sre.hosts.reimage for host cp6008.drmrs.wmnet with OS buster | [production] | 
            
  | 08:39 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . | [production] | 
            
  | 08:37 | <moritzm> | installing flac security updates | [production] | 
            
  | 08:37 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . | [production] | 
            
  | 08:37 | <mmandere> | depool cp6008 for reimage - T290005 | [production] | 
            
  | 08:35 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . | [production] |