| 
      
        2020-08-18
      
      ยง
     | 
  
    
  | 13:04 | 
  <kormat> | 
  disabling puppet on all db machines T259516 | 
  [production] | 
            
  | 12:57 | 
  <_joe_> | 
  rebooting appservers in eqiad, 3 at a time | 
  [production] | 
            
  | 12:57 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 12:37 | 
  <oblivian@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) | 
  [production] | 
            
  | 12:34 | 
  <kormat> | 
  deploying wmfmariadbpy 0.4 | 
  [production] | 
            
  | 12:21 | 
  <jayme@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 11:53 | 
  <XioNoX> | 
  add new icinga hosts to mr policies - T260533 | 
  [production] | 
            
  | 11:40 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 11:36 | 
  <Lucas_WMDE> | 
  EU backport&config done | 
  [production] | 
            
  | 11:33 | 
  <lucaswerkmeister-wmde@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:620888|Add Wikisource wordmark for trwikisource (T260658)]], part 2 (duration: 00m 55s) | 
  [production] | 
            
  | 11:32 | 
  <Lucas_WMDE> | 
  lucaswerkmeister-wmde@mwmaint1002:~$ printf '%s\n' 'https://en.wikipedia.org/static/images/mobile/copyright/wikisource-wordmark-tr.svg' | mwscript purgeList.php # T260658 | 
  [production] | 
            
  | 11:32 | 
  <lucaswerkmeister-wmde@deploy1001> | 
  Synchronized static/images/mobile/copyright/wikisource-wordmark-tr.svg: Config: [[gerrit:620888|Add Wikisource wordmark for trwikisource (T260658)]], part 1 (duration: 00m 55s) | 
  [production] | 
            
  | 11:24 | 
  <lucaswerkmeister-wmde@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:595543|Enable Data Bridge on Catalan Wikipedia (T232584)]] (duration: 01m 01s) | 
  [production] | 
            
  | 11:06 | 
  <jbond42> | 
  deploy net-snmp update to buster | 
  [production] | 
            
  | 10:56 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=yes; selector: cluster=api_appserver,dc=codfw,name=mw229.* | 
  [production] | 
            
  | 10:55 | 
  <oblivian@cumin1001> | 
  END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) | 
  [production] | 
            
  | 10:54 | 
  <marostegui> | 
  Reboot db2125 after running a full upgrade - T260670 | 
  [production] | 
            
  | 10:46 | 
  <marostegui> | 
  Powercycle db2125 from the idrac T260670 | 
  [production] | 
            
  | 10:07 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Depool db2125 - host down T260670', diff saved to https://phabricator.wikimedia.org/P12288 and previous config saved to /var/cache/conftool/dbconfig/20200818-100718-marostegui.json | 
  [production] | 
            
  | 09:45 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 09:43 | 
  <jiji@cumin1001> | 
  conftool action : set/pooled=yes; selector: name=mw2250.codfw.wmnet | 
  [production] | 
            
  | 09:40 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=yes; selector: cluster=api_appserver,dc=codfw,name=mw214[234].* | 
  [production] | 
            
  | 09:40 | 
  <oblivian@cumin1001> | 
  END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) | 
  [production] | 
            
  | 09:34 | 
  <kart_> | 
  Update cxserver to 2020-08-17-090424-production (T259980) | 
  [production] | 
            
  | 09:32 | 
  <kartik@deploy1001> | 
  helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . | 
  [production] | 
            
  | 09:29 | 
  <kartik@deploy1001> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' . | 
  [production] | 
            
  | 09:28 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 09:28 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=yes; selector: cluster=api_appserver,dc=codfw,name=mw214[02].* | 
  [production] | 
            
  | 09:26 | 
  <volans> | 
  upgraded spicerack to v0.0.39 on cumin hosts | 
  [production] | 
            
  | 09:25 | 
  <kartik@deploy1001> | 
  helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . | 
  [production] | 
            
  | 09:21 | 
  <volans> | 
  uploaded spicerack_0.0.39-1+deb10u1 to apt.wikimedia.org buster-wikimedia | 
  [production] | 
            
  | 09:05 | 
  <hashar> | 
  Restarting CI Jenkins | 
  [production] | 
            
  | 08:44 | 
  <vgutierrez> | 
  restart ats-tls on cp5006 | 
  [production] | 
            
  | 08:24 | 
  <oblivian@cumin1001> | 
  END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) | 
  [production] | 
            
  | 08:17 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 08:16 | 
  <oblivian@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=99) | 
  [production] | 
            
  | 08:10 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 08:02 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Fully repool db1089', diff saved to https://phabricator.wikimedia.org/P12284 and previous config saved to /var/cache/conftool/dbconfig/20200818-080256-marostegui.json | 
  [production] | 
            
  | 07:58 | 
  <oblivian@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=99) | 
  [production] | 
            
  | 07:53 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 07:45 | 
  <godog> | 
  VictorOps ack'd incidents will re-trigger after 24h if not resolved - T259465 | 
  [production] | 
            
  | 07:44 | 
  <oblivian@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=1) | 
  [production] | 
            
  | 07:43 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P12283 and previous config saved to /var/cache/conftool/dbconfig/20200818-074325-marostegui.json | 
  [production] | 
            
  | 07:42 | 
  <_joe_> | 
  performing rolling reboot of all codfw api servers | 
  [production] | 
            
  | 07:38 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.reboot-cluster | 
  [production] | 
            
  | 07:23 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P12282 and previous config saved to /var/cache/conftool/dbconfig/20200818-072349-marostegui.json | 
  [production] | 
            
  | 07:19 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=yes; selector: name=mw213[5-9].codfw.wmnet | 
  [production] | 
            
  | 07:16 | 
  <jynus> | 
  update rest of phabricator passwords T250361 | 
  [production] | 
            
  | 07:11 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P12281 and previous config saved to /var/cache/conftool/dbconfig/20200818-071121-marostegui.json | 
  [production] | 
            
  | 07:08 | 
  <oblivian@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=99) | 
  [production] |