| 
      
        2021-04-28
      
      ยง
     | 
  
    
  | 13:57 | 
  <andrew@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 13:15 | 
  <jayme> | 
  restarting pybal on lvs5001,lvs4005,lvs2007 - T271573 | 
  [production] | 
            
  | 13:14 | 
  <liw@deploy1002> | 
  rebuilt and synchronized wikiversions files: Revert "group1 wikis to 3.17.0-wmf.1" | 
  [production] | 
            
  | 13:10 | 
  <jayme> | 
  restarting pybal on lvs5002,lvs4006,lvs2008 - T271573 | 
  [production] | 
            
  | 13:04 | 
  <liw@deploy1002> | 
  Synchronized php: group1 wikis to 1.37.0-wmf.3 (duration: 01m 07s) | 
  [production] | 
            
  | 13:03 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) | 
  [production] | 
            
  | 13:03 | 
  <liw@deploy1002> | 
  rebuilt and synchronized wikiversions files: group1 wikis to 1.37.0-wmf.3 | 
  [production] | 
            
  | 13:02 | 
  <moritzm> | 
  upgrading deployment servers to PHP 7.4.32 | 
  [production] | 
            
  | 12:55 | 
  <moritzm> | 
  upgrading snapshot hosts to PHP 7.4.32 | 
  [production] | 
            
  | 12:48 | 
  <jayme> | 
  restarting pybal on lvs2009 - T271573 | 
  [production] | 
            
  | 12:45 | 
  <moritzm> | 
  upgrading labweb to PHP 7.4.32 | 
  [production] | 
            
  | 12:43 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.cassandra.roll-restart | 
  [production] | 
            
  | 12:42 | 
  <jayme> | 
  restarting pybal on lvs5003,lvs4007 - T271573 | 
  [production] | 
            
  | 12:39 | 
  <jayme> | 
  restarting pybal on lvs2010 - T271573 | 
  [production] | 
            
  | 12:36 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) | 
  [production] | 
            
  | 12:28 | 
  <apergos> | 
  manually edited /srv/deployment/dumps/dumps-cache/config on snapshots1011,12,13 to change deploy1001 to deploy1002 (where did it get the old value from? these are new installs!) | 
  [production] | 
            
  | 12:16 | 
  <moritzm> | 
  rolling restart of cassandra in restbase-dev to pick up Java security updates | 
  [production] | 
            
  | 12:15 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.cassandra.roll-restart | 
  [production] | 
            
  | 12:15 | 
  <jmm@cumin2001> | 
  END (FAIL) - Cookbook sre.cassandra.roll-restart (exit_code=99) | 
  [production] | 
            
  | 12:15 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.cassandra.roll-restart | 
  [production] | 
            
  | 11:53 | 
  <jayme> | 
  switching SRV record _etcd._tcp to new etcd cluster (for codfw, eqsin, ulsfo) | 
  [production] | 
            
  | 11:22 | 
  <Urbanecm> | 
  EU B&C window done | 
  [production] | 
            
  | 11:20 | 
  <urbanecm@deploy1002> | 
  Synchronized php-1.37.0-wmf.3/extensions/Popups/: 8d0ae5e8fedefa911fc216bfc810d7a6169ea7e5: Separate reference preview settings in beta & non-beta (T281235) (duration: 01m 08s) | 
  [production] | 
            
  | 11:16 | 
  <urbanecm@deploy1002> | 
  Synchronized wmf-config/InitialiseSettings.php: ddbc378e41783356e28cd90bbefa08624ea2844c: Enable partial action blocks on testwiki (T280528) (duration: 01m 07s) | 
  [production] | 
            
  | 11:05 | 
  <aborrero@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1002.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 11:03 | 
  <aborrero@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 11:03 | 
  <aborrero@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1002.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 11:01 | 
  <aborrero@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 10:44 | 
  <jbond42> | 
  updated the check-raid nrpe script to python3 | 
  [production] | 
            
  | 09:40 | 
  <moritzm> | 
  restarting Tomcat on idp-test1001 to pick up Java security updates | 
  [production] | 
            
  | 09:21 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 100%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15618 and previous config saved to /var/cache/conftool/dbconfig/20210428-092103-root.json | 
  [production] | 
            
  | 09:19 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host contint1001.wikimedia.org | 
  [production] | 
            
  | 09:12 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.hosts.reboot-single for host contint1001.wikimedia.org | 
  [production] | 
            
  | 09:09 | 
  <moritzm> | 
  restarting jenkins* on releases to pick up Java security updates | 
  [production] | 
            
  | 09:08 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host contint2001.wikimedia.org | 
  [production] | 
            
  | 09:06 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 75%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15617 and previous config saved to /var/cache/conftool/dbconfig/20210428-090559-root.json | 
  [production] | 
            
  | 08:59 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.hosts.reboot-single for host contint2001.wikimedia.org | 
  [production] | 
            
  | 08:50 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 50%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15616 and previous config saved to /var/cache/conftool/dbconfig/20210428-085056-root.json | 
  [production] | 
            
  | 08:42 | 
  <urbanecm@deploy1002> | 
  Synchronized wmf-config/InterwikiSortOrders.php: 96ad0d4ad294c442b4936a63ae1cd9de9c098aa9: Add alt, bcl, diq, mad, mni, mnw, nia, skr, tay and trv to InterwikiSortOrders (duration: 01m 08s) | 
  [production] | 
            
  | 08:41 | 
  <urbanecm@deploy1002> | 
  sync-file aborted: 96ad0d4ad294c442b4936a63ae1cd9de9c098aa9: Add alt, bcl, diq, mad, mni, mnw, nia, skr, tay and trv to InterwikiSortOrders (duration: 00m 02s) | 
  [production] | 
            
  | 08:36 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Fully repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15615 and previous config saved to /var/cache/conftool/dbconfig/20210428-083625-marostegui.json | 
  [production] | 
            
  | 08:35 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 25%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15614 and previous config saved to /var/cache/conftool/dbconfig/20210428-083552-root.json | 
  [production] | 
            
  | 08:34 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 25%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15613 and previous config saved to /var/cache/conftool/dbconfig/20210428-083458-root.json | 
  [production] | 
            
  | 08:26 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 100%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15612 and previous config saved to /var/cache/conftool/dbconfig/20210428-082625-root.json | 
  [production] | 
            
  | 08:25 | 
  <effie> | 
  update php7.2 on jobrunners and parsoid servers && rolling  php7.2-fpm restarts | 
  [production] | 
            
  | 08:11 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 75%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15611 and previous config saved to /var/cache/conftool/dbconfig/20210428-081121-root.json | 
  [production] | 
            
  | 07:56 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 50%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15610 and previous config saved to /var/cache/conftool/dbconfig/20210428-075618-root.json | 
  [production] | 
            
  | 07:52 | 
  <effie> | 
  update php7.2 on api servers && rolling  php7.2-fpm restarts | 
  [production] | 
            
  | 07:41 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 25%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15609 and previous config saved to /var/cache/conftool/dbconfig/20210428-074114-root.json | 
  [production] | 
            
  | 07:40 | 
  <marostegui> | 
  Deploy schema change on db1098:3316 and db1098:3316 T266486 T268392 T273360 | 
  [production] |