| 
      
        2019-08-30
      
      §
     | 
  
    
  | 11:48 | 
  <marostegui> | 
  Start s2 replication on labsdb1012 | 
  [production] | 
            
  | 11:33 | 
  <jynus> | 
  switching db1125:s2 (eqiad sanitarium) to replicate from codfw T231638 | 
  [production] | 
            
  | 11:31 | 
  <marostegui> | 
  Temporary stop s2 replication on labsdb1009-labsdb1012 | 
  [production] | 
            
  | 10:23 | 
  <jynus> | 
  reseting db1074 from iLo | 
  [production] | 
            
  | 10:10 | 
  <jynus@deploy1001> | 
  Synchronized wmf-config/db-eqiad.php: Mirror dbctl depool of db1074 (duration: 00m 55s) | 
  [production] | 
            
  | 09:57 | 
  <jynus@cumin1001> | 
  dbctl commit (dc=all): 'Depool db1074 after crash', diff saved to https://phabricator.wikimedia.org/P9013 and previous config saved to /var/cache/conftool/dbconfig/20190830-095747-jynus.json | 
  [production] | 
            
  | 09:24 | 
  <ema> | 
  cp1075: depool ats-be due to low but constant 504 rate after 8.0.5-1wm4 upgrade | 
  [production] | 
            
  | 09:20 | 
  <ema@puppetmaster1001> | 
  conftool action : set/pooled=no; selector: name=cp1075.eqiad.wmnet,service=ats-be | 
  [production] | 
            
  | 09:13 | 
  <ema> | 
  cp1075: upgrade ATS to 8.0.5-1wm4 | 
  [production] | 
            
  | 08:50 | 
  <ema> | 
  repool ats-be on cp1075 and verify if T231504 is fixed | 
  [production] | 
            
  | 08:49 | 
  <ema@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=cp1075.eqiad.wmnet,service=ats-be | 
  [production] | 
            
  | 08:03 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Fully repool db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9011 and previous config saved to /var/cache/conftool/dbconfig/20190830-080334-marostegui.json | 
  [production] | 
            
  | 07:42 | 
  <marostegui> | 
  Upgrade db2055 db2071 db2072 db2092 | 
  [production] | 
            
  | 07:10 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9010 and previous config saved to /var/cache/conftool/dbconfig/20190830-071043-marostegui.json | 
  [production] | 
            
  | 06:39 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9009 and previous config saved to /var/cache/conftool/dbconfig/20190830-063949-marostegui.json | 
  [production] | 
            
  | 06:25 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9008 and previous config saved to /var/cache/conftool/dbconfig/20190830-062517-marostegui.json | 
  [production] | 
            
  | 06:15 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Slowly repool db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9007 and previous config saved to /var/cache/conftool/dbconfig/20190830-061546-marostegui.json | 
  [production] | 
            
  | 06:07 | 
  <marostegui> | 
  Upgrade db1076 | 
  [production] | 
            
  | 06:07 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Depool db1076 for upgrade - T230785', diff saved to https://phabricator.wikimedia.org/P9006 and previous config saved to /var/cache/conftool/dbconfig/20190830-060702-marostegui.json | 
  [production] | 
            
  | 05:25 | 
  <marostegui> | 
  Stop MySQL on db2060 - T231625 | 
  [production] | 
            
  | 05:23 | 
  <marostegui> | 
  Remove db2060 from tendril and zarcillo - T231625 | 
  [production] | 
            
  | 05:15 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-codfw.php: Remove db2060 from config T231625 (duration: 00m 53s) | 
  [production] | 
            
  | 05:14 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-eqiad.php: Remove db2060 from config T231625 (duration: 00m 53s) | 
  [production] | 
            
  | 05:10 | 
  <marostegui> | 
  Restart wikibugs | 
  [production] | 
            
  
    | 
      
        2019-08-29
      
      §
     | 
  
    
  | 23:23 | 
  <ejegg> | 
  updated payments-wiki from 1d5d7503b0 to 51d9ed79b6 | 
  [production] | 
            
  | 23:15 | 
  <krinkle@deploy1001> | 
  Synchronized wmf-config/CommonSettings.php: 4cdfebe (duration: 00m 54s) | 
  [production] | 
            
  | 21:36 | 
  <ejegg> | 
  re-enabled fundraising python jobs | 
  [production] | 
            
  | 20:18 | 
  <ejegg> | 
  updated fundraising python tools from c0f4e7a379 to b42bda6bf3 | 
  [production] | 
            
  | 20:14 | 
  <foks> | 
  removing two files for legal compliance | 
  [production] | 
            
  | 20:14 | 
  <ejegg> | 
  disabled fundraising python jobs | 
  [production] | 
            
  | 19:56 | 
  <ebernhardson> | 
  cloudelastic-chi run frwiki_content/_forcemerge?only_expunge_deletes=true to try and fix 5gb segments with 96% deleted documents | 
  [production] | 
            
  | 18:59 | 
  <ebernhardson> | 
  restart elasticsearch on cloudelastic1003 (T231517) | 
  [production] | 
            
  | 18:50 | 
  <ebernhardson> | 
  restart elasticsearch on cloudelastic1002 (T231517) | 
  [production] | 
            
  | 18:41 | 
  <ebernhardson> | 
  set index.merge.scheduler.max_thread_count to null to accept default values on cloudelastic-chi (T231517) | 
  [production] | 
            
  | 18:36 | 
  <krinkle@deploy1001> | 
  Synchronized php-1.34.0-wmf.20/extensions/AbuseFilter/includes/AbuseFilterVariableHolder.php: T231542 f37f0bd50cf (duration: 00m 53s) | 
  [production] | 
            
  | 18:33 | 
  <krinkle@deploy1001> | 
  Synchronized php-1.34.0-wmf.20/extensions/CentralAuth/modules/ext.centralauth.ForeignApi.js: e7cd3cd313a4642 (duration: 00m 55s) | 
  [production] | 
            
  | 18:23 | 
  <ebernhardson> | 
  restart elasticsearch on cloudelastic1001 (T231517) | 
  [production] | 
            
  | 18:22 | 
  <urbanecm@deploy1001> | 
  Synchronized wmf-config/CommonSettings.php: SWAT: Fix "Assign all rights assigned to suppress group to oversight group" (T230601) (duration: 00m 54s) | 
  [production] | 
            
  | 18:07 | 
  <ebernhardson> | 
  increase index.refresh_interval to 5m for all indices on cloudelastic-chi | 
  [production] | 
            
  | 17:22 | 
  <crusnov@cumin1001> | 
  END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) | 
  [production] | 
            
  | 17:19 | 
  <crusnov@cumin1001> | 
  END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) | 
  [production] | 
            
  | 17:15 | 
  <dcausse> | 
  restarted elasticsearch on cloudelastic1004 (T231517) | 
  [production] | 
            
  | 17:10 | 
  <crusnov@cumin1001> | 
  START - Cookbook sre.ganeti.makevm | 
  [production] | 
            
  | 17:09 | 
  <crusnov@cumin1001> | 
  START - Cookbook sre.ganeti.makevm | 
  [production] | 
            
  | 17:09 | 
  <crusnov@cumin1001> | 
  END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) | 
  [production] | 
            
  | 16:59 | 
  <crusnov@cumin1001> | 
  START - Cookbook sre.ganeti.makevm | 
  [production] | 
            
  | 16:49 | 
  <crusnov@cumin1001> | 
  START - Cookbook sre.ganeti.makevm | 
  [production] | 
            
  | 16:49 | 
  <crusnov@cumin1001> | 
  END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) | 
  [production] | 
            
  | 16:49 | 
  <crusnov@cumin1001> | 
  START - Cookbook sre.ganeti.makevm | 
  [production] | 
            
  | 14:17 | 
  <ema@puppetmaster1001> | 
  conftool action : set/pooled=no; selector: name=cp1075.eqiad.wmnet,service=ats-be | 
  [production] |