| 
      
        2019-07-19
      
      §
     | 
  
    
  | 07:38 | 
  <moritzm> | 
  rebooting tungsten for kernel update | 
  [production] | 
            
  | 07:38 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | 
  [production] | 
            
  | 07:38 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.hosts.downtime | 
  [production] | 
            
  | 07:25 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | 
  [production] | 
            
  | 07:25 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.hosts.downtime | 
  [production] | 
            
  | 07:25 | 
  <jmm@cumin2001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | 
  [production] | 
            
  | 07:25 | 
  <jmm@cumin2001> | 
  START - Cookbook sre.hosts.downtime | 
  [production] | 
            
  | 07:03 | 
  <elukey> | 
  restart php-fpm on mw1330 - op-cache hit ratio low | 
  [production] | 
            
  | 07:02 | 
  <jynus> | 
  reloading dbproxy1004/9 | 
  [production] | 
            
  | 07:01 | 
  <elukey> | 
  depool wdqs2004 from all services (waiting for maintenance) | 
  [production] | 
            
  | 06:32 | 
  <legoktm@deploy1001> | 
  Synchronized php-1.34.0-wmf.13/extensions/EventBus/includes/EventBus.php: Add more debugging to figure out which events are invalid: T225199 (duration: 00m 55s) | 
  [production] | 
            
  | 06:30 | 
  <legoktm@deploy1001> | 
  Synchronized php-1.34.0-wmf.14/extensions/EventBus/includes/EventBus.php: Add more debugging to figure out which events are invalid: T225199 (duration: 00m 55s) | 
  [production] | 
            
  | 06:15 | 
  <elukey> | 
  clear opcache on mwdebug* | 
  [production] | 
            
  | 05:26 | 
  <fsero> | 
  repool ms-fe2005 - T228196 | 
  [production] | 
            
  | 05:11 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-codfw.php: Repool db2116 (duration: 00m 55s) | 
  [production] | 
            
  | 04:11 | 
  <eileen> | 
  I think I didn't push the turn it on commit - tried again  process-control config revision is 9f7eba2193 | 
  [production] | 
            
  | 03:03 | 
  <eileen> | 
  process-control config revision is 7598dc1bf9 (jobs reenabled) | 
  [production] | 
            
  | 01:52 | 
  <XioNoX> | 
  enable outbound sampling on eqiad's router | 
  [production] | 
            
  | 00:52 | 
  <sbassett@deploy1001> | 
  Synchronized private/PrivateSettings.php: Add even more severe rate limits for eswikiquote and some other, smaller wikis (T227416) (duration: 00m 58s) | 
  [production] | 
            
  | 00:38 | 
  <mutante> | 
  mwmaint2001 - puppet fails - not removing a bunch of log dirs for maintenance crons | 
  [production] | 
            
  | 00:10 | 
  <dzahn@cumin1001> | 
  conftool action : set/pooled=yes; selector: name=mw2250.codfw.wmnet | 
  [production] | 
            
  | 00:08 | 
  <eileen> | 
  process-control config revision is 7598dc1bf9 - jobs disabled | 
  [production] | 
            
  | 00:04 | 
  <mutante> | 
  install1002 - exported indices for new scap version - copied back from buster to stretch - upgraded scap version on mw2250 - scap pull now works and starts to rsync (T228482, T228328, T226948) | 
  [production] | 
            
  
    | 
      
        2019-07-18
      
      §
     | 
  
    
  | 23:50 | 
  <mutante> | 
  built new scap version 3.11.1-1 on boron, copied to install1002, imported package with reprepro, copied from stretch to jessie and buster (T228482) | 
  [production] | 
            
  | 23:22 | 
  <Lucas_WMDE> | 
  Evening SWAT done | 
  [production] | 
            
  | 23:17 | 
  <lucaswerkmeister-wmde@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings-labs.php: SWAT: [[gerrit:523141|Configure Citoid+Wikibase integration on Beta (production no-op) (T228411)]] (duration: 00m 54s) | 
  [production] | 
            
  | 23:13 | 
  <lucaswerkmeister-wmde@deploy1001> | 
  Synchronized wmf-config/Wikibase.php: SWAT: [[gerrit:523140|Set $wgWBRepoSettings[enableRefTabs] in Wikibase.php (T228414)]] (duration: 01m 16s) | 
  [production] | 
            
  | 23:09 | 
  <lucaswerkmeister-wmde@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:523139|Define settings for Citoid+Wikibase integration (T228414)]] (duration: 00m 55s) | 
  [production] | 
            
  | 22:23 | 
  <gehel@puppetmaster1001> | 
  conftool action : set/pooled=inactive; selector: dc=eqiad,name=wdqs1008.eqiad.wmnet | 
  [production] | 
            
  | 22:16 | 
  <gehel@cumin1001> | 
  END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) | 
  [production] | 
            
  | 22:00 | 
  <eevans@> | 
  helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . | 
  [production] | 
            
  | 21:49 | 
  <bd808> | 
  Cleaned up stale striker logs on labweb1001 and labweb1002. Logs go to journald now so log rotate is not triggered to rotate out logs from before that change. | 
  [production] | 
            
  | 21:42 | 
  <eevans@> | 
  helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . | 
  [production] | 
            
  | 21:35 | 
  <bd808@deploy1001> | 
  Finished deploy [striker/deploy@91594df]: Fixes for deprecation warnings and editing Tool models (T228222, T228332) (duration: 01m 13s) | 
  [production] | 
            
  | 21:34 | 
  <bd808@deploy1001> | 
  Started deploy [striker/deploy@91594df]: Fixes for deprecation warnings and editing Tool models (T228222, T228332) | 
  [production] | 
            
  | 21:15 | 
  <mutante> | 
  gerrit (cobalt) - scheduled 1h downtime, rebooting for kernel upgrade | 
  [production] | 
            
  | 21:03 | 
  <jforrester@deploy1001> | 
  Synchronized php-1.34.0-wmf.14/extensions/Flow: T228290 Fix fatal in ChangesListFormatter::getLogTextLinks() (duration: 01m 02s) | 
  [production] | 
            
  | 20:57 | 
  <mutante> | 
  gerrit2001 - icinga downtime for 1h | 
  [production] | 
            
  | 20:56 | 
  <mutante> | 
  gerrit2001 - reboot for kernel upgrade | 
  [production] | 
            
  | 20:51 | 
  <mutante> | 
  gerrit2001 - apt-get upgrade; apt-get autoremove ; puppet agent -tv | 
  [production] | 
            
  | 19:55 | 
  <eevans@> | 
  helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . | 
  [production] | 
            
  | 19:33 | 
  <jforrester@deploy1001> | 
  Synchronized wmf-config/CommonSettings.php: T228374 Enable SecureLinkFixer in beta cluster (2/2) (duration: 00m 55s) | 
  [production] | 
            
  | 19:31 | 
  <jforrester@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: T228374 Enable SecureLinkFixer in beta cluster (1/2) (duration: 00m 55s) | 
  [production] | 
            
  | 19:27 | 
  <jforrester@deploy1001> | 
  Synchronized wmf-config/CommonSettings.php: T207750 Revoke editmyuserjsredirect from all users (duration: 00m 54s) | 
  [production] | 
            
  | 19:25 | 
  <otto@> | 
  helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-main' for release 'main' . | 
  [production] | 
            
  | 19:21 | 
  <otto@> | 
  helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-analytics' for release 'analytics' . | 
  [production] | 
            
  | 19:20 | 
  <eevans@> | 
  helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . | 
  [production] | 
            
  | 18:45 | 
  <mutante> | 
  contint2001 - had puppet failure in puppet board / dpkg issue due to unfinished zuul install which was done on contint1001 - stopped zuul and zuul-merger, apt-install zuul (was already latest version but needed to finish configure step), apt-get autoremove to remove unused packages, ran puppet. dpkg and puppet happy again | 
  [production] | 
            
  | 17:45 | 
  <krinkle@deploy1001> | 
  Synchronized php-1.34.0-wmf.14/includes/libs/objectcache/RedisBagOStuff.php: 69cd8b0f49e8caf8c7398ad76a1ce3d2da4f3e6b (duration: 00m 55s) | 
  [production] | 
            
  | 17:15 | 
  <Krinkle> | 
  krinkle@depoy1001: Pull down https://gerrit.wikimedia.org/r/#/c/mediawiki/extensions/CentralAuth/+/523844/ and  https://gerrit.wikimedia.org/r/#/c/mediawiki/extensions/CentralAuth/+/524276/ (no-op, not deploying) | 
  [production] |