| 2016-10-24
      
      § | 
    
  | 14:57 | <paravoid> | restarting ferm on es2015 | [production] | 
            
  | 14:54 | <bblack> | starting ferm server on eeden, radon | [production] | 
            
  | 14:41 | <gehel@puppetmaster1001> | conftool action : set/pooled=yes; selector: dc=eqiad,cluster=maps,service=kartotherian,name=maps1002.eqiad.wmnet | [production] | 
            
  | 14:38 | <dereckson@mira> | Synchronized wmf-config/CommonSettings.php: Toggle wgDefaultUserOptions['watchdefault'] on for cs.wikipedia, off elsewhere (T148328, 2/2) (duration: 00m 50s) | [production] | 
            
  | 14:36 | <dereckson@mira> | Synchronized wmf-config/InitialiseSettings.php: Toggle wgDefaultUserOptions['watchdefault'] on for cs.wikipedia, off elsewhere (T148328, 1/2) (duration: 00m 54s) | [production] | 
            
  | 14:36 | <bblack> | disabling puppet on all caches ahead of port# work, to test - T107749 / https://gerrit.wikimedia.org/r/#/c/317405 | [production] | 
            
  | 14:29 | <yurik> | re-deployed current kartotherian to all servers (maps1002 & maps-test* were stale) | [production] | 
            
  | 14:11 | <marostegui> | Deploy schema change s5 dewiki.revision - only codfw T148967 | [production] | 
            
  | 14:03 | <l10nupdate@mira> | ResourceLoader cache refresh completed at Mon Oct 24 14:03:07 UTC 2016 (duration 6m 17s) | [production] | 
            
  | 13:56 | <dereckson@mira> | scap sync-l10n completed (1.28.0-wmf.22) (duration: 10m 46s) | [production] | 
            
  | 13:42 | <bblack> | restarting all varnish frontends (serially per-cluster with proper depooling, etc) | [production] | 
            
  | 13:20 | <elukey> | reimaging mc120[89] and mc1030 | [production] | 
            
  | 13:18 | <Dereckson> | Started manually l10nupdate, as it didn't run for 6 days, and more especially to fix T148921 user-facing issue. | [production] | 
            
  | 13:13 | <dereckson@mira> | Synchronized wmf-config/throttle.php: Edit-a-thon BDA (Poitiers) throttle rule (T148852) (duration: 01m 13s) | [production] | 
            
  | 10:47 | <elukey> | reimaged mc102[56], currently doing mc1027 | [production] | 
            
  | 10:21 | <_joe_> | rebooting kubernetes1002 | [production] | 
            
  | 09:20 | <mobrovac> | change-prop deploying c7feda2 | [production] | 
            
  | 09:09 | <mobrovac> | restbase deploy end of f9017ad | [production] | 
            
  | 08:55 | <akosiaris> | rebooting cobalt (gerrit) for kernel upgrades | [production] | 
            
  | 08:53 | <elukey> | reimaging mc1024 | [production] | 
            
  | 08:46 | <mobrovac> | restbase deploy start of f9017ad | [production] | 
            
  | 08:38 | <gehel> | continue rolling restart of elasticsearch eqiad cluster | [production] | 
            
  | 08:38 | <hashar> | Restarting gallium (Jenkins/Zuul) for kernel upgrades | [production] | 
            
  | 08:36 | <akosiaris> | rebooting labnodepool1001 for kernel upgrades | [production] | 
            
  | 08:36 | <akosiaris> | rebooting scandium for kernel upgrades | [production] | 
            
  | 08:33 | <hashar> | rebooting contint1001 | [production] | 
            
  | 08:20 | <elukey> | reimaging mc1023.eqiad.wmnet | [production] | 
            
  | 07:46 | <elukey> | reimaging mc1022.eqiad.wmnet (T137345) | [production] | 
            
  | 07:09 | <marosteg1i> | Deploying alter table s1.enwiki on codfw - T147166 | [production] | 
            
  
    | 2016-10-21
      
      § | 
    
  | 23:45 | <mutante> | depooling maps1002 (by running "depool" on the server itself) | [production] | 
            
  | 23:35 | <yurik> | maps1002.eqiad is running older/incorrect/misbehaving software for some reason, restart didn't help. Need to depool | [production] | 
            
  | 22:17 | <mutante> | cp4006,cp4014 gzipped some logs in home for disk space | [production] | 
            
  | 22:08 | <mutante> | cp4006, cp4014 were running out of disk, apt-get clean | [production] | 
            
  | 21:40 | <mutante> | phab2001 that IP was also on iridium/phab1001, it should not be hardcoded in puppet, causing issues in T143363 | [production] | 
            
  | 21:37 | <mutante> | phab2001 - ip addr del 10.64.32.186/21 dev eth0 | [production] | 
            
  | 21:06 | <bblack> | restarting varnish backends (depooled, etc) for eqiad cache_upload: cp1049, cp1072, cp1074 | [production] | 
            
  | 19:50 | <cmjohnson1> | dataset1001 array 1 swap failed disk slot 4 | [production] | 
            
  | 19:40 | <cmjohnson1> | labvirt1005 swapping disk 0 | [production] | 
            
  | 19:40 | <gehel> | routing traffic for cache-maps in codfw -> eqiad | [production] | 
            
  | 19:29 | <gehel> | running puppet on eqiad cache nodes to activate maps traffic redirection | [production] | 
            
  | 19:06 | <gehel> | shutting down cassandra on maps2004, seems to have lost data | [production] | 
            
  | 18:22 | <ejegg> | updated SmashPig from d1ca0632d00dfb608f70ca4b70251a5ba49f4411 to e28b2cd9f0c1429acdd2a08c68f95884dbffb594 | [production] | 
            
  | 16:45 | <ejegg> | updated fundraising tools from 09ae6e24d8ca8350dc099d63a6ca0d9ec9fdef2b to f83e39291adc55677fc4b49307dc4807eba18019 | [production] | 
            
  | 16:33 | <mutante> | rebooting planet1001 - *.planet.wm.org will be right back | [production] |