| 2016-07-05 |
    
  | 13:58 | <elukey@palladium> | conftool action : set/weight=20; selector: mw2242.codfw.wmnet | [production] | 
            
  | 13:57 | <elukey@palladium> | conftool action : set/weight=20; selector: mw2241.codfw.wmnet | [production] | 
            
  | 13:57 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw2245.codfw.wmnet | [production] | 
            
  | 13:57 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw2244.codfw.wmnet | [production] | 
            
  | 13:57 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw2243.codfw.wmnet | [production] | 
            
  | 13:56 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw2242.codfw.wmnet | [production] | 
            
  | 13:56 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw2241.codfw.wmnet | [production] | 
            
  | 13:56 | <elukey> | pooling new codfw appservers - mw224[12345] | [production] | 
            
  | 12:32 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw1024.eqiad.wmnet | [production] | 
            
  | 12:12 | <elukey@palladium> | conftool action : set/pooled=no; selector: mw1024.eqiad.wmnet | [production] | 
            
  | 12:11 | <elukey> | depooling/re-pooling mw1024.eqiad.wmnet to temporarily set up trace8 logging (503 investigation - T73487) | [production] | 
            
  | 12:08 | <jynus> | running schema change on db1019 T73563 | [production] | 
            
  | 11:15 | <jynus@tin> | Synchronized wmf-config/db-eqiad.php: Failover all commons special roles to db1081 (duration: 00m 24s) | [production] | 
            
  | 11:00 | <jynus@tin> | Synchronized wmf-config/db-eqiad.php: Failover commons recentchanges (duration: 00m 36s) | [production] | 
            
  | 10:45 | <jynus> | SET GLOBAL read_only=0; on db1040, our new m4-master | [production] | 
            
  | 10:38 | <jynus@tin> | Synchronized wmf-config/db-eqiad.php: Failover commons master to db1040 (duration: 00m 32s) | [production] | 
            
  | 10:23 | <jynus> | archiving m3-master phlegal* databases before dropping them | [production] | 
            
  | 10:20 | <mobrovac> | restbase staging started a no-op dump on cerium to test restbase on node 4.4.6 | [production] | 
            
  | 10:05 | <elukey@palladium> | conftool action : set/weight=30; selector: mw1275.eqiad.wmnet | [production] | 
            
  | 10:05 | <elukey@palladium> | conftool action : set/weight=30; selector: mw1274.eqiad.wmnet | [production] | 
            
  | 10:05 | <elukey@palladium> | conftool action : set/weight=30; selector: mw1273.eqiad.wmnet | [production] | 
            
  | 09:59 | <elukey@palladium> | conftool action : set/weight=30; selector: mw1272.eqiad.wmnet | [production] | 
            
  | 09:31 | <_joe_> | shutting down mw1009-16 for decommissioning | [production] | 
            
  | 09:06 | <_joe_> | decommissioning mw1009-16 | [production] | 
            
  | 08:38 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw1275.eqiad.wmnet | [production] | 
            
  | 08:36 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw1274.eqiad.wmnet | [production] | 
            
  | 08:32 | <gehel> | deleting enwikisource_titlesuggest on elasticsearch codfw (index creation issue during cluster restart) | [production] | 
            
  | 08:31 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw1273.eqiad.wmnet | [production] | 
            
  | 08:23 | <elukey@palladium> | conftool action : set/pooled=yes; selector: mw1272.eqiad.wmnet | [production] | 
            
  | 08:21 | <elukey> | adding and pooling new appservers - mw127[2345].eqiad | [production] | 
            
  | 08:07 | <godog> | swift codfw-prod: ms-be202[567] weight 1500 | [production] | 
            
  | 07:55 | <jynus> | dropping etherpad_restore2 database from m1 T138516 | [production] | 
            
  | 07:40 | <akosiaris> | T138516 forcing a puppet run on cache::misc hosts after merging https://gerrit.wikimedia.org/r/297352 | [production] | 
            
  | 07:29 | <akosiaris> | T138516 stop the secondary etherpad instance on etherpad1001. etherpad-restore.wikimedia.org has served its purpose, killing it | [production] | 
            
  | 02:44 | <l10nupdate@tin> | ResourceLoader cache refresh completed at Tue Jul  5 02:44:09 UTC 2016 (duration 6m 12s) | [production] | 
            
  | 02:37 | <mwdeploy@tin> | scap sync-l10n completed (1.28.0-wmf.8) (duration: 17m 13s) | [production] | 
            
  
    | 2016-07-04 |
    
  | 20:28 | <jynus> | removing /tmp/joal/sstables on all analytics10* hosts | [production] | 
            
  | 20:22 | <jynus> | deleted 21GB worth of temporary files from analytics1050 | [production] | 
            
  | 19:58 | <aaron@tin> | Synchronized wmf-config/filebackend-production.php: Increase redis lockmanager timeout to 2 (duration: 00m 31s) | [production] | 
            
  | 19:57 | <legoktm@tin> | Synchronized php-1.28.0-wmf.8/extensions/MassMessage/: MassMessage is no longer accepting lists in the MassMessageList content model - T139303 (duration: 00m 39s) | [production] | 
            
  | 17:37 | <jynus> | testing slave_parallel_threads=5 on db1073 | [production] | 
            
  | 14:27 | <moritzm> | rebooting lithium for kernel update | [production] | 
            
  | 14:22 | <moritzm> | installing tomcat7/libservlet3.0-java security update on the kafka brokers | [production] | 
            
  | 14:06 | <_joe_> | shutting down mw1001-1008 for decommissioning | [production] | 
            
  | 14:03 | <gehel> | rolling restart of elasticsearch codfw/eqiad for kernel upgrade (T138811) | [production] | 
            
  | 13:47 | <_joe_> | stopping jobrunner on mw1011-16 as well, before decommissioning | [production] | 
            
  | 13:46 | <moritzm> | depooling mw1153-mw1160 (trusty image scalers), replaced by mw1291-mw1298 (jessie image scalers) | [production] | 
            
  | 13:44 | <godog> | ack all mr1-codfw related alerts in librenms | [production] | 
            
  | 13:43 | <akosiaris> | restart smokeping on netmon1001, temporarily disabled msw1-codfw | [production] | 
            
  | 13:38 | <gehel> | resuming writes on Cirrus / elasticsearch; this did not speed up cluster recovery | [production] |