| 2015-01-16
      
      § | 
    
  | 23:33 | <bd808> | ran `LTRIM logstash -50000 9999999` on redis queues to drop ~4M events in backlog | [production] | 
            
  | 22:14 | <bd808> | restarted elasticsearch on logstash1001; OOM errors | [production] | 
            
  | 21:21 | <bd808> | restarted elasticsearch on logstash1001 | [production] | 
            
  | 21:18 | <marktraceur> | Finished scap: Fix UploadWizard regression and EventLogging errors (duration: 31m 06s) | [production] | 
            
  | 21:17 | <bd808> | OOM for elasticsearch on logstash1001 caused a dropped shard and icinga alerts | [production] | 
            
  | 20:47 | <marktraceur> | Started scap: Fix UploadWizard regression and EventLogging errors | [production] | 
            
  | 20:17 | <bd808> | Synchronized wmf-config/InitialiseSettings.php: Allow wgDebugLogGroups to exclude logstash append (e808e690) (duration: 00m 05s) | [production] | 
            
  | 20:17 | <bd808> | Synchronized wmf-config/logging.php: Allow wgDebugLogGroups to exclude logstash append (e808e690) (duration: 00m 07s) | [production] | 
            
  | 18:13 | <bd808> | document count not changing for logstash-2015.01.16 index | [production] | 
            
  | 17:59 | <bd808> | Synchronized wmf-config/logging-labs.php: beta: Allow wgDebugLogGroups to exclude logstash append (03c3ab27) (duration: 00m 06s) | [production] | 
            
  | 17:50 | <bblack> | depooled amssq42 text cache in esams | [production] | 
            
  | 17:44 | <ejegg> | updated tools from 88b57fea517d2232e8ae906df550f426b6574f24 to 84442d51a841af4265ff103827cda83d5dd9dc54 | [production] | 
            
  | 17:24 | <demon> | Synchronized wmf-config/: (no message) (duration: 00m 05s) | [production] | 
            
  | 17:21 | <ejegg> | updated civicrm from d648ededf5c9fc2b0ebf989300ca2037956418e3 to 4fa10ec9e3afbf65e6cbd523138cdc4b4485c482 | [production] | 
            
  | 17:17 | <demon> | Synchronized wmf-config/: (no message) (duration: 00m 06s) | [production] | 
            
  | 16:48 | <ottomata> | finished hadoop namenode migration.  Hadoop cluster is back online | [production] | 
            
  | 16:48 | <bd808> | Upgraded elasticsearch and restarted on all logstash nodes | [production] | 
            
  | 16:43 | <bd808> | shutdown whole elasticsearch cluster for logstash | [production] | 
            
  | 16:39 | <bd808> | restarted elasticsearch on logstash1001 | [production] | 
            
  | 16:07 | <ottomata> | stopping hadoop cluster | [production] | 
            
  | 08:01 | <springle> | Synchronized wmf-config/db-eqiad.php: repool db1051 db1056, warm up (duration: 00m 10s) | [production] | 
            
  | 05:54 | <ori> | <jgage> mtr shows me packet loss between cr2-eqiad.wikimedia.org and 206.126.236.21 aka eqixva-google-gige.google.com | [production] | 
            
  | 04:40 | <LocalisationUpdate> | ResourceLoader cache refresh completed at Fri Jan 16 04:40:10 UTC 2015 (duration 40m 9s) | [production] | 
            
  | 04:22 | <Tim> | on mw1228 doing some tests to figure out why incorrect Expires header is being sent on requests for /images/* | [production] | 
            
  | 03:09 | <ori> | Synchronized php-1.25wmf14/includes/content/JsonContent.php: I2f4f9cb343: Let subclasses specify content model in JsonContent (duration: 00m 06s) | [production] | 
            
  | 03:01 | <springle> | xtrabackup clone db1020 to db1046 | [production] | 
            
  | 02:31 | <LocalisationUpdate> | completed (1.25wmf15) at 2015-01-16 02:31:37+00:00 | [production] | 
            
  | 02:31 | <l10nupdate> | Synchronized php-1.25wmf15/cache/l10n: (no message) (duration: 00m 01s) | [production] | 
            
  | 02:19 | <LocalisationUpdate> | completed (1.25wmf14) at 2015-01-16 02:19:04+00:00 | [production] | 
            
  | 02:19 | <l10nupdate> | Synchronized php-1.25wmf14/cache/l10n: (no message) (duration: 00m 01s) | [production] | 
            
  | 02:06 | <ori> | EventLogging syncs were of I335ad42bb: JsonSchemaContent: Fix html rendering of objects and arrays | [production] | 
            
  | 02:03 | <ori> | Synchronized php-1.25wmf14/extensions/EventLogging: (no message) (duration: 00m 05s) | [production] | 
            
  | 02:03 | <ori> | Synchronized php-1.25wmf15/extensions/EventLogging: (no message) (duration: 00m 06s) | [production] | 
            
  | 00:47 | <mutante> | on both puppetmasters: chown gitpuppet /var/lib/git/operations/puppet/.git/logs/refs/heads/production & .git/logs/HEAD & .git/logs/refs/remotes/origin to fix puppet-merge. git pulled on strontium | [production] | 
            
  | 00:46 | <mutante> | restarted morebots | [production] | 
            
  
    | 2015-01-15
      
      § | 
    
  | 22:08 | <bd808> | restarted elasticsaerch on logstash1003; died from OOM | [production] | 
            
  | 21:06 | <subbu> | deployed parsoid version 2fdf9298 | [production] | 
            
  | 20:38 | <ori> | Synchronized wmf-config/InitialiseSettings.php: I250ecfceb: Switch all wikis to monolog logger (duration: 00m 05s) | [production] | 
            
  | 20:04 | <bd808> | logstash redis queue backlog 384k events and climbing; likely related to the elasticsearch cluster flapping | [production] | 
            
  | 19:57 | <Coren> | aborting labs filesystem move (not enough contiguous free space) and postponing until new shelf | [production] | 
            
  | 18:59 | <YuviPanda> | this works? | [production] | 
            
  | 18:23 | <csteipp> | deployed patches for T85349 T85850 T86711 | [production] | 
            
  | 17:26 | <ejegg> | updated crm from bb05adf9279bd7a795906ca476e1850a85c21711 to d648ededf5c9fc2b0ebf989300ca2037956418e3 | [production] | 
            
  | 16:51 | <demon> | Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 06s) | [production] | 
            
  | 16:09 | <bd808> | Deleted 2015-12-* indices from logstash elasticsearch cluster | [production] | 
            
  | 16:07 | <anomie> | Synchronized php-1.25wmf14/extensions/FlaggedRevs/api/actions/ApiReview.php: SWAT: Fix FlaggedRevs action=review for binary flagging [[gerrit:185180]] (duration: 00m 07s) | [production] | 
            
  | 16:01 | <bd808> | Elasticsearch cluster for logstash has indices for events dated 2015-12-* again | [production] | 
            
  | 15:49 | <Jeff_Green> | many frack host package updates and reboots | [production] | 
            
  | 10:09 | <aude> | Synchronized php-1.25wmf14/extensions/Wikidata: fix noexternallanglinks bug (duration: 00m 13s) | [production] | 
            
  | 08:24 | <qchris> | Ran kafka leader re-election to bring analytics1021 back into the set of leaders | [production] |