| 
      
        2010-07-14
      
      §
     | 
  
    
  | 23:44 | 
  <Fred> | 
  re-added ccron job to periodically save rrds on our ganglia server. (cron job seems to have vanished for some reason) | 
  [production] | 
            
  | 17:59 | 
  <catrope> | 
  synchronized php-1.5/wmf-config/InitialiseSettings.php  'Favicon for wikimaniateamwiki per Guillaume' | 
  [production] | 
            
  | 16:06 | 
  <Fred> | 
  restarted apache on mobile1 (had begun to return 500) | 
  [production] | 
            
  | 14:07 | 
  <mark> | 
  Fixed memcached on srv110 | 
  [production] | 
            
  | 12:19 | 
  <mark> | 
  Fixed ganglia and puppet on stafford | 
  [production] | 
            
  | 11:54 | 
  <mark> | 
  Migrated DNS monitoring to puppet | 
  [production] | 
            
  | 10:31 | 
  <mark> | 
  Migrated ZFS RAID nagios check to puppet | 
  [production] | 
            
  | 10:14 | 
  <mark> | 
  Migrated monitoring of lucene to puppet | 
  [production] | 
            
  | 09:37 | 
  <mark> | 
  Migrated monitoring of image scalers to puppet | 
  [production] | 
            
  | 08:49 | 
  <Tim> | 
  using stafford for some pbuilder experimentation | 
  [production] | 
            
  
    | 
      
        2010-07-12
      
      §
     | 
  
    
  | 16:54 | 
  <Fred> | 
  changed LONGQUERIES check threshold  | 
  [production] | 
            
  | 16:08 | 
  <Fred> | 
  restarting morebots since it had died. | 
  [production] | 
            
  | 16:08 | 
  <Fred> | 
  restarting Nagios since it was down. | 
  [production] | 
            
  | 14:29 | 
  <mark> | 
  Added "cfg_file=/etc/nagios/puppet_hosts.cfg" to nagios.cfg | 
  [production] | 
            
  | 13:25 | 
  <JeLuF> | 
  added disk space monitoring for apaches | 
  [production] | 
            
  | 12:51 | 
  <jeluf> | 
  synchronized php-1.5/wmf-config/InitialiseSettings.php  '24306 - Create namespaces for Lithuanian Wiktionary' | 
  [production] | 
            
  | 12:48 | 
  <jeluf> | 
  synchronized php-1.5/wmf-config/InitialiseSettings.php  '24321 - ml.wikiquote.org lost its project namespace' | 
  [production] | 
            
  | 12:46 | 
  <jeluf> | 
  synchronized php-1.5/wmf-config/InitialiseSettings.php  '24321 - ml.wikiquote.org lost its project namespace' | 
  [production] | 
            
  | 12:41 | 
  <jeluf> | 
  synchronized php-1.5/wmf-config/InitialiseSettings.php  '24344 - Namespace changes - si.wiktionary' | 
  [production] | 
            
  | 11:45 | 
  <JeLuF> | 
  fixed broken ganglia-metrics installation on srv146 (chown gmetric /var/log/gmetricd/gmetricd.log) | 
  [production] | 
            
  | 11:41 | 
  <JeLuF> | 
  added DPKG status monitoring for all app servers to nagios. Reports all packages that are not in state 'rc' or 'ii'. | 
  [production] | 
            
  | 10:43 | 
  <JeLuF> | 
  lots of false alerts from nagios due to missing SSL setup for NRPE. Working on it. | 
  [production] | 
            
  | 09:53 | 
  <JeLuF> | 
  changed puppet config to install nrpe on all app servers | 
  [production] | 
            
  | 09:28 | 
  <JeLuF> | 
  replacing opsview-nrpe agents by nagios-nrpe agents (image_scalers, some other apaches). Most apaches already use nagios-nrpe | 
  [production] | 
            
  | 07:40 | 
  <Tim> | 
  set up NRPE disk space monitoring on ms4, discovered that /mnt2 is full | 
  [production] | 
            
  | 04:54 | 
  <Tim> | 
  updated NFS host/service groups to monitor the actual NFS servers, not a random collection of miscellaneous ex-NFS servers | 
  [production] | 
            
  | 04:46 | 
  <Tim> | 
  installed NRPE on nfs1 and nfs2 | 
  [production] | 
            
  | 04:08 | 
  <Tim> | 
  adding rendering, m, bits.esams, recursor0, recursor1, recursor0.esams to nagios | 
  [production] | 
            
  | 04:02 | 
  <Tim> | 
  added forward DNS entry for recursor0.esams, modified reverse DNS entry resolver0.esams -> recursor0.esams | 
  [production] | 
            
  | 03:55 | 
  <Tim> | 
  fixed reverse DNS entries for recursor0 and recursor1, were set incorrectly to non-existent hostnames "resolver0" and "recursor1" | 
  [production] | 
            
  | 03:36 | 
  <Tim> | 
  renamed db6.mgmt to locke.mgmt | 
  [production] | 
            
  
    | 
      
        2010-07-09
      
      §
     | 
  
    
  | 18:07 | 
  <domas> | 
  forgot to log, rebooted locke, put startup stuff to rc.local, maybe Tim changed it afterwards, hehe. beer is good too. | 
  [production] | 
            
  | 15:31 | 
  <Rob> | 
  wikimania2011wiki is now using vector | 
  [production] | 
            
  | 15:31 | 
  <robh> | 
  synchronized php-1.5/wmf-config/InitialiseSettings.php  | 
  [production] | 
            
  | 12:48 | 
  <robh> | 
  ran sync-common-all  | 
  [production] | 
            
  | 01:06 | 
  <tstarling> | 
  synchronized php-1.5/includes/filerepo/RepoGroup.php  | 
  [production] | 
            
  | 01:04 | 
  <tstarling> | 
  synchronized php-1.5/includes/filerepo/RepoGroup.php  | 
  [production] | 
            
  | 01:04 | 
  <root> | 
  synchronized php-1.5/includes/filerepo/RepoGroup.php  | 
  [production] | 
            
  | 01:03 | 
  <tstarling> | 
  synchronized php-1.5/includes/filerepo/RepoGroup.php  | 
  [production] | 
            
  | 00:59 | 
  <tstarling> | 
  synchronized php-1.5/includes/filerepo/RepoGroup.php  | 
  [production] |