| 
      
        2018-10-04
      
      ยง
     | 
  
    
  | 22:51 | 
  <ejegg> | 
  updated fundraising CiviCRM from 944b954bac to ebc2e0076c | 
  [production] | 
            
  | 21:27 | 
  <XioNoX> | 
  bounce phab1001 switch port - T201039 | 
  [production] | 
            
  | 20:47 | 
  <ejegg> | 
  updated fundraising CiviCRM from ddf4865650 to 944b954bac | 
  [production] | 
            
  | 20:23 | 
  <mforns@deploy1001> | 
  Finished deploy [analytics/refinery@3eb9bf2]: deploying refinery together with refinery-source v0.0.76 (duration: 00m 17s) | 
  [production] | 
            
  | 20:22 | 
  <mforns@deploy1001> | 
  Started deploy [analytics/refinery@3eb9bf2]: deploying refinery together with refinery-source v0.0.76 | 
  [production] | 
            
  | 20:10 | 
  <mforns@deploy1001> | 
  Finished deploy [analytics/refinery@3eb9bf2]: deploying refinery together with refinery-source v0.0.76 (duration: 14m 04s) | 
  [production] | 
            
  | 19:56 | 
  <mforns@deploy1001> | 
  Started deploy [analytics/refinery@3eb9bf2]: deploying refinery together with refinery-source v0.0.76 | 
  [production] | 
            
  | 19:30 | 
  <marxarelli> | 
  rise in fatals "Fatal error: entire web request took longer than 60 seconds and timed out in /srv/mediawiki/php-1.32.0-wmf.24/includes/Title.php" | 
  [production] | 
            
  | 19:26 | 
  <dduvall@deploy1001> | 
  rebuilt and synchronized wikiversions files: all wikis to 1.32.0-wmf.24 | 
  [production] | 
            
  | 19:15 | 
  <ppchelko@deploy1001> | 
  Finished deploy [cpjobqueue/deploy@6dc89c0]: Bump cirrusSearchLinksUpdate concurrency to 50 (duration: 00m 53s) | 
  [production] | 
            
  | 19:14 | 
  <ppchelko@deploy1001> | 
  Started deploy [cpjobqueue/deploy@6dc89c0]: Bump cirrusSearchLinksUpdate concurrency to 50 | 
  [production] | 
            
  | 18:49 | 
  <sbisson@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:460202|]] (duration: 00m 59s) | 
  [production] | 
            
  | 18:24 | 
  <XioNoX> | 
  bounce lvs1002:eth1 switch port | 
  [production] | 
            
  | 18:23 | 
  <sbisson@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:464510|Enable PageTriage/ORES on enwiki (T206149)]] (duration: 01m 01s) | 
  [production] | 
            
  | 18:21 | 
  <bblack> | 
  lvs1002: puppet disabled, stopping pybal (fail to 1005) | 
  [production] | 
            
  | 18:07 | 
  <_joe_> | 
  disabled notifications for etcd replication lag on conf1005, not in production | 
  [production] | 
            
  | 17:47 | 
  <banyek> | 
  repooling labsb1010 (T195747) | 
  [production] | 
            
  | 17:41 | 
  <_joe_> | 
  uploaded new python-etcd packages for jessie, stretch | 
  [production] | 
            
  | 17:38 | 
  <XioNoX> | 
  asw2-b-eqiad recabling done - T201039 | 
  [production] | 
            
  | 17:34 | 
  <elukey> | 
  pool kafka1002 (eventbus) after maintenance | 
  [production] | 
            
  | 17:22 | 
  <elukey> | 
  re-enable ircecho after alarms shower | 
  [production] | 
            
  | 17:15 | 
  <andrewbogott> | 
  triggering some alerts on labvirt1018 to figure out about alert thresholds | 
  [production] | 
            
  | 17:06 | 
  <elukey> | 
  stop ircecho on einstenium - alarms shower | 
  [production] | 
            
  | 17:02 | 
  <gtirloni> | 
  tools - published updated toollabs-* Docker images | 
  [production] | 
            
  | 16:54 | 
  <ejegg> | 
  updated standalone SmashPig deploy from 82f9d49c23 to 5f21d3f2db | 
  [production] | 
            
  | 16:52 | 
  <XioNoX> | 
  Step 3)  Add missing links - T201039 | 
  [production] | 
            
  | 16:45 | 
  <shdubsh> | 
  etherpad1001 running systemctl reset-failed | 
  [production] | 
            
  | 16:41 | 
  <XioNoX> | 
  Connect/enable fpc2:0/51-fpc5:1/0 (5m DAC) - T201039 | 
  [production] | 
            
  | 16:39 | 
  <XioNoX> | 
  Enable fpc5-fpc7 - T201039 | 
  [production] | 
            
  | 16:33 | 
  <twentyafterfour> | 
  started phd on phab1001 and re-enabled puppet (I had it disabled to prevent starting phd during read-only) | 
  [production] | 
            
  | 16:25 | 
  <twentyafterfour> | 
  phabricator is read-write | 
  [production] | 
            
  | 16:21 | 
  <jynus> | 
  reloading dbproxy1003,8 | 
  [production] | 
            
  | 16:16 | 
  <marostegui> | 
  Stop and reboot db1072 (phabricator master) for maintenance | 
  [production] | 
            
  | 16:16 | 
  <twentyafterfour> | 
  phabricator is read-only | 
  [production] | 
            
  | 16:14 | 
  <XioNoX> | 
  Enable all VC ports on FPC2 and FPC7 - T201039 | 
  [production] | 
            
  | 16:13 | 
  <XioNoX> | 
  starting asw2-b-eqiad re-cabling - T201039 | 
  [production] | 
            
  | 16:08 | 
  <twentyafterfour> | 
  logged downtime for phabricator in icinga, stopped phd queue processing in preparation for read-only mode | 
  [production] | 
            
  | 16:07 | 
  <jynus> | 
  reloading haproxy @ dbproxy1005 | 
  [production] | 
            
  | 16:00 | 
  <marostegui> | 
  Stop MySQL on db1073 for mariadb and kernel upgrade - T201039 T148507 | 
  [production] | 
            
  | 15:58 | 
  <arturo> | 
  icinga downtime every server in the main cloudvps deployment for 2h T201039 | 
  [production] | 
            
  | 15:56 | 
  <arturo> | 
  icinga downtime every server with the cloudXXXX scheme for 2h T201039 | 
  [production] | 
            
  | 15:54 | 
  <ppchelko@deploy1001> | 
  Finished deploy [cpjobqueue/deploy@55dbb8b]: Proper reconnect on topics change T199444 (duration: 00m 55s) | 
  [production] | 
            
  | 15:53 | 
  <ppchelko@deploy1001> | 
  Started deploy [cpjobqueue/deploy@55dbb8b]: Proper reconnect on topics change T199444 | 
  [production] | 
            
  | 15:52 | 
  <ppchelko@deploy1001> | 
  Finished deploy [changeprop/deploy@5d00448]: Proper reconnect on topics change T199444 (duration: 01m 40s) | 
  [production] | 
            
  | 15:51 | 
  <ppchelko@deploy1001> | 
  Started deploy [changeprop/deploy@5d00448]: Proper reconnect on topics change T199444 | 
  [production] | 
            
  | 15:41 | 
  <elukey> | 
  depool kafka1002 from eventbus as precautionary step for T201039 | 
  [production] | 
            
  | 14:48 | 
  <banyek> | 
  depooling labsb1010 (T195747) | 
  [production] | 
            
  | 14:09 | 
  <marostegui> | 
  Sanitize enwikivoyage cebwiki shwiki srwiki mgwiktionary on db1124:3315 T184805 | 
  [production] | 
            
  | 13:46 | 
  <pmiazga@deploy1001> | 
  Finished deploy [proton/deploy@ecb9a0e]: Bugfix:handle undefined response and fix grafana stats (T186748,T201158) (duration: 02m 55s) | 
  [production] | 
            
  | 13:43 | 
  <pmiazga@deploy1001> | 
  Started deploy [proton/deploy@ecb9a0e]: Bugfix:handle undefined response and fix grafana stats (T186748,T201158) | 
  [production] |