| 
      
        2017-06-21
      
      ยง
     | 
  
    
  | 13:16 | 
  <marostegui> | 
  Deploy alter table s5 - labsdb1001 - T166207 | 
  [production] | 
            
  | 13:15 | 
  <marostegui> | 
  Deploy alter table s5 - db1045 - T166207 | 
  [production] | 
            
  | 13:14 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-eqiad.php: Depool db1045 - T166207 (duration: 00m 44s) | 
  [production] | 
            
  | 13:08 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-eqiad.php: Repool db1070 - T166207 (duration: 00m 46s) | 
  [production] | 
            
  | 13:05 | 
  <elukey> | 
  reboot analytics1003 (Hue, Camus, Oozie, Hive master) for kernel upgrade | 
  [production] | 
            
  | 12:32 | 
  <gehel> | 
  deploying T167871 and restarting kartotherian / tilerator on maps eqiad | 
  [production] | 
            
  | 12:32 | 
  <moritzm> | 
  rebooting mw1189-mw1199 for kernel update | 
  [production] | 
            
  | 12:10 | 
  <akosiaris@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=sca1004.eqiad.wmnet | 
  [production] | 
            
  | 12:09 | 
  <akosiaris@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=mwdebug1002.eqiad.wmnet | 
  [production] | 
            
  | 11:59 | 
  <moritzm> | 
  rebooting mw1209-mw1220 for kernel update | 
  [production] | 
            
  | 11:45 | 
  <moritzm> | 
  rebooting mediawiki api servers in codfw for kernel update | 
  [production] | 
            
  | 11:42 | 
  <akosiaris> | 
  rollback change in asw-a-eqiad for ganeti interface range due to alerts | 
  [production] | 
            
  | 11:23 | 
  <akosiaris> | 
  reboot ganeti1007 for insertion into ganeti cluster | 
  [production] | 
            
  | 11:14 | 
  <elukey> | 
  reboot aqs1006 for kernel update | 
  [production] | 
            
  | 11:04 | 
  <moritzm> | 
  rebooting mw1180-mw1188 for kernel update | 
  [production] | 
            
  | 11:02 | 
  <akosiaris> | 
  starting up all instances on ganeti01.svc.codfw.wmnet | 
  [production] | 
            
  | 11:01 | 
  <godog> | 
  reimage ms-be1018 / 1019 with stretch | 
  [production] | 
            
  | 10:58 | 
  <ema> | 
  reboot lvs[2004-2006] (codfw secondaries) for kernel update | 
  [production] | 
            
  | 10:50 | 
  <akosiaris> | 
  rebooting all ganeti200X nodes | 
  [production] | 
            
  | 10:47 | 
  <akosiaris> | 
  shutdown all VMs on the ganeti01.svc.codfw.wmnet cluster | 
  [production] | 
            
  | 10:43 | 
  <elukey> | 
  reboot analytics1001 (Hadoop master) for kernel update | 
  [production] | 
            
  | 10:35 | 
  <akosiaris> | 
  rebooting the entire codfw ganeti cluster for kernel upgrades. Silenced hosts in icinga already. T167643 | 
  [production] | 
            
  | 10:30 | 
  <moritzm> | 
  rebooting bast4001 for kernel update | 
  [production] | 
            
  | 10:21 | 
  <ema> | 
  reboot lvs[1001-1003] (eqiad primaries) for kernel update | 
  [production] | 
            
  | 10:17 | 
  <elukey> | 
  running a script in tmux on rdb[12]003 called "check" to dump periodically LLEN enwiki:jobqueue:enqueue:l-unclaimed and stopped the one on rdb2004 | 
  [production] | 
            
  | 10:07 | 
  <ema> | 
  reboot lvs[1004-1006] (eqiad secondaries) for kernel update | 
  [production] | 
            
  | 10:01 | 
  <elukey> | 
  reboot analytics1002 (Hadoop master standby) for kernel update | 
  [production] | 
            
  | 10:01 | 
  <moritzm> | 
  rebooting auth* servers for kernel update | 
  [production] | 
            
  | 09:48 | 
  <ema> | 
  reboot lvs[1010-1012] for kernel update | 
  [production] | 
            
  | 09:48 | 
  <elukey> | 
  reboot aqs1005 for kernel update | 
  [production] | 
            
  | 09:10 | 
  <elukey> | 
  reboot kafka2001 for kernel update (eventbus codfw) | 
  [production] | 
            
  | 09:06 | 
  <moritzm> | 
  rebooting restbase1017 for kernel update | 
  [production] | 
            
  | 08:52 | 
  <oblivian@puppetmaster1001> | 
  conftool action : set/pooled=inactive; selector: name=restbase2001.codfw.wmnet,dc=codfw,service=restbase | 
  [production] | 
            
  | 08:49 | 
  <_joe_> | 
  correction: restarting pybal | 
  [production] | 
            
  | 08:49 | 
  <_joe_> | 
  restarting etcd on lvs2003/2006, connection lost to etcd | 
  [production] | 
            
  | 08:34 | 
  <elukey> | 
  reboot kafka1012 for kernel upgrades | 
  [production] | 
            
  | 08:34 | 
  <marostegui> | 
  Deploy alter table db1070 s5 - T166207 | 
  [production] | 
            
  | 08:33 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-eqiad.php: Depool db1070 - T166207 (duration: 00m 44s) | 
  [production] | 
            
  | 08:27 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-eqiad.php: Repool db1082 - T166207 (duration: 00m 45s) | 
  [production] | 
            
  | 08:26 | 
  <godog> | 
  reimage ms-be1014 / 1015 with jessie | 
  [production] | 
            
  | 07:37 | 
  <marostegui> | 
  Stop and reset slave s5 on dbstore2001 - T168354 | 
  [production] | 
            
  | 06:23 | 
  <mutante> | 
  planet2001 wget missing unpuppetized logo file from https://en.planet.wikimedia.org/images/planet-wm2.png - should fix puppet run | 
  [production] | 
            
  | 06:19 | 
  <marostegui> | 
  Stop replication and puppet on db2066 for maintenance - T168354 | 
  [production] | 
            
  | 06:18 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-codfw.php: Depool db2066 - T168354 (duration: 00m 43s) | 
  [production] | 
            
  | 06:08 | 
  <elukey> | 
  reboot thorium for kernel upgrades (outage to all the analytics websites) | 
  [production] | 
            
  | 06:05 | 
  <marostegui> | 
  Deploy alter table s5 - db1082 - T166207 | 
  [production] | 
            
  | 06:04 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-eqiad.php: Depool db1082 - T166207 (duration: 00m 44s) | 
  [production] | 
            
  | 06:04 | 
  <marostegui> | 
  Deploy alter table s5 - dbstore1002 - T166207 | 
  [production] | 
            
  | 05:59 | 
  <elukey> | 
  reboot stat100[2,3,4] for kernel upgrades | 
  [production] | 
            
  | 05:57 | 
  <marostegui@tin> | 
  Synchronized wmf-config/db-eqiad.php: Repool db1087 - T166207 (duration: 00m 44s) | 
  [production] |