| 
      
        2019-08-13
      
      ยง
     | 
  
    
  | 12:08 | 
  <_joe_> | 
  restarted php-fpm on mw1221 | 
  [production] | 
            
  | 12:03 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'sessionstore' for release 'production' . | 
  [production] | 
            
  | 12:00 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . | 
  [production] | 
            
  | 11:56 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' . | 
  [production] | 
            
  | 11:56 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' . | 
  [production] | 
            
  | 11:49 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' . | 
  [production] | 
            
  | 11:44 | 
  <fsero> | 
  recreating cxserver blubber and sessionstore namespace - T228836 | 
  [production] | 
            
  | 11:39 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'mathoid' for release 'production' . | 
  [production] | 
            
  | 11:35 | 
  <gehel> | 
  restart wdqs-blazegraph on wdqs2001 | 
  [production] | 
            
  | 11:34 | 
  <gehel> | 
  restart wdqs-updater on wdqs2001 | 
  [production] | 
            
  | 11:30 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-main' for release 'main' . | 
  [production] | 
            
  | 11:29 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-analytics' for release 'analytics' . | 
  [production] | 
            
  | 11:25 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'citoid' for release 'production' . | 
  [production] | 
            
  | 11:21 | 
  <fsero> | 
  recreating citoid eventgate-analytics eventgate-main mathoid namespace - T228836 | 
  [production] | 
            
  | 11:20 | 
  <fsero@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'termbox' for release 'production' . | 
  [production] | 
            
  | 11:18 | 
  <raynor> | 
  EU SWAT finished | 
  [production] | 
            
  | 11:15 | 
  <pmiazga@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:529925|Undeploy editor gender surveys (T227793)]] (duration: 00m 48s) | 
  [production] | 
            
  | 11:13 | 
  <fsero> | 
  recreating termbox namespace - T228836 | 
  [production] | 
            
  | 11:06 | 
  <oblivian@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'zotero' for release 'production' . | 
  [production] | 
            
  | 11:04 | 
  <fsero> | 
  resetting net.netfilter.nf_conntrack_tcp_timeout_time_wait to 65 in kubernetes2006 | 
  [production] | 
            
  | 10:59 | 
  <_joe_> | 
  [eqiad] downtiming zotero on icinga for 10 minutes while recreating the deployment with helmfile | 
  [production] | 
            
  | 10:57 | 
  <oblivian@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | 
  [production] | 
            
  | 10:57 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.downtime | 
  [production] | 
            
  | 10:56 | 
  <oblivian@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | 
  [production] | 
            
  | 10:56 | 
  <oblivian@cumin1001> | 
  START - Cookbook sre.hosts.downtime | 
  [production] | 
            
  | 10:49 | 
  <oblivian@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'coredns' . | 
  [production] | 
            
  | 10:44 | 
  <oblivian@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'coredns' . | 
  [production] | 
            
  | 10:39 | 
  <oblivian@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'rbac-deploy-clusterrole' . | 
  [production] | 
            
  | 10:39 | 
  <_joe_> | 
  recreating rbac roles via helmfile | 
  [production] | 
            
  | 10:32 | 
  <oblivian@> | 
  helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . | 
  [production] | 
            
  | 10:29 | 
  <_joe_> | 
  deleting calico deploy and configmap in kubernetes in eqiad, recreating with helmfile | 
  [production] | 
            
  | 10:25 | 
  <jbond42> | 
  rolling update of ghostscript | 
  [production] | 
            
  | 10:23 | 
  <fsero@puppetmaster1001> | 
  conftool action : set/pooled=false; selector: dnsdisc=sessionstore|citoid|cxserver|eventgate-analytics|eventgate-main|termbox|blubberoid|mathoid|zotero,name=eqiad | 
  [production] | 
            
  | 10:10 | 
  <fsero> | 
  initialize_cluster.sh kube-system kubemaster.svc.eqiad.wmnet 6443 - T228836 | 
  [production] | 
            
  | 10:10 | 
  <fsero> | 
  creating tiller in kube-system for helmfile T228836 | 
  [production] | 
            
  | 09:58 | 
  <vgutierrez> | 
  upgrading the rest of cache@upload to 8.0.3-1wm3 - T221594 | 
  [production] | 
            
  | 08:49 | 
  <marostegui> | 
  Stop MySQL on db2057 - T230394 | 
  [production] | 
            
  | 08:48 | 
  <marostegui> | 
  Remove db2057 from tendril and zarcillo T230394 | 
  [production] | 
            
  | 07:55 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-eqiad.php: Remove db2057 from config T230394 (duration: 00m 47s) | 
  [production] | 
            
  | 07:54 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-codfw.php: Remove db2057 from config T230394 (duration: 00m 48s) | 
  [production] | 
            
  | 06:59 | 
  <volans> | 
  upgrading spicerack to 0.0.26 on cumin2001 | 
  [production] | 
            
  | 06:49 | 
  <vgutierrez> | 
  Rolling restart of fifo-log-demux and atsmtail services across cache@upload | 
  [production] | 
            
  | 06:38 | 
  <vgutierrez> | 
  upgrading fifo-log-demux to version 0.5 in cache@upload | 
  [production] | 
            
  | 06:11 | 
  <vgutierrez> | 
  Upgrading ATS to 8.0.3-1wm3 in cp2002, cp1076, cp3034 and cp4021 - T221594 | 
  [production] | 
            
  | 05:47 | 
  <marostegui> | 
  Stop mysql on db2050 - T230391 | 
  [production] | 
            
  | 05:40 | 
  <marostegui> | 
  Remove db2050 from tendril and zarcillo T230391 | 
  [production] | 
            
  | 05:35 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Remove db2050 from config, host will be decommissioned T230391', diff saved to https://phabricator.wikimedia.org/P8904 and previous config saved to /var/cache/conftool/dbconfig/20190813-053514-marostegui.json | 
  [production] | 
            
  | 05:33 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-codfw.php: Remove db2050 from config T230391 (duration: 00m 48s) | 
  [production] | 
            
  | 05:32 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-eqiad.php: Remove db2050 from config T230391 (duration: 00m 48s) | 
  [production] | 
            
  | 05:12 | 
  <marostegui@deploy1001> | 
  Synchronized wmf-config/db-eqiad.php: Provision db2122 into s7 T228969 (duration: 00m 47s) | 
  [production] |