| 2020-02-06
      
      ยง | 
    
  | 15:28 | <vgutierrez> | pooling ncredir4002 running buster - T243391 | [production] | 
            
  | 15:27 | <moritzm> | installing sudo security updates on jessie | [production] | 
            
  | 15:23 | <vgutierrez> | pooling cp4025 with buster - T242093 | [production] | 
            
  | 15:14 | <ema> | A:mw-api: force puppet run to increase keepalive_requests from 100 to 200 https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/570670/ T241145 | [production] | 
            
  | 15:09 | <vgutierrez@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 15:07 | <vgutierrez@cumin1001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 14:59 | <godog> | extend graphite1004 / graphite2003 fs +200G | [production] | 
            
  | 14:56 | <vgutierrez> | depool and reimage ncredir4002 as buster - T243391 | [production] | 
            
  | 14:46 | <vgutierrez> | depool & reimage cp4025 as buster - T242093 | [production] | 
            
  | 14:16 | <akosiaris> | 20mins in with eventgate-analytics/eqiad depooled from discovery, no issues yet. | [production] | 
            
  | 14:14 | <ema> | run puppet on mw-api-canary to revert nginx keepalive_requests bump T241145 | [production] | 
            
  | 13:55 | <marostegui> | Stop MySQL on es1019, upgrade and poweroff for on-site maintenance - T243963 | [production] | 
            
  | 13:54 | <akosiaris@cumin1001> | conftool action : set/pooled=false; selector: name=eqiad,dnsdisc=eventgate-analytics | [production] | 
            
  | 13:53 | <akosiaris> | depool eqiad eventgate-analytics for testing purposes. Requests will flow to codfw, monitoring https://grafana.wikimedia.org/d/RIA1lzDZk/application-servers-red-dashboard?orgId=1&from=now-30m&to=now for issues. | [production] | 
            
  | 13:51 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Depool es1019 for onsite maintenance T243963', diff saved to https://phabricator.wikimedia.org/P10321 and previous config saved to /var/cache/conftool/dbconfig/20200206-135157-marostegui.json | [production] | 
            
  | 13:45 | <XioNoX> | rollback deactivate BGP transits on cr3-knams | [production] | 
            
  | 13:34 | <elukey> | repool mw1347 with mcrouter running with 10 proxy threads (was: 5) | [production] | 
            
  | 13:31 | <XioNoX> | reboot cr3-knams | [production] | 
            
  | 13:30 | <elukey> | depool mw1347 to test some mcrouter settings | [production] | 
            
  | 13:27 | <XioNoX> | deactivate BGP transits on cr3-knams | [production] | 
            
  | 13:22 | <vgutierrez> | Enable server session sharing on ats-tls in cp4031 - T244464 | [production] | 
            
  | 13:10 | <XioNoX> | rollback: deactivate BGP transits on cr2-eqsin | [production] | 
            
  | 13:00 | <XioNoX> | reboot cr2-eqsin for sw upgrade | [production] | 
            
  | 13:00 | <addshore> | SWAT done | [production] | 
            
  | 13:00 | <addshore@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: resync REVERT Enable EntitySourceBasedFederation for group1 (duration: 01m 07s) | [production] | 
            
  | 12:59 | <XioNoX> | deactivate BGP transits on cr2-eqsin | [production] | 
            
  | 12:58 | <addshore@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: REVERT Enable EntitySourceBasedFederation for group1 T243395, due to T244479 (duration: 01m 07s) | [production] | 
            
  | 12:52 | <addshore@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: Enable EntitySourceBasedFederation for group1 T243395 (duration: 01m 06s) | [production] | 
            
  | 12:46 | <addshore@deploy1001> | Synchronized php-1.35.0-wmf.18/extensions/Babel: REVERT Fetch central babel information over SQL query, not API (T243726) (duration: 01m 07s) | [production] | 
            
  | 12:44 | <addshore@deploy1001> | sync-file aborted: Fetch central babel information over SQL query, not API (T243726) (duration: 01m 04s) | [production] | 
            
  | 12:40 | <vgutierrez> | pooling cp3065 - T242093 | [production] | 
            
  | 12:39 | <addshore@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: Enable EntitySourceBasedFederation for group0 T243395 (duration: 01m 07s) | [production] | 
            
  | 12:34 | <cparle@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: Re-enable delayed new upload jobs for MachineVision extension (duration: 01m 08s) | [production] | 
            
  | 12:26 | <cparle@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: Remove handler deleted from the MachineVision extension (duration: 01m 05s) | [production] | 
            
  | 12:25 | <XioNoX> | remove full-duplex statement from eqsin Tata link (not supported on Junos 18, as 10G is full duplex anyway) | [production] | 
            
  | 12:24 | <cparle@deploy1001> | Synchronized php-1.35.0-wmf.18/extensions/MachineVision: Use the wbsetclaim API to add depicts statements (duration: 01m 09s) | [production] | 
            
  | 12:07 | <urbanecm@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: SWAT: 5e1cbb2: Enable CX in te, kn, gu, mr and pawiki as a default tool (T243271, T243272, T243273, T243274, T243275) (duration: 01m 09s) | [production] | 
            
  | 11:41 | <akosiaris> | upgrade etherpad-lite on etherpad1002 to 1.8.0-1 | [production] | 
            
  | 11:38 | <kart_> | Updated cxserver to 2020-02-05-051751-production (T244230, T234323) | [production] | 
            
  | 11:35 | <kartik@deploy1001> | helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . | [production] | 
            
  | 11:33 | <akosiaris> | upload etherpad-lite_1.8.0-1 to apt.wikimedia.org buster-wikimedia/main | [production] | 
            
  | 11:31 | <kartik@deploy1001> | helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' . | [production] | 
            
  | 11:28 | <kartik@deploy1001> | helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' . | [production] | 
            
  | 11:14 | <vgutierrez@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 11:11 | <vgutierrez@cumin1001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 10:21 | <akosiaris> | undo "switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348". no effect observed | [production] | 
            
  | 10:20 | <akosiaris> | undo "switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348" | [production] | 
            
  | 10:19 | <vgutierrez> | Enabling HTTP keepalive between ats-tls and varnish-frontend on cp4031 - T244464 | [production] | 
            
  | 10:00 | <vgutierrez> | depool and reimage cp3065 as buster - T242093 | [production] | 
            
  | 09:59 | <vgutierrez> | upload trafficserver 8.0.5-1wm14 to apt.wm.o (buster) - T242093 | [production] |