| 
      
        2020-09-02
      
      ยง
     | 
  
    
  | 22:55 | 
  <shdubsh> | 
  restart rsyslog on centrallog[12]001 | 
  [production] | 
            
  | 22:27 | 
  <ryankemper> | 
  `sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo systemctl restart wdqs-blazegraph.service"` | 
  [production] | 
            
  | 22:26 | 
  <ryankemper> | 
  Puppet finished on all external wdqs codfw nodes, nginx automatically reloaded as intended | 
  [production] | 
            
  | 22:24 | 
  <ryankemper> | 
  `sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo run-puppet-agent"` | 
  [production] | 
            
  | 21:47 | 
  <bd808@deploy1001> | 
  Finished deploy [striker/deploy@3c2090a]: Deploying r20200902 tag (T198114, T223610, T245804, T144111, T261810) (duration: 01m 34s) | 
  [production] | 
            
  | 21:46 | 
  <bd808@deploy1001> | 
  Started deploy [striker/deploy@3c2090a]: Deploying r20200902 tag (T198114, T223610, T245804, T144111, T261810) | 
  [production] | 
            
  | 21:10 | 
  <ryankemper> | 
  `sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo systemctl restart wdqs-blazegraph.service"` | 
  [production] | 
            
  | 21:10 | 
  <ryankemper> | 
  `sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal' "sudo systemctl restart nginx.service"` | 
  [production] | 
            
  | 21:02 | 
  <cmjohnson@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | 
  [production] | 
            
  | 21:01 | 
  <ryankemper> | 
  Restarted nginx on `wdqs2007` | 
  [production] | 
            
  | 21:00 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.hosts.downtime | 
  [production] | 
            
  | 20:47 | 
  <ryankemper> | 
  restarted blazegraph on `wdqs2001` as well | 
  [production] | 
            
  | 20:46 | 
  <ryankemper> | 
  `sudo cumin -b10 'P{wdqs2*} and not A:wdqs-test and not A:wdqs-internal and not P{wdqs2001.codfw.wmnet}' "sudo systemctl restart wdqs-blazegraph.service"` (restarted everything but 2001, will restart 2001 next) | 
  [production] | 
            
  | 20:02 | 
  <cmjohnson@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 19:57 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:26 | 
  <cmjohnson@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 19:24 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:23 | 
  <cmjohnson@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 19:20 | 
  <robh> | 
  scs-c1-eqiad firmware update complete and back online T238036 | 
  [production] | 
            
  | 19:14 | 
  <robh> | 
  updating firmware on scs-c1-eqiad via T238036 | 
  [production] | 
            
  | 19:14 | 
  <urbanecm@deploy1001> | 
  Synchronized private/PrivateSettings.php: Revert "Update T250887 mitigations" (duration: 00m 32s) | 
  [production] | 
            
  | 19:14 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:12 | 
  <robh> | 
  updating firmware on scs-c1-eqiad via T238036 | 
  [production] | 
            
  | 19:09 | 
  <Urbanecm> | 
  21:08 <+logmsgbot> !log urbanecm@deploy1001 Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 00m 54s) | 
  [production] | 
            
  | 19:08 | 
  <urbanecm@deploy1001> | 
  Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 00m 54s) | 
  [production] | 
            
  | 18:58 | 
  <herron> | 
  freeing some disk space on centrallog1001 with 'tune2fs -m 0 /dev/centrallog1001-vg/data' | 
  [production] | 
            
  | 18:43 | 
  <ppchelko@deploy1001> | 
  Synchronized wmf-config/CommonSettings.php: gerrit:622898 Install OAuthRateLimiter III: Install where enabled, ouch, forgot to rebase (duration: 00m 55s) | 
  [production] | 
            
  | 18:40 | 
  <ppchelko@deploy1001> | 
  Synchronized wmf-config/CommonSettings.php: gerrit:622898 Install OAuthRateLimiter III: Install where enabled (duration: 00m 55s) | 
  [production] | 
            
  | 18:38 | 
  <ottomata> | 
  execute kafka topics --alter --topic codfw.resource_change --partitions 3 and kafka topics --alter --topic eqiad.resource_change --partitions 3 on kafka jumbo-eqiad (for consistency with main) - T261865 | 
  [production] | 
            
  | 18:37 | 
  <ottomata> | 
  execute kafka topics --alter --topic codfw.resource_change --partitions 3 and kafka topics --alter --topic eqiad.resource_change --partitions 3 on kafka main-codfw - T261865 | 
  [production] | 
            
  | 18:36 | 
  <ppchelko@deploy1001> | 
  Synchronized wmf-config/InitialiseSettings.php: gerrit:622897 Install OAuthRateLimiter extension II: Add flag to IS (duration: 00m 56s) | 
  [production] | 
            
  | 18:34 | 
  <ottomata> | 
  execute kafka topics --alter --topic codfw.resource_change --partitions 3 and kafka topics --alter --topic eqiad.resource_change --partitions 3 on kafka main-eqiad - T261865 | 
  [production] | 
            
  | 18:33 | 
  <ppchelko@deploy1001> | 
  Synchronized wmf-config/extension-list: (no justification provided) (duration: 00m 54s) | 
  [production] | 
            
  | 18:32 | 
  <ottomata> | 
  execute kafka topics --alter --topic codfw.resource-purge --partitions 3 and kafka topics --alter --topic eqiad.resource-purge --partitions 3 on kafka jumbo-eqiad (for consistency with main) - T261865 | 
  [production] | 
            
  | 18:28 | 
  <ppchelko@deploy1001> | 
  Synchronized php-1.36.0-wmf.6/extensions/DiscussionTools/: Backport [[gerrit:623561|Fix parsing localised digits in PHP discussion parser]] (duration: 00m 56s) | 
  [production] | 
            
  | 18:19 | 
  <ppchelko@deploy1001> | 
  Synchronized php-1.36.0-wmf.6/extensions/DiscussionTools/: Backport [[gerrit:623560|Re-apply new reply API patches (again)]] (duration: 00m 58s) | 
  [production] | 
            
  | 17:34 | 
  <bstorm> | 
  re-enabled puppet on labsdb10[09-12] | 
  [production] | 
            
  | 17:28 | 
  <bstorm> | 
  disabled puppet on labsdb10[09-12] | 
  [production] | 
            
  | 17:18 | 
  <herron> | 
  restarted elasticsearch on logstash1012 | 
  [production] | 
            
  | 16:39 | 
  <Pchelolo> | 
  creating oauth_ratelimit_client_tier table T258711 | 
  [production] | 
            
  | 15:55 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=false; selector: dnsdisc=restbase-async,name=codfw | 
  [production] | 
            
  | 15:55 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=true; selector: dnsdisc=restbase-async | 
  [production] | 
            
  | 15:55 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=true; selector: dnsdisc=eventgate-main | 
  [production] | 
            
  | 15:32 | 
  <hnowlan> | 
  Temporarily disabling apache for configuration change T246945 | 
  [production] | 
            
  | 15:24 | 
  <godog> | 
  prometheus codfw lvextend --resizefs --size +50G /dev/mapper/vg--ssd-prometheus--k8s | 
  [production] | 
            
  | 15:19 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=false; selector: dnsdisc=restbase-async,name=eqiad | 
  [production] | 
            
  | 15:18 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=true; selector: dnsdisc=restbase-async | 
  [production] | 
            
  | 15:18 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=false; selector: dnsdisc=eventgate-main,name=eqiad | 
  [production] | 
            
  | 15:17 | 
  <ppchelko@deploy1001> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'changeprop' for release 'production' . | 
  [production] | 
            
  | 15:16 | 
  <oblivian@cumin1001> | 
  conftool action : set/pooled=false; selector: dnsdisc=eventgate-main,name=eqiad | 
  [production] |