| 2022-08-04
      
      ยง | 
    
  | 16:37 | <jayme@cumin1001> | START - Cookbook sre.hosts.remove-downtime for 18 hosts | [production] | 
            
  | 16:35 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2059.codfw.wmnet with reason: T310145 | [production] | 
            
  | 16:35 | <bking@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2059.codfw.wmnet with reason: T310145 | [production] | 
            
  | 16:34 | <jayme@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2003.codfw.wmnet with reason: PDU swap | [production] | 
            
  | 16:34 | <ebysans@deploy1002> | Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 00m 20s) | [production] | 
            
  | 16:34 | <jayme@cumin1001> | START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2003.codfw.wmnet with reason: PDU swap | [production] | 
            
  | 16:34 | <ebysans@deploy1002> | Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] | [production] | 
            
  | 16:32 | <ebysans@deploy1002> | Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 29m 59s) | [production] | 
            
  | 16:30 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depool D3 for PDU maint', diff saved to https://phabricator.wikimedia.org/P32286 and previous config saved to /var/cache/conftool/dbconfig/20220804-163037-ladsgroup.json | [production] | 
            
  | 16:28 | <mwdebug-deploy@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 16:28 | <ladsgroup@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:820376|Start reading from new templatelinks columns in commons (T306673)]] (duration: 03m 00s) | [production] | 
            
  | 16:27 | <mwdebug-deploy@deploy1002> | helmfile [codfw] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 16:27 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 16:26 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 16:17 | <brett> | deploying authdns - geodns: Map out African countries by DC latency (T311472) | [production] | 
            
  | 16:12 | <cwhite> | poweroff logstash2028 - T310145 | [production] | 
            
  | 16:06 | <Emperor> | shutdown ms-be20[39,49,54].codfw.wmnet,thanos-be2003 for PDU swap T310145 | [production] | 
            
  | 16:03 | <mvernon@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet with reason: PDU work | [production] | 
            
  | 16:02 | <mvernon@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet with reason: PDU work | [production] | 
            
  | 16:02 | <ebysans@deploy1002> | Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] | [production] | 
            
  | 15:50 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2048.codfw.wmnet with reason: T310145 | [production] | 
            
  | 15:50 | <bking@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2048.codfw.wmnet with reason: T310145 | [production] | 
            
  | 15:43 | <damilare> | payments-wiki upgraded from 0e4a5b3b to 6880236d | [production] | 
            
  | 15:37 | <_joe_> | uncordoning ml-serve200{1,6} | [production] | 
            
  | 15:27 | <sukhe> | power off cp2037,cp2038: PDU upgrade | [production] | 
            
  | 15:25 | <jelto@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:30:00 on phab2001.codfw.wmnet with reason: PDU swap | [production] | 
            
  | 15:25 | <jelto> | power off phab2001 | [production] | 
            
  | 15:25 | <jelto@cumin1001> | START - Cookbook sre.hosts.downtime for 3:30:00 on phab2001.codfw.wmnet with reason: PDU swap | [production] | 
            
  | 15:25 | <sukhe@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp[2037-2038].codfw.wmnet with reason: shutdown for PDU upgrade | [production] | 
            
  | 15:24 | <sukhe@cumin2002> | START - Cookbook sre.hosts.downtime for 4:00:00 on cp[2037-2038].codfw.wmnet with reason: shutdown for PDU upgrade | [production] | 
            
  | 15:24 | <sukhe@puppetmaster1001> | conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=varnish-fe | [production] | 
            
  | 15:23 | <sukhe@puppetmaster1001> | conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=ats-be | [production] | 
            
  | 15:23 | <sukhe@puppetmaster1001> | conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=ats-tls | [production] | 
            
  | 15:21 | <XioNoX> | un-drain codfw-ulsfo link - T310310 | [production] | 
            
  | 15:21 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db[2116,2127,2167-2168].codfw.wmnet,es2022.codfw.wmnet with reason: Maintenance (T310145) | [production] | 
            
  | 15:20 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 10:00:00 on db[2116,2127,2167-2168].codfw.wmnet,es2022.codfw.wmnet with reason: Maintenance (T310145) | [production] | 
            
  | 15:19 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depool C6 for PDU maint (T310145)', diff saved to https://phabricator.wikimedia.org/P32285 and previous config saved to /var/cache/conftool/dbconfig/20220804-151958-ladsgroup.json | [production] | 
            
  | 15:16 | <btullis@cumin1001> | END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) for AQS aqs cluster: Roll restart of all AQS's nodejs daemons. | [production] | 
            
  | 15:16 | <hnowlan@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on restbase[2016,2020,2025].codfw.wmnet with reason: PDU maintenance | [production] | 
            
  | 15:16 | <hnowlan@cumin1001> | START - Cookbook sre.hosts.downtime for 3:00:00 on restbase[2016,2020,2025].codfw.wmnet with reason: PDU maintenance | [production] | 
            
  | 15:13 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db[2114,2126,2166].codfw.wmnet with reason: Maintenance (T310145) | [production] | 
            
  | 15:13 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 10:00:00 on db[2114,2126,2166].codfw.wmnet with reason: Maintenance (T310145) | [production] | 
            
  | 15:13 | <sukhe@puppetmaster1001> | conftool action : set/pooled=yes; selector: name=cp203[12]\.codfw\.wmnet,service=varnish-fe | [production] | 
            
  | 15:13 | <sukhe@puppetmaster1001> | conftool action : set/pooled=yes; selector: name=cp203[12]\.codfw\.wmnet,service=ats-be | [production] | 
            
  | 15:13 | <sukhe@puppetmaster1001> | conftool action : set/pooled=yes; selector: name=cp203[12]\.codfw\.wmnet,service=ats-tls | [production] | 
            
  | 15:12 | <mvernon@cumin1001> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be[2058,2064].codfw.wmnet | [production] | 
            
  | 15:12 | <mvernon@cumin1001> | START - Cookbook sre.hosts.remove-downtime for ms-be[2058,2064].codfw.wmnet | [production] | 
            
  | 15:11 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depool hosts for PDU maint (T310145)', diff saved to https://phabricator.wikimedia.org/P32284 and previous config saved to /var/cache/conftool/dbconfig/20220804-151121-ladsgroup.json | [production] | 
            
  | 15:09 | <godog> | poweroff logstash2002 - T310145 | [production] | 
            
  | 15:07 | <_joe_> | pwoering down mc203{0,1} | [production] |