| 2021-01-28
      
      § | 
    
  | 19:10 | <dzahn@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2223.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 19:09 | <dzahn@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on mw2247.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 19:09 | <ebernhardson@deploy1001> | Started deploy [wikimedia/discovery/analytics@0742443]: hourly partitioning for ores tables | [production] | 
            
  | 19:08 | <dzahn@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on mw2223.codfw.wmnet with reason: REIMAGE | [production] | 
            
  | 19:07 | <cdanis> | decom Zayo IP transit on cr2-codfw T272675 | [production] | 
            
  | 19:06 | <ebernhardson@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: Enable canary events for mediawiki_revision_recommendation_create (duration: 01m 12s) | [production] | 
            
  | 19:02 | <dzahn@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1264.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 19:00 | <dzahn@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on mw1264.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 18:58 | <cdanis> | draining traffic from Zayo OGYX/123447 codfw<>ulsfo in preparation for decommission 🥃 T272675 | [production] | 
            
  | 18:58 | <mforns@deploy1001> | Started deploy [analytics/refinery@1e41f60]: Regular analytics weekly train [analytics/refinery@1e41f608fad96e7a9f77eb28cd1c082a0a01d562] | [production] | 
            
  | 18:58 | <urbanecm@deploy1001> | Synchronized private/PrivateSettings.php: Remove T257687 mitigations (duration: 01m 10s) | [production] | 
            
  | 18:46 | <robh@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 18:44 | <robh@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 18:34 | <mutante> | reimaging another canary appserver, mw1264, so that we will have at least 2 stretch and 2 buster canaries for the transitional period | [production] | 
            
  | 18:30 | <bblack@cumin1001> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 18:26 | <bblack@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 17:49 | <jgleeson> | fundraising-tools tools updated from 41cab089da to d64b2f8cee | [production] | 
            
  | 17:38 | <crusnov@deploy1001> | Finished deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 (duration: 01m 18s) | [production] | 
            
  | 17:37 | <crusnov@deploy1001> | Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 | [production] | 
            
  | 17:35 | <crusnov@deploy1001> | Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 | [production] | 
            
  | 17:28 | <ebernhardson> | ban elastic1063 from production-search-omega-eqiad and production-search-eqiad T265113 | [production] | 
            
  | 17:11 | <urbanecm@deploy1001> | Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 01m 06s) | [production] | 
            
  | 16:56 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy1002.eqiad.wmnet | [production] | 
            
  | 16:51 | <akosiaris@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'staging' . | [production] | 
            
  | 16:51 | <akosiaris@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' . | [production] | 
            
  | 16:49 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host deploy1002.eqiad.wmnet | [production] | 
            
  | 16:49 | <akosiaris@deploy1001> | helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . | [production] | 
            
  | 16:49 | <akosiaris@deploy1001> | helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'staging' . | [production] | 
            
  | 16:49 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy2002.codfw.wmnet | [production] | 
            
  | 16:48 | <akosiaris@deploy1001> | helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . | [production] | 
            
  | 16:48 | <akosiaris@deploy1001> | helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'production' . | [production] | 
            
  | 16:45 | <elukey@cumin1001> | END (FAIL) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=99) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] | 
            
  | 16:44 | <elukey@cumin1001> | START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] | 
            
  | 16:44 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] | 
            
  | 16:44 | <elukey@cumin1001> | START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] | 
            
  | 16:41 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] | 
            
  | 16:41 | <arturo> | running homer on cr*-eqiad* again for reverting latest changes (T271476) | [production] | 
            
  | 16:39 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host deploy2002.codfw.wmnet | [production] | 
            
  | 16:28 | <akosiaris@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' . | [production] | 
            
  | 16:28 | <akosiaris@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'staging' . | [production] | 
            
  | 16:28 | <akosiaris@deploy1001> | helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'plain' . | [production] | 
            
  | 16:26 | <akosiaris@deploy1001> | helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' . | [production] | 
            
  | 16:25 | <akosiaris@deploy1001> | helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'plain' . | [production] | 
            
  | 16:25 | <akosiaris@deploy1001> | helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'staging' . | [production] | 
            
  | 16:24 | <elukey@cumin1001> | START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] | 
            
  | 16:24 | <akosiaris> | stop scraping apertium from prometheus, it doesn't have a prometheus endpoint. | [production] | 
            
  | 16:23 | <akosiaris@deploy1001> | helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'production' . | [production] | 
            
  | 16:23 | <akosiaris@deploy1001> | helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'plain' . | [production] | 
            
  | 16:23 | <akosiaris@deploy1001> | helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'staging' . | [production] | 
            
  | 16:19 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 | [production] |