2021-01-28
|
18:58 |
<cdanis> |
draining traffic from Zayo OGYX/123447 codfw<>ulsfo in preparation for decommission 🥃 T272675 |
[production] |
18:58 |
<mforns@deploy1001> |
Started deploy [analytics/refinery@1e41f60]: Regular analytics weekly train [analytics/refinery@1e41f608fad96e7a9f77eb28cd1c082a0a01d562] |
[production] |
18:58 |
<urbanecm@deploy1001> |
Synchronized private/PrivateSettings.php: Remove T257687 mitigations (duration: 01m 10s) |
[production] |
18:46 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE |
[production] |
18:44 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE |
[production] |
18:34 |
<mutante> |
reimaging another canary appserver, mw1264, so that we will have at least 2 stretch and 2 buster canaries for the transitional period |
[production] |
18:30 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:26 |
<bblack@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
17:49 |
<jgleeson> |
fundraising-tools tools updated from 41cab089da to d64b2f8cee |
[production] |
17:38 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 (duration: 01m 18s) |
[production] |
17:37 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 |
[production] |
17:35 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 |
[production] |
17:28 |
<ebernhardson> |
ban elastic1063 from production-search-omega-eqiad and production-search-eqiad T265113 |
[production] |
17:11 |
<urbanecm@deploy1001> |
Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 01m 06s) |
[production] |
16:56 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy1002.eqiad.wmnet |
[production] |
16:51 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
16:51 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
16:49 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host deploy1002.eqiad.wmnet |
[production] |
16:49 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
16:49 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
16:49 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy2002.codfw.wmnet |
[production] |
16:48 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
16:48 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
16:45 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=99) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:44 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:44 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:44 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:41 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:41 |
<arturo> |
running homer on cr*-eqiad* again for reverting latest changes (T271476) |
[production] |
16:39 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host deploy2002.codfw.wmnet |
[production] |
16:28 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
16:28 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'staging' . |
[production] |
16:28 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'plain' . |
[production] |
16:26 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
16:25 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'plain' . |
[production] |
16:25 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'staging' . |
[production] |
16:24 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:24 |
<akosiaris> |
stop scraping apertium from prometheus, it doesn't have a prometheus endpoint. |
[production] |
16:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
16:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'plain' . |
[production] |
16:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'staging' . |
[production] |
16:19 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:17 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:06 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:03 |
<arturo> |
running homer on cr*-eqiad* for T271476 |
[production] |
15:55 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
15:54 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 |
[production] |
15:52 |
<cdanis> |
draining traffic from Zayo OGYX/120003 codfw<>eqiad in preparation for decommission 🥃 T272675 |
[production] |
15:49 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.stop-cluster for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 |
[production] |
15:49 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@d0a6933]: align threshold path references across days (duration: 01m 15s) |
[production] |