6151-6200 of 10000 results (40ms)
2021-01-28 §
19:10 <ebernhardson@deploy1001> Finished deploy [wikimedia/discovery/analytics@0742443]: hourly partitioning for ores tables (duration: 01m 25s) [production]
19:10 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2223.codfw.wmnet with reason: REIMAGE [production]
19:09 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2247.codfw.wmnet with reason: REIMAGE [production]
19:09 <ebernhardson@deploy1001> Started deploy [wikimedia/discovery/analytics@0742443]: hourly partitioning for ores tables [production]
19:08 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2223.codfw.wmnet with reason: REIMAGE [production]
19:07 <cdanis> decom Zayo IP transit on cr2-codfw T272675 [production]
19:06 <ebernhardson@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Enable canary events for mediawiki_revision_recommendation_create (duration: 01m 12s) [production]
19:02 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1264.eqiad.wmnet with reason: REIMAGE [production]
19:00 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1264.eqiad.wmnet with reason: REIMAGE [production]
18:58 <cdanis> draining traffic from Zayo OGYX/123447 codfw<>ulsfo in preparation for decommission 🥃 T272675 [production]
18:58 <mforns@deploy1001> Started deploy [analytics/refinery@1e41f60]: Regular analytics weekly train [analytics/refinery@1e41f608fad96e7a9f77eb28cd1c082a0a01d562] [production]
18:58 <urbanecm@deploy1001> Synchronized private/PrivateSettings.php: Remove T257687 mitigations (duration: 01m 10s) [production]
18:46 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE [production]
18:44 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE [production]
18:34 <mutante> reimaging another canary appserver, mw1264, so that we will have at least 2 stretch and 2 buster canaries for the transitional period [production]
18:30 <bblack@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:26 <bblack@cumin1001> START - Cookbook sre.dns.netbox [production]
17:49 <jgleeson> fundraising-tools tools updated from 41cab089da to d64b2f8cee [production]
17:38 <crusnov@deploy1001> Finished deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 (duration: 01m 18s) [production]
17:37 <crusnov@deploy1001> Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 [production]
17:35 <crusnov@deploy1001> Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 [production]
17:28 <ebernhardson> ban elastic1063 from production-search-omega-eqiad and production-search-eqiad T265113 [production]
17:11 <urbanecm@deploy1001> Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 01m 06s) [production]
16:56 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy1002.eqiad.wmnet [production]
16:51 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'staging' . [production]
16:51 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' . [production]
16:49 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host deploy1002.eqiad.wmnet [production]
16:49 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . [production]
16:49 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'staging' . [production]
16:49 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy2002.codfw.wmnet [production]
16:48 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . [production]
16:48 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'production' . [production]
16:45 <elukey@cumin1001> END (FAIL) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=99) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:44 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:44 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:44 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:41 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:41 <arturo> running homer on cr*-eqiad* again for reverting latest changes (T271476) [production]
16:39 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host deploy2002.codfw.wmnet [production]
16:28 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' . [production]
16:28 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'staging' . [production]
16:28 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'plain' . [production]
16:26 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' . [production]
16:25 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'plain' . [production]
16:25 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'staging' . [production]
16:24 <elukey@cumin1001> START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 [production]
16:24 <akosiaris> stop scraping apertium from prometheus, it doesn't have a prometheus endpoint. [production]
16:23 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'production' . [production]
16:23 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'plain' . [production]
16:23 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'staging' . [production]