production SAL

6151-6200 of 10000 results (42ms)

2021-01-28 §
19:10	<ebernhardson@deploy1001>	Finished deploy [wikimedia/discovery/analytics@0742443]: hourly partitioning for ores tables (duration: 01m 25s)	[production]
19:10	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2223.codfw.wmnet with reason: REIMAGE	[production]
19:09	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2247.codfw.wmnet with reason: REIMAGE	[production]
19:09	<ebernhardson@deploy1001>	Started deploy [wikimedia/discovery/analytics@0742443]: hourly partitioning for ores tables	[production]
19:08	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2223.codfw.wmnet with reason: REIMAGE	[production]
19:07	<cdanis>	decom Zayo IP transit on cr2-codfw T272675	[production]
19:06	<ebernhardson@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: Enable canary events for mediawiki_revision_recommendation_create (duration: 01m 12s)	[production]
19:02	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1264.eqiad.wmnet with reason: REIMAGE	[production]
19:00	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1264.eqiad.wmnet with reason: REIMAGE	[production]
18:58	<cdanis>	draining traffic from Zayo OGYX/123447 codfw<>ulsfo in preparation for decommission 🥃 T272675	[production]
18:58	<mforns@deploy1001>	Started deploy [analytics/refinery@1e41f60]: Regular analytics weekly train [analytics/refinery@1e41f608fad96e7a9f77eb28cd1c082a0a01d562]	[production]
18:58	<urbanecm@deploy1001>	Synchronized private/PrivateSettings.php: Remove T257687 mitigations (duration: 01m 10s)	[production]
18:46	<robh@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE	[production]
18:44	<robh@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1159.eqiad.wmnet with reason: REIMAGE	[production]
18:34	<mutante>	reimaging another canary appserver, mw1264, so that we will have at least 2 stretch and 2 buster canaries for the transitional period	[production]
18:30	<bblack@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
18:26	<bblack@cumin1001>	START - Cookbook sre.dns.netbox	[production]
17:49	<jgleeson>	fundraising-tools tools updated from 41cab089da to d64b2f8cee	[production]
17:38	<crusnov@deploy1001>	Finished deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084 (duration: 01m 18s)	[production]
17:37	<crusnov@deploy1001>	Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084	[production]
17:35	<crusnov@deploy1001>	Started deploy [netbox/deploy@52d6fb9]: Test deploy of 2.10.4 to netbox-next T265084	[production]
17:28	<ebernhardson>	ban elastic1063 from production-search-omega-eqiad and production-search-eqiad T265113	[production]
17:11	<urbanecm@deploy1001>	Synchronized private/PrivateSettings.php: Update T250887 mitigations (duration: 01m 06s)	[production]
16:56	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy1002.eqiad.wmnet	[production]
16:51	<akosiaris@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'staging' .	[production]
16:51	<akosiaris@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'cxserver' for release 'production' .	[production]
16:49	<jmm@cumin2001>	START - Cookbook sre.hosts.reboot-single for host deploy1002.eqiad.wmnet	[production]
16:49	<akosiaris@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' .	[production]
16:49	<akosiaris@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'staging' .	[production]
16:49	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy2002.codfw.wmnet	[production]
16:48	<akosiaris@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' .	[production]
16:48	<akosiaris@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'production' .	[production]
16:45	<elukey@cumin1001>	END (FAIL) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=99) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001	[production]
16:44	<elukey@cumin1001>	START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001	[production]
16:44	<elukey@cumin1001>	END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001	[production]
16:44	<elukey@cumin1001>	START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001	[production]
16:41	<elukey@cumin1001>	END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001	[production]
16:41	<arturo>	running homer on cr-eqiad again for reverting latest changes (T271476)	[production]
16:39	<jmm@cumin2001>	START - Cookbook sre.hosts.reboot-single for host deploy2002.codfw.wmnet	[production]
16:28	<akosiaris@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' .	[production]
16:28	<akosiaris@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'staging' .	[production]
16:28	<akosiaris@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'plain' .	[production]
16:26	<akosiaris@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' .	[production]
16:25	<akosiaris@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'plain' .	[production]
16:25	<akosiaris@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'staging' .	[production]
16:24	<elukey@cumin1001>	START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001	[production]
16:24	<akosiaris>	stop scraping apertium from prometheus, it doesn't have a prometheus endpoint.	[production]
16:23	<akosiaris@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'production' .	[production]
16:23	<akosiaris@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'plain' .	[production]
16:23	<akosiaris@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'staging' .	[production]