production SAL

2022-08-04
16:45	<hnowlan@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=restbase2016.codfw.wmnet	[production]
16:45	<hnowlan@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=restbase202[05].codfw.wmnet	[production]
16:45	<hnowlan@puppetmaster1001>	conftool action : set/pooled=no; selector: name=restbase202[05].codfw.wmnet	[production]
16:45	<hnowlan@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=maps2007.codfw.wmnet	[production]
16:43	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288]	[production]
16:43	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] (duration: 00m 07s)	[production]
16:43	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288]	[production]
16:37	<jayme@cumin1001>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 18 hosts	[production]
16:37	<jayme@cumin1001>	START - Cookbook sre.hosts.remove-downtime for 18 hosts	[production]
16:35	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2059.codfw.wmnet with reason: T310145	[production]
16:35	<bking@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2059.codfw.wmnet with reason: T310145	[production]
16:34	<jayme@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2003.codfw.wmnet with reason: PDU swap	[production]
16:34	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 00m 20s)	[production]
16:34	<jayme@cumin1001>	START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2003.codfw.wmnet with reason: PDU swap	[production]
16:34	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288]	[production]
16:32	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 29m 59s)	[production]
16:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depool D3 for PDU maint', diff saved to https://phabricator.wikimedia.org/P32286 and previous config saved to /var/cache/conftool/dbconfig/20220804-163037-ladsgroup.json	[production]
16:28	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
16:28	<ladsgroup@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:820376|Start reading from new templatelinks columns in commons (T306673)]] (duration: 03m 00s)	[production]
16:27	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
16:27	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
16:26	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
16:17	<brett>	deploying authdns - geodns: Map out African countries by DC latency (T311472)	[production]
16:12	<cwhite>	poweroff logstash2028 - T310145	[production]
16:06	<Emperor>	shutdown ms-be20[39,49,54].codfw.wmnet,thanos-be2003 for PDU swap T310145	[production]
16:03	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet with reason: PDU work	[production]
16:02	<mvernon@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet with reason: PDU work	[production]
16:02	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288]	[production]
15:50	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2048.codfw.wmnet with reason: T310145	[production]
15:50	<bking@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2048.codfw.wmnet with reason: T310145	[production]
15:43	<damilare>	payments-wiki upgraded from 0e4a5b3b to 6880236d	[production]
15:37	<_joe_>	uncordoning ml-serve200{1,6}	[production]
15:27	<sukhe>	power off cp2037,cp2038: PDU upgrade	[production]
15:25	<jelto@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:30:00 on phab2001.codfw.wmnet with reason: PDU swap	[production]
15:25	<jelto>	power off phab2001	[production]
15:25	<jelto@cumin1001>	START - Cookbook sre.hosts.downtime for 3:30:00 on phab2001.codfw.wmnet with reason: PDU swap	[production]
15:25	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp[2037-2038].codfw.wmnet with reason: shutdown for PDU upgrade	[production]
15:24	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 4:00:00 on cp[2037-2038].codfw.wmnet with reason: shutdown for PDU upgrade	[production]
15:24	<sukhe@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=varnish-fe	[production]
15:23	<sukhe@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=ats-be	[production]
15:23	<sukhe@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=ats-tls	[production]
15:21	<XioNoX>	un-drain codfw-ulsfo link - T310310	[production]
15:21	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db[2116,2127,2167-2168].codfw.wmnet,es2022.codfw.wmnet with reason: Maintenance (T310145)	[production]
15:20	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db[2116,2127,2167-2168].codfw.wmnet,es2022.codfw.wmnet with reason: Maintenance (T310145)	[production]
15:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depool C6 for PDU maint (T310145)', diff saved to https://phabricator.wikimedia.org/P32285 and previous config saved to /var/cache/conftool/dbconfig/20220804-151958-ladsgroup.json	[production]
15:16	<btullis@cumin1001>	END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) for AQS aqs cluster: Roll restart of all AQS's nodejs daemons.	[production]
15:16	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on restbase[2016,2020,2025].codfw.wmnet with reason: PDU maintenance	[production]
15:16	<hnowlan@cumin1001>	START - Cookbook sre.hosts.downtime for 3:00:00 on restbase[2016,2020,2025].codfw.wmnet with reason: PDU maintenance	[production]
15:13	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db[2114,2126,2166].codfw.wmnet with reason: Maintenance (T310145)	[production]
15:13	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db[2114,2126,2166].codfw.wmnet with reason: Maintenance (T310145)	[production]