production SAL

2022-08-04
17:42	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on maps2008.codfw.wmnet with reason: codfw reboots	[production]
17:42	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 1:00:00 on maps2008.codfw.wmnet with reason: codfw reboots	[production]
17:42	<mutante>	thumbor2006 - downtime and shutdown for D3 maintenance	[production]
17:42	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on thumbor2006.codfw.wmnet with reason: codfw reboots	[production]
17:41	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 1:00:00 on thumbor2006.codfw.wmnet with reason: codfw reboots	[production]
17:39	<mutante>	mw2386 - systemctl reset-failed	[production]
17:31	<mutante>	phab2001 - systemctl restart ssh-phab, attempting to clear Icinga pybal alerts, related to reboots	[production]
17:30	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade	[production]
17:30	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 3:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade	[production]
17:29	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade	[production]
17:29	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade	[production]
17:28	<Amir1>	dbmaint at s4@eqiad (T312863)	[production]
17:26	<bd808@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply	[production]
17:26	<bd808@deploy1002>	helmfile [eqiad] START helmfile.d/services/developer-portal: apply	[production]
17:24	<bd808@deploy1002>	helmfile [codfw] DONE helmfile.d/services/developer-portal: apply	[production]
17:23	<bd808@deploy1002>	helmfile [codfw] START helmfile.d/services/developer-portal: apply	[production]
17:23	<bd808@deploy1002>	helmfile [staging] DONE helmfile.d/services/developer-portal: apply	[production]
17:23	<bd808@deploy1002>	helmfile [staging] START helmfile.d/services/developer-portal: apply	[production]
17:20	<mutante>	[an-launcher1002:~] $ sudo systemctl reset-failed	[production]
17:20	<mvernon@cumin1001>	conftool action : set/pooled=no; selector: name=ms-fe2012.codfw.wmnet	[production]
17:18	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
17:18	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
17:18	<sukhe@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp2038.codfw.wmnet,service=varnish-fe	[production]
17:18	<sukhe@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp2038.codfw.wmnet,service=ats-be	[production]
17:18	<sukhe@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp2038.codfw.wmnet,service=ats-tls	[production]
17:16	<Emperor>	shutdown of moss-fe2002.codfw.wmnet,ms-be20[37,38,43,61,65,69].codfw.wmnet,ms-fe2012.codfw.wmnet,thanos-fe2003.codfw.wmnet for power work T310146	[production]
17:16	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp[2035-2036].codfw.wmnet with reason: shutdown for PDU upgrade	[production]
17:15	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 4:00:00 on cp[2035-2036].codfw.wmnet with reason: shutdown for PDU upgrade	[production]
17:15	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 9 hosts with reason: PDU work	[production]
17:15	<mvernon@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 9 hosts with reason: PDU work	[production]
17:15	<sukhe@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=varnish-fe	[production]
17:15	<sukhe@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=ats-be	[production]
17:15	<sukhe@puppetmaster1001>	conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=ats-tls	[production]
17:13	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet	[production]
17:13	<mvernon@cumin1001>	START - Cookbook sre.hosts.remove-downtime for ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet	[production]
17:12	<sukhe@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=varnish-fe	[production]
17:12	<sukhe@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=ats-be	[production]
17:12	<sukhe@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=ats-tls	[production]
17:12	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2050.codfw.wmnet with reason: T310146	[production]
17:12	<bking@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2050.codfw.wmnet with reason: T310146	[production]
17:11	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] (duration: 00m 04s)	[production]
17:11	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288]	[production]
17:11	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] (duration: 00m 07s)	[production]
17:10	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288]	[production]
17:10	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 00m 15s)	[production]
17:09	<ebysans@deploy1002>	Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288]	[production]
17:07	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on lvs2010.codfw.wmnet with reason: shutdown for PDU upgrade	[production]
17:07	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on lvs2010.codfw.wmnet with reason: shutdown for PDU upgrade	[production]
16:55	<hnowlan@puppetmaster1001>	conftool action : set/pooled=no; selector: name=maps2008.codfw.wmnet	[production]
16:51	<ebysans@deploy1002>	Finished deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] (duration: 07m 14s)	[production]