production SAL

2251-2300 of 10000 results (57ms)

2022-06-23 §
07:15	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 0:30:00 on 22 hosts with reason: Reboots	[production]
07:15	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 25 hosts with reason: Reboots	[production]
07:15	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 0:30:00 on 25 hosts with reason: Reboots	[production]
00:35	<brennen>	end of phabricator maintenance window	[production]
00:13	<brennen>	phabricator deploy finished (T311175)	[production]
00:01	<dzahn@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab2001.codfw.wmnet with reason: maintenance	[production]
00:01	<dzahn@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on phab2001.codfw.wmnet with reason: maintenance	[production]
00:01	<dzahn@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phabricator.wikimedia.org with reason: maintenance	[production]
00:01	<dzahn@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on phabricator.wikimedia.org with reason: maintenance	[production]
00:00	<dzahn@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab1001.eqiad.wmnet with reason: maintenance	[production]
00:00	<dzahn@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on phab1001.eqiad.wmnet with reason: maintenance	[production]
2022-06-22 §
22:56	<tzatziki>	removing 1 file for legal compliance	[production]
21:45	<cmjohnson@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1007.eqiad.wmnet with OS bullseye	[production]
21:44	<ebernhardson>	restart elasticsearch_6@cloudelastic-chi-eqiad on cloudelastic1003 to resolve Old GC Hell alert	[production]
21:44	<ebernhardson>	restart elasticsearch_6@cloudelastic-chi-eqiad to resolve Old GC Hell alert	[production]
21:28	<cmjohnson@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1006.eqiad.wmnet with OS bullseye	[production]
20:49	<aqu@deploy1002>	Finished deploy [analytics/refinery@99cca44]: Regular analytics weekly train retry force [analytics/refinery@99cca44] (duration: 01m 18s)	[production]
20:48	<aqu@deploy1002>	Started deploy [analytics/refinery@99cca44]: Regular analytics weekly train retry force [analytics/refinery@99cca44]	[production]
20:45	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host an-presto1007.eqiad.wmnet with OS bullseye	[production]
20:28	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host an-presto1006.eqiad.wmnet with OS bullseye	[production]
20:27	<cmjohnson@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-presto1006.eqiad.wmnet with OS buster	[production]
20:24	<cjming>	end of UTC late backport window	[production]
20:22	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host an-presto1006.eqiad.wmnet with OS buster	[production]
20:19	<aqu@deploy1002>	Finished deploy [analytics/refinery@99cca44] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@99cca44] (duration: 07m 36s)	[production]
20:16	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
20:14	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
20:14	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
20:13	<cjming@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:807593\|gawiki: Change category collation from `uppercase` to `uca-ga-u-kn` (T311136)]] (duration: 03m 39s)	[production]
20:13	<cmjohnson@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-presto1006.eqiad.wmnet with OS bullseye	[production]
20:11	<aqu@deploy1002>	Started deploy [analytics/refinery@99cca44] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@99cca44]	[production]
20:11	<aqu@deploy1002>	Finished deploy [analytics/refinery@99cca44] (thin): Regular analytics weekly train THIN [analytics/refinery@99cca44] (duration: 00m 07s)	[production]
20:11	<aqu@deploy1002>	Started deploy [analytics/refinery@99cca44] (thin): Regular analytics weekly train THIN [analytics/refinery@99cca44]	[production]
20:10	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
20:10	<aqu@deploy1002>	Finished deploy [analytics/refinery@99cca44]: Regular analytics weekly train retry [analytics/refinery@99cca44] (duration: 06m 16s)	[production]
20:03	<aqu@deploy1002>	Started deploy [analytics/refinery@99cca44]: Regular analytics weekly train retry [analytics/refinery@99cca44]	[production]
20:03	<aqu@deploy1002>	Finished deploy [analytics/refinery@99cca44]: Regular analytics weekly train [analytics/refinery@99cca44] (duration: 30m 58s)	[production]
19:42	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host an-presto1006.eqiad.wmnet with OS bullseye	[production]
19:42	<cmjohnson@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-presto1006.eqiad.wmnet with OS buster	[production]
19:39	<ebernhardson@deploy1002>	Finished deploy [wikimedia/discovery/analytics@1f2f286]: namespace maps: Exclude labtest database group from data collection (duration: 02m 03s)	[production]
19:37	<ebernhardson@deploy1002>	Started deploy [wikimedia/discovery/analytics@1f2f286]: namespace maps: Exclude labtest database group from data collection	[production]
19:32	<aqu@deploy1002>	Started deploy [analytics/refinery@99cca44]: Regular analytics weekly train [analytics/refinery@99cca44]	[production]
19:31	<aqu>	Deploying analytics/refinery (weekly train)	[production]
19:15	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host an-presto1006.eqiad.wmnet with OS buster	[production]
19:14	<herron>	bounced apache on lists1001	[production]
19:06	<hashar>	Restarting CI Jenkins	[production]
16:46	<jynus@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host backup1009.eqiad.wmnet with OS bullseye	[production]
16:45	<hashar>	Restarting CI Jenkins	[production]
16:43	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2063.codfw.wmnet	[production]
16:33	<jynus@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on backup1009.eqiad.wmnet with reason: host reimage	[production]
16:29	<jynus@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on backup1009.eqiad.wmnet with reason: host reimage	[production]