production SAL

2019-03-21
17:11	<gehel@cumin1001>	END (FAIL) - Cookbook sre.elasticsearch.e6-upgrade (exit_code=99)	[production]
17:07	<jforrester@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: SDC: Enable Depicts on TestCommons, with related config (duration: 00m 50s)	[production]
17:03	<gehel@cumin1001>	START - Cookbook sre.elasticsearch.e6-upgrade	[production]
17:03	<gehel@cumin1001>	END (ERROR) - Cookbook sre.elasticsearch.e6-upgrade (exit_code=97)	[production]
17:02	<gehel@cumin2001>	END (PASS) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=0)	[production]
17:02	<gehel@cumin2001>	START - Cookbook sre.elasticsearch.force-shard-allocation	[production]
16:39	<gehel@cumin1001>	START - Cookbook sre.elasticsearch.e6-upgrade	[production]
16:38	<gehel@cumin1001>	END (ERROR) - Cookbook sre.elasticsearch.e6-upgrade (exit_code=97)	[production]
16:38	<gehel@cumin1001>	START - Cookbook sre.elasticsearch.e6-upgrade	[production]
16:38	<gehel@cumin1001>	END (ERROR) - Cookbook sre.elasticsearch.e6-upgrade (exit_code=97)	[production]
16:38	<gehel@cumin2001>	END (PASS) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=0)	[production]
16:38	<gehel@cumin2001>	START - Cookbook sre.elasticsearch.force-shard-allocation	[production]
16:29	<gehel@cumin2001>	END (PASS) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=0)	[production]
16:29	<gehel@cumin2001>	START - Cookbook sre.elasticsearch.force-shard-allocation	[production]
16:03	<marostegui@deploy1001>	Synchronized wmf-config/db-codfw.php: Depool db2096 for onsite maintenance (duration: 00m 50s)	[production]
16:01	<marostegui>	Poweroff db2096 for onsite maintenance T218336	[production]
15:20	<moritzm>	rebooting flerovium/furud for kernel updates	[production]
14:35	<moritzm>	restarting jenkins on releases* after Java update	[production]
14:18	<gtirloni>	downtimed labtestweb2001 (T218881)	[production]
14:11	<vgutierrez>	re-enabling puppet in acme-chief clients - T218862	[production]
14:09	<arturo>	T218024 disabled icinga checks for labtestweb2001	[production]
14:07	<gehel@cumin1001>	START - Cookbook sre.elasticsearch.e6-upgrade	[production]
13:58	<vgutierrez>	update acme-chief to version 0.15 in acmechief1001 - T218862	[production]
13:54	<vgutierrez>	disabling puppet in acme-chief clients - T218862	[production]
13:48	<akosiaris>	reboot oresrdb2001	[production]
13:39	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Repool db1090:3317 (duration: 00m 51s)	[production]
13:37	<elukey>	upgrade openjdk-8 on an-worker1080 and restarted hadoop daemons	[production]
13:28	<moritzm>	installing Java security updates on notebook hosts	[production]
13:22	<zfilipin@deploy1001>	rebuilt and synchronized wikiversions files: all wikis to 1.33.0-wmf.22	[production]
13:18	<gtirloni>	downtimed cloudcontrol, cloudservices, labcontrol, labweb (T210818)	[production]
13:06	<moritzm>	installing Java security updates on stat hosts	[production]
12:40	<arturo>	T216497 remove python-cliff from jessie-wikimedia/openstack-mitaka-jessie	[production]
12:35	<jijiki>	Pooling mw1339 back	[production]
12:33	<jijiki>	Pooling mw1290 back	[production]
12:08	<arturo>	T216497 add python-cliff to jessie-wikimedia/openstack-mitaka-jessie	[production]
12:02	<vgutierrez>	uploaded acme-chief 0.15 to apt.wikimedia.org (buster) - T218862	[production]
11:54	<elukey>	restart yarn node managers on an-worker10[82,89,92] - shutdown after a long yarn failover and only now downtime is expired	[production]
11:36	<mutante>	gerrit2001 (not the master prod server) - scheduled downtime and rebooting for upgrade	[production]
11:04	<zeljkof>	EU SWAT finished	[production]
11:04	<zfilipin@deploy1001>	Synchronized wmf-config/throttle.php: SWAT: [[gerrit:495494|Add new throttle rule for LMU Edit-a-thon (T217929)]] (duration: 00m 57s)	[production]
10:57	<filippo@puppetmaster1001>	conftool action : set/pooled=no; selector: name=prometheus2004.codfw.wmnet	[production]
10:52	<filippo@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=prometheus2003.codfw.wmnet	[production]
10:46	<elukey>	restart hadoop yarn resource managers on an-master100[1,2] to pick up new settings	[production]
10:23	<moritzm>	rebooting labtestcontrol2001 for kernel update	[production]
10:19	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Depool db1090:3317 (duration: 00m 56s)	[production]
09:59	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Fully repool db1086 (duration: 00m 58s)	[production]
09:42	<jiji@cumin1001>	conftool action : set/pooled=no; selector: dc=codfw,service=cxserver,cluster=scb,name=scb.*	[production]
09:42	<jijiki>	Depool scb* in codfw from serving cxserver, finishing its migration to k8s - T213195	[production]
09:29	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Slowly repool db1086 after mysql upgrade (duration: 00m 56s)	[production]
09:27	<moritzm>	rolling reboot of maps servers in codfw for kernel update	[production]